Berkeley Structural Genomics Center

Info for MP014

This gene is gi number 1673660 from Mycoplasma pneumonia.

Available sections:


Potential Target Homologues of MP014

Targetlog10(PSIBLAST E)% identical% coverage (of MP gene)Latest Status
1047B-54.7015.8998.37Suggested work stop (7248 7248 7248 8230 8230 8230 8231 8231)
1435T-11.3031.9178.33Override suggest work stop (No sequence homology to the known crystal structures )
1484T-16.0042.31100.00Work stopped (uncertainty in predicting domain boundaries)
1485T-21.7022.99100.00Work stopped (uncertainty in predicting domain boundaries)
1502T-15.5245.2896.36Work stopped (uncertainty in predicting domain boundaries)
1607T-18.0060.7198.25Work stopped (uncertainty in predicting domain boundaries)
1608T-22.5258.8298.84Work stopped (uncertainty in predicting domain boundaries)
1730T-23.70100.00100.00Work stopped (uncertainty in predicting domain boundaries)
1731T-17.52100.00100.00Work stopped (uncertainty in predicting domain boundaries)
1847T-12.4038.0094.34Work stopped (uncertainty in predicting domain boundaries)
1852T-17.7038.89100.00Work stopped (uncertainty in predicting domain boundaries)
1861T-17.2237.7498.15Work stopped (uncertainty in predicting domain boundaries)
1864T-17.3037.7498.15Work stopped (uncertainty in predicting domain boundaries)
1868T-17.7032.7398.21Work stopped (uncertainty in predicting domain boundaries)
1915T-13.1537.5088.89Work stopped (uncertainty in predicting domain boundaries)
1916T-19.0519.1098.89Work stopped (uncertainty in predicting domain boundaries)
1929T-15.3035.0998.28Work stopped (uncertainty in predicting domain boundaries)

Back to Top


MP014 DNA sequence

ATGAATAGCCAAGTACACCGCAAGGGCTCTATTGCAGAAGCAGTTAGTGCCATCCAAGCTCATGATAAGATCGTGATCTT
CCACCACATTCGTCCTGATGGCGATTGTTTGGGCGCACAACACGGCTTAGCGCGTTTAATCCAAACTAACTTTCCCCACA
AGCAGGTCTTCTGTGTTGGTGATCCCAAACACAACTTTCCCTGATTGGAGATGGTTTTCACTCCAAAGGAACAGATTACC
CCGGAGTTAATGCAACAAGCCTTAGCCGTTATTGTTGATGCCAACTATAAGGAACGGATTGAGTGCCGGGACTTATTAGA
CCAAAACCAGTTTAAGGCAGTATTGCGGATTGACCACCACCCCAATGAGGACGATCTCAATACGACCCATAACTTCGTTG
ATGCGTCTTACATTGCCGCGGCTGAACAAGTGGTAGATCTAGCGGTGCAGGCCAAATGGAAGCTTAGCCCCCCAGCGGCT
ACGGCGCTGTATCTAGGTATTTATACAGATAGTAATAGGTTTCTATATAGTAATACATCATGAAGAACACTATATCTAGG
ATCTATGCTATATAGAGCTCAAGCTAATATAGCTAAGATCCATGATGAGTTAAACCACACTTCCTTAAAGGACATCCAGT
TTAAACAATATGTCTTTAAAAACTTTCAGACCTTTCAGAATGTTATTTACTTTGTGGCCGATAAGAAGTTCCAAAAGAAA
TTAAAGGTAACACCCTTAGAATGTGCACGGGTAAATATCCTAGCTAACATTGAACAATTCCACATTTGGCTGTTCTTTAT
AGAAGAGGGTAAGAACCACTATCGGGTCGAATTCCGTAGTAACGGAATTAACGTACGCGAAGTAGCTTTAAAGTATGGTG
GCGGGGGTCATATTCAGGCCAGCGGTGCAGTTCTTAAAAGCAAGCGCGACATAATTCGTGTAGTTCAAGATTGCCAAAAG
CAAATTGCTGTATAA

Back to Top


MP014 Codon Usage

(325 codons)
fields: [triplet] [frequency: per thousand] ([number])

UUU30.8(  10)  UCU9.2(   3)  UAU27.7(   9)  UGU9.2(   3)  
UUC24.6(   8)  UCC3.1(   1)  UAC6.2(   2)  UGC6.2(   2)  
UUA33.8(  11)  UCA3.1(   1)  UAA3.1(   1)  UGA6.2(   2)  
UUG9.2(   3)  UCG0.0(   0)  UAG0.0(   0)  UGG6.2(   2)  

CUU6.2(   2)  CCU3.1(   1)  CAU12.3(   4)  CGU12.3(   4)  
CUC3.1(   1)  CCC18.5(   6)  CAC33.8(  11)  CGC9.2(   3)  
CUA21.5(   7)  CCA6.2(   2)  CAA46.2(  15)  CGA0.0(   0)  
CUG6.2(   2)  CCG3.1(   1)  CAG24.6(   8)  CGG15.4(   5)  

AUU46.2(  15)  ACU9.2(   3)  AAU24.6(   8)  AGU12.3(   4)  
AUC21.5(   7)  ACC9.2(   3)  AAC33.8(  11)  AGC12.3(   4)  
AUA9.2(   3)  ACA12.3(   4)  AAA18.5(   6)  AGA6.2(   2)  
AUG12.3(   4)  ACG6.2(   2)  AAG52.3(  17)  AGG3.1(   1)  

GUU27.7(   9)  GCU27.7(   9)  GAU36.9(  12)  GGU18.5(   6)  
GUC9.2(   3)  GCC24.6(   8)  GAC18.5(   6)  GGC15.4(   5)  
GUA27.7(   9)  GCA18.5(   6)  GAA27.7(   9)  GGA6.2(   2)  
GUG12.3(   4)  GCG18.5(   6)  GAG18.5(   6)  GGG3.1(   1)  

Back to Top


MP014 Translation

Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")

atgaatagccaagtacaccgcaagggctctattgcagaagcagttagtgccatccaagct
 M  N  S  Q  V  H  R  K  G  S  I  A  E  A  V  S  A  I  Q  A 
catgataagatcgtgatcttccaccacattcgtcctgatggcgattgtttgggcgcacaa
 H  D  K  I  V  I  F  H  H  I  R  P  D  G  D  C  L  G  A  Q 
cacggcttagcgcgtttaatccaaactaactttccccacaagcaggtcttctgtgttggt
 H  G  L  A  R  L  I  Q  T  N  F  P  H  K  Q  V  F  C  V  G 
gatcccaaacacaactttccctgattggagatggttttcactccaaaggaacagattacc
 D  P  K  H  N  F  P  *  L  E  M  V  F  T  P  K  E  Q  I  T 
ccggagttaatgcaacaagccttagccgttattgttgatgccaactataaggaacggatt
 P  E  L  M  Q  Q  A  L  A  V  I  V  D  A  N  Y  K  E  R  I 
gagtgccgggacttattagaccaaaaccagtttaaggcagtattgcggattgaccaccac
 E  C  R  D  L  L  D  Q  N  Q  F  K  A  V  L  R  I  D  H  H 
cccaatgaggacgatctcaatacgacccataacttcgttgatgcgtcttacattgccgcg
 P  N  E  D  D  L  N  T  T  H  N  F  V  D  A  S  Y  I  A  A 
gctgaacaagtggtagatctagcggtgcaggccaaatggaagcttagccccccagcggct
 A  E  Q  V  V  D  L  A  V  Q  A  K  W  K  L  S  P  P  A  A 
acggcgctgtatctaggtatttatacagatagtaataggtttctatatagtaatacatca
 T  A  L  Y  L  G  I  Y  T  D  S  N  R  F  L  Y  S  N  T  S 
tgaagaacactatatctaggatctatgctatatagagctcaagctaatatagctaagatc
 *  R  T  L  Y  L  G  S  M  L  Y  R  A  Q  A  N  I  A  K  I 
catgatgagttaaaccacacttccttaaaggacatccagtttaaacaatatgtctttaaa
 H  D  E  L  N  H  T  S  L  K  D  I  Q  F  K  Q  Y  V  F  K 
aactttcagacctttcagaatgttatttactttgtggccgataagaagttccaaaagaaa
 N  F  Q  T  F  Q  N  V  I  Y  F  V  A  D  K  K  F  Q  K  K 
ttaaaggtaacacccttagaatgtgcacgggtaaatatcctagctaacattgaacaattc
 L  K  V  T  P  L  E  C  A  R  V  N  I  L  A  N  I  E  Q  F 
cacatttggctgttctttatagaagagggtaagaaccactatcgggtcgaattccgtagt
 H  I  W  L  F  F  I  E  E  G  K  N  H  Y  R  V  E  F  R  S 
aacggaattaacgtacgcgaagtagctttaaagtatggtggcgggggtcatattcaggcc
 N  G  I  N  V  R  E  V  A  L  K  Y  G  G  G  G  H  I  Q  A 
agcggtgcagttcttaaaagcaagcgcgacataattcgtgtagttcaagattgccaaaag
 S  G  A  V  L  K  S  K  R  D  I  I  R  V  V  Q  D  C  Q  K 
caaattgctgtataa
 Q  I  A  V  * 

Back to Top


MP014 AA Sequence

MNSQVHRKGSIAEAVSAIQAHDKIVIFHHIRPDGDCLGAQHGLARLIQTNFPHKQVFCVGDPKHNFPWLEMVFTPKEQIT
PELMQQALAVIVDANYKERIECRDLLDQNQFKAVLRIDHHPNEDDLNTTHNFVDASYIAAAEQVVDLAVQAKWKLSPPAA
TALYLGIYTDSNRFLYSNTSWRTLYLGSMLYRAQANIAKIHDELNHTSLKDIQFKQYVFKNFQTFQNVIYFVADKKFQKK
LKVTPLECARVNILANIEQFHIWLFFIEEGKNHYRVEFRSNGINVREVALKYGGGGHIQASGAVLKSKRDIIRVVQDCQK
QIAV

Back to Top


MP014 Protein Parameters

This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.

Number of amino acids: 324

Molecular weight: 37144.5

Theoretical pI: 8.81

Amino acid composition:

Ala (A)  29	  9.0%
Arg (R)  15	  4.6%
Asn (N)  19	  5.9%
Asp (D)  18	  5.6%
Cys (C)   5	  1.5%
Gln (Q)  23	  7.1%
Glu (E)  15	  4.6%
Gly (G)  14	  4.3%
His (H)  15	  4.6%
Ile (I)  25	  7.7%
Leu (L)  26	  8.0%
Lys (K)  23	  7.1%
Met (M)   4	  1.2%
Phe (F)  18	  5.6%
Pro (P)  10	  3.1%
Ser (S)  13	  4.0%
Thr (T)  12	  3.7%
Trp (W)   4	  1.2%
Tyr (Y)  11	  3.4%
Val (V)  25	  7.7%

Asx (B)   0	  0.0%
Glx (Z)   0	  0.0%
Xaa (X)   0	  0.0%

Total number of negatively charged residues (Asp + Glu): 33
Total number of positively charged residues (Arg + Lys): 38

Atomic composition:

Carbon      C	      1679
Hydrogen    H	      2610
Nitrogen    N	       468
Oxygen      O	       469
Sulfur      S	         9

Formula: C1679H2610N468O469S9
Total number of atoms: 5235

Extinction coefficients:

Conditions: 6.0 M guanidium hydrochloride
            0.02 M phosphate buffer
            pH 6.5

Extinction coefficients are in units of  M-1 cm-1 .

The first table lists values computed assuming ALL Cys 
residues appear as half cystines, whereas the second table 
assumes that NONE do. 

                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    37840   38054   37675   37080   35840
Abs 0.1% (=1 g/l)   1.019   1.024   1.014   0.998   0.965



                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    37550   37800   37435   36840   35600
Abs 0.1% (=1 g/l)   1.011   1.018   1.008   0.992   0.958


Estimated half-life:

The N-terminal of the sequence considered is M (Met).

The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
                            >20 hours (yeast, in vivo).
                            >10 hours (Escherichia coli, in vivo).


Instability index:

The instability index (II) is computed to be 30.86
This classifies the protein as stable.



Aliphatic index: 92.72

Grand average of hydropathicity (GRAVY): -0.269

Back to Top


Predicted properties of MP014

ModelPercentage of MP Gene
Coiled Coil (CCP)0
Disordered (SEG)0
Transmembrane (PHDhtm)0
Transmembrane (TMHMM)0
Homologous to known structure (PSIBLAST)72

Back to Top


MP014 Prediction Details

Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from
SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary

                 10        20        30        40        50        60        70        80
                  |         |         |         |         |         |         |         |
Sequence MNSQVHRKGSIAEAVSAIQAHDKIVIFHHIRPDGDCLGAQHGLARLIQTNFPHKQVFCVGDPKHNFPWLEMVFTPKEQIT
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST ----------------------3333333333333333333333333333333333333333333333333333333333
Pred2ary ---HHHHHHHHHHHHHHHHH---EEEEE-------HHHHHHHHHHHHHHH----EEEE----------------------

                 90       100       110       120       130       140       150       160
                  |         |         |         |         |         |         |         |
Sequence PELMQQALAVIVDANYKERIECRDLLDQNQFKAVLRIDHHPNEDDLNTTHNFVDASYIAAAEQVVDLAVQAKWKLSPPAA
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST 33333333333333333333333333333333333333333333333333333333333333333333333333333333
Pred2ary -------EEEEE-------HHHHHHHHH----EEEEEE------------EE-------HHHHHHHHHHHH-------HH

                170       180       190       200       210       220       230       240
                  |         |         |         |         |         |         |         |
Sequence TALYLGIYTDSNRFLYSNTSWRTLYLGSMLYRAQANIAKIHDELNHTSLKDIQFKQYVFKNFQTFQNVIYFVADKKFQKK
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST 33333333333333333333333333333333333333333333333333333333333333333333333333333333
Pred2ary HHHHHHHH-------------HHHHHHHHHHH----HHHHHHHHH----HHHHHHHHHHHHHHHH---EEEEE---HHHH

                250       260       270       280       290       300       310       320
                  |         |         |         |         |         |         |         |
Sequence LKVTPLECARVNILANIEQFHIWLFFIEEGKNHYRVEFRSNGINVREVALKYGGGGHIQASGAVLKSKRDIIRVVQDCQK
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST 3333333333333333----------------------------------------------------------------
Pred2ary -----HHHHHHHHHHHH----EEEEEEE----EEEEEE-----HHHHHHHHH-----HHHHH-----HHHHHHHHHHHHH

         
         
Sequence QIAV
CCP      ----
SEG      ----
PHDhtm   ----
TMHMM    ----
PSIBLAST ----
Pred2ary H---

Back to Top