Berkeley Structural Genomics Center

Info for MP070

This gene is gi number 1673723 from Mycoplasma pneumoniae.


Potential Target Homologues of MP070

Target   log10(PSIBLAST E)   % identical   % coverage (of MP gene)   Latest Status
1981B    -7.05               31.43         100.00                    Expression tested
1984B    -8.05               38.54         64.00                     Expression tested



MP070 DNA sequence

ATGCAATTAACTACAAGTCAACAAATGCAAACTAACTTTCTTACTACAAATAAACATCACTTATTGAAATACACCAACGG
TTTGATTTGGTGTTGGTGATTATTTGTTATTTCTTTAGTTTTAGCTTCATCGACATTTAGAGGTTTCTTTCTCGGCACTA
TCAATATTGTGAATTTTGTGTTTTGGATATTAGCACTGATTTTTGGTGTAGCTGTCGCTTTCATTAATGGTGTTTTGAGT
AGTGAATTAAAAGAAAATTCAGTTTTTCAAGAAGAACAAAAACGATTCTTCTTGGGCTTTTTCTTTCCACAAATGGCTTT
CTGTAATGCGCTTTGACTGAAACTTAAATTAAGCTATCTAAATTCTGAAAGAGAAAACTTATTAGAAAAAATAAAGCAAA
AACTAAAGAAGCTTACCTTGTCTGTTTTTGTTGTTTGAGGAATTTATTGTGTTTTAGCAACCTCAATTTATTTACCAAAT
GCATTAAGAATTTTAAACATTTATCAAATACCTAACTTAATCGCTTTAATAAATAATAGATTGAGTGAAATTTTGCCAGA
TGGTAACCGTTATTTTTTAGGTCATTCTTCATTCGCTTTTCATTACTATGAAATAGTTTCAAGGATTCCTTTTTTAGTTT
TCTTTATTATTCCAACAATAACCTTAATTACTTTAGGTTGCTATTTATTCGCTTACTTAAGGTTTATAAACTCAAATAAG
TTAAGAAAACCACTTTCTACTTTATCAATAGTTATCATGCTGACTGATGTTGTGGGAATAATTCAATGAATAATTATTGA
TATCTTATTAATTTGGCTTAATGTACCCTTTGTTATATTTGTAATTTTTTGAGTAATAAAACTTGTTTTGCCTTTAGCAA
TGATAGGTACGTTTGTCAGCTCCTTAACTATTTACAAAAAAGTAACAAGCAAAGAATGATTAGCAATAAAAGAAGAACAA
ATTAATTTAACGACAATGAATATTAATATTAATATGGGAGAGCAATCATCCAAAAACATGAATTCTTTTGAAAATCATGA
ATCAAATGAACGAAATAGTCTTCAAATTTATCAACAACATTCATCTATGATGTCAGAAACAAAGAGAAAACAAAGTAGCT
TAAGTTATGATGCCAGAATCCTTTTACCTAAAAGTCCCTATAACACAAAGAAAACTTTATTTTTAATTATTTTCTTTTCC
ATTATCTCCTTGATATTGGCAACTATAGGTAGTGTGTTCATTTCTTTTGCTATAGTTCAAATTTCTATACCTTTTTACGT
TATAGGTGGAGTTATTTGATTTTTTACTTTTATTTCTTTATAG



MP070 Codon Usage

(441 codons)
fields: [triplet] [frequency: per thousand] ([number])

UUU 63.5(  28)  UCU 22.7(  10)  UAU 22.7(  10)  UGU  6.8(   3)
UUC 24.9(  11)  UCC  9.1(   4)  UAC 11.3(   5)  UGC  2.3(   1)
UUA 79.4(  35)  UCA 24.9(  11)  UAA  0.0(   0)  UGA 15.9(   7)
UUG 22.7(  10)  UCG  2.3(   1)  UAG  2.3(   1)  UGG  9.1(   4)

CUU 20.4(   9)  CCU 11.3(   5)  CAU 11.3(   5)  CGU  2.3(   1)
CUC  2.3(   1)  CCC  4.5(   2)  CAC  2.3(   1)  CGC  0.0(   0)
CUA  4.5(   2)  CCA 11.3(   5)  CAA 38.5(  17)  CGA  4.5(   2)
CUG  6.8(   3)  CCG  0.0(   0)  CAG  0.0(   0)  CGG  0.0(   0)

AUU 70.3(  31)  ACU 24.9(  11)  AAU 45.4(  20)  AGU 20.4(   9)
AUC 13.6(   6)  ACC  9.1(   4)  AAC 20.4(   9)  AGC  9.1(   4)
AUA 43.1(  19)  ACA 18.1(   8)  AAA 40.8(  18)  AGA 15.9(   7)
AUG 22.7(  10)  ACG  4.5(   2)  AAG 13.6(   6)  AGG  4.5(   2)

GUU 38.5(  17)  GCU 18.1(   8)  GAU  9.1(   4)  GGU 22.7(  10)
GUC  4.5(   2)  GCC  2.3(   1)  GAC  0.0(   0)  GGC  4.5(   2)
GUA 11.3(   5)  GCA 13.6(   6)  GAA 36.3(  16)  GGA  9.1(   4)
GUG  9.1(   4)  GCG  2.3(   1)  GAG  2.3(   1)  GGG  0.0(   0)
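
The per-thousand frequencies in this table are consistent with count / total codons x 1000 (for example, 28 / 441 x 1000 = 63.5 for UUU). A minimal Python sketch of that calculation follows; the function name codon_usage is hypothetical and is not the tool that produced the table.

```python
from collections import Counter

def codon_usage(dna):
    """Return {codon: (frequency per thousand, count)} for an in-frame DNA string.

    Hypothetical helper: codons are reported in RNA notation (U instead of T),
    and any trailing partial codon is ignored.
    """
    rna = dna.upper().replace("T", "U")
    usable = len(rna) - len(rna) % 3
    codons = [rna[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(codons)
    total = len(codons)
    return {c: (round(1000 * n / total, 1), n) for c, n in counts.items()}

# Toy input, not the MP070 gene: AUG, UUU, UUU, UAG
usage = codon_usage("ATGTTTTTTTAG")
print(usage["UUU"])  # (500.0, 2)

# Spot check of the formula against two table rows (441 codons in total)
print(round(1000 * 28 / 441, 1))  # 63.5
print(round(1000 * 35 / 441, 1))  # 79.4
```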



MP070 Translation

Note: This translation was generated using the standard genetic code. In Mycoplasma, UGA encodes tryptophan rather than a stop, so UGA codons in MP genes show up here as terminators ("*").

atgcaattaactacaagtcaacaaatgcaaactaactttcttactacaaataaacatcac
 M  Q  L  T  T  S  Q  Q  M  Q  T  N  F  L  T  T  N  K  H  H 
ttattgaaatacaccaacggtttgatttggtgttggtgattatttgttatttctttagtt
 L  L  K  Y  T  N  G  L  I  W  C  W  *  L  F  V  I  S  L  V 
ttagcttcatcgacatttagaggtttctttctcggcactatcaatattgtgaattttgtg
 L  A  S  S  T  F  R  G  F  F  L  G  T  I  N  I  V  N  F  V 
ttttggatattagcactgatttttggtgtagctgtcgctttcattaatggtgttttgagt
 F  W  I  L  A  L  I  F  G  V  A  V  A  F  I  N  G  V  L  S 
agtgaattaaaagaaaattcagtttttcaagaagaacaaaaacgattcttcttgggcttt
 S  E  L  K  E  N  S  V  F  Q  E  E  Q  K  R  F  F  L  G  F 
ttctttccacaaatggctttctgtaatgcgctttgactgaaacttaaattaagctatcta
 F  F  P  Q  M  A  F  C  N  A  L  *  L  K  L  K  L  S  Y  L 
aattctgaaagagaaaacttattagaaaaaataaagcaaaaactaaagaagcttaccttg
 N  S  E  R  E  N  L  L  E  K  I  K  Q  K  L  K  K  L  T  L 
tctgtttttgttgtttgaggaatttattgtgttttagcaacctcaatttatttaccaaat
 S  V  F  V  V  *  G  I  Y  C  V  L  A  T  S  I  Y  L  P  N 
gcattaagaattttaaacatttatcaaatacctaacttaatcgctttaataaataataga
 A  L  R  I  L  N  I  Y  Q  I  P  N  L  I  A  L  I  N  N  R 
ttgagtgaaattttgccagatggtaaccgttattttttaggtcattcttcattcgctttt
 L  S  E  I  L  P  D  G  N  R  Y  F  L  G  H  S  S  F  A  F 
cattactatgaaatagtttcaaggattccttttttagttttctttattattccaacaata
 H  Y  Y  E  I  V  S  R  I  P  F  L  V  F  F  I  I  P  T  I 
accttaattactttaggttgctatttattcgcttacttaaggtttataaactcaaataag
 T  L  I  T  L  G  C  Y  L  F  A  Y  L  R  F  I  N  S  N  K 
ttaagaaaaccactttctactttatcaatagttatcatgctgactgatgttgtgggaata
 L  R  K  P  L  S  T  L  S  I  V  I  M  L  T  D  V  V  G  I 
attcaatgaataattattgatatcttattaatttggcttaatgtaccctttgttatattt
 I  Q  *  I  I  I  D  I  L  L  I  W  L  N  V  P  F  V  I  F 
gtaattttttgagtaataaaacttgttttgcctttagcaatgataggtacgtttgtcagc
 V  I  F  *  V  I  K  L  V  L  P  L  A  M  I  G  T  F  V  S 
tccttaactatttacaaaaaagtaacaagcaaagaatgattagcaataaaagaagaacaa
 S  L  T  I  Y  K  K  V  T  S  K  E  *  L  A  I  K  E  E  Q 
attaatttaacgacaatgaatattaatattaatatgggagagcaatcatccaaaaacatg
 I  N  L  T  T  M  N  I  N  I  N  M  G  E  Q  S  S  K  N  M 
aattcttttgaaaatcatgaatcaaatgaacgaaatagtcttcaaatttatcaacaacat
 N  S  F  E  N  H  E  S  N  E  R  N  S  L  Q  I  Y  Q  Q  H 
tcatctatgatgtcagaaacaaagagaaaacaaagtagcttaagttatgatgccagaatc
 S  S  M  M  S  E  T  K  R  K  Q  S  S  L  S  Y  D  A  R  I 
cttttacctaaaagtccctataacacaaagaaaactttatttttaattattttcttttcc
 L  L  P  K  S  P  Y  N  T  K  K  T  L  F  L  I  I  F  F  S 
attatctccttgatattggcaactataggtagtgtgttcatttcttttgctatagttcaa
 I  I  S  L  I  L  A  T  I  G  S  V  F  I  S  F  A  I  V  Q 
atttctatacctttttacgttataggtggagttatttgattttttacttttatttcttta
 I  S  I  P  F  Y  V  I  G  G  V  I  *  F  F  T  F  I  S  L 
tag
 * 
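
The internal "*" symbols above all fall on TGA codons, and the AA sequence section shows W at those positions. As an illustration only, the sketch below translates a short fragment of this gene twice: once with the standard code and once with TGA reassigned to Trp (the Mycoplasma code, NCBI translation table 4). The helper names codon_table and translate are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def codon_table(amino_acids):
    """Map each of the 64 codons (TCAG order) to a one-letter amino acid."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, amino_acids))

TABLE_1 = codon_table(STANDARD)      # standard code
TABLE_4 = dict(TABLE_1, TGA="W")     # same code, but TGA reads as Trp

def translate(dna, table):
    """Translate an in-frame DNA string; a trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))

# Fragment taken from the second row of the translation above
fragment = "tggtgttggtgattattt"
print(translate(fragment, TABLE_1))  # WCW*LF
print(translate(fragment, TABLE_4))  # WCWWLF
```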



MP070 AA Sequence

MQLTTSQQMQTNFLTTNKHHLLKYTNGLIWCWWLFVISLVLASSTFRGFFLGTINIVNFVFWILALIFGVAVAFINGVLS
SELKENSVFQEEQKRFFLGFFFPQMAFCNALWLKLKLSYLNSERENLLEKIKQKLKKLTLSVFVVWGIYCVLATSIYLPN
ALRILNIYQIPNLIALINNRLSEILPDGNRYFLGHSSFAFHYYEIVSRIPFLVFFIIPTITLITLGCYLFAYLRFINSNK
LRKPLSTLSIVIMLTDVVGIIQWIIIDILLIWLNVPFVIFVIFWVIKLVLPLAMIGTFVSSLTIYKKVTSKEWLAIKEEQ
INLTTMNININMGEQSSKNMNSFENHESNERNSLQIYQQHSSMMSETKRKQSSLSYDARILLPKSPYNTKKTLFLIIFFS
IISLILATIGSVFISFAIVQISIPFYVIGGVIWFFTFISL



MP070 Protein Parameters

This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.

Number of amino acids: 440

Molecular weight: 50935.5

Theoretical pI: 9.58

Amino acid composition:

Ala (A)  16	  3.6%
Arg (R)  12	  2.7%
Asn (N)  29	  6.6%
Asp (D)   4	  0.9%
Cys (C)   4	  0.9%
Gln (Q)  17	  3.9%
Glu (E)  17	  3.9%
Gly (G)  16	  3.6%
His (H)   6	  1.4%
Ile (I)  56	 12.7%
Leu (L)  60	 13.6%
Lys (K)  24	  5.5%
Met (M)  10	  2.3%
Phe (F)  39	  8.9%
Pro (P)  12	  2.7%
Ser (S)  39	  8.9%
Thr (T)  25	  5.7%
Trp (W)  11	  2.5%
Tyr (Y)  15	  3.4%
Val (V)  28	  6.4%

Asx (B)   0	  0.0%
Glx (Z)   0	  0.0%
Xaa (X)   0	  0.0%

Total number of negatively charged residues (Asp + Glu): 21
Total number of positively charged residues (Arg + Lys): 36

Atomic composition:

Carbon      C	      2416
Hydrogen    H	      3741
Nitrogen    N	       569
Oxygen      O	       608
Sulfur      S	        14

Formula: C2416H3741N569O608S14
Total number of atoms: 7348

Extinction coefficients:

Conditions: 6.0 M guanidinium hydrochloride
            0.02 M phosphate buffer
            pH 6.5

Extinction coefficients are in units of  M-1 cm-1 .

The first table lists values computed assuming ALL Cys 
residues appear as half cystines, whereas the second table 
assumes that NONE do. 

                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    81440   82854   82675   82030   79840
Abs 0.1% (=1 g/l)   1.599   1.627   1.623   1.610   1.567



                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    81150   82600   82435   81790   79600
Abs 0.1% (=1 g/l)   1.593   1.622   1.618   1.606   1.563
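
The 280 nm columns of both tables can be reproduced from the residue counts listed above (15 Tyr, 11 Trp, 4 Cys), assuming per-residue molar extinction values of 1280 for Tyr, 5690 for Trp and 120 per cystine. Those three constants are an assumption chosen because they match the tabulated numbers; the sketch below is not the ExPASy implementation, and the function name is hypothetical.

```python
# Assumed per-residue molar extinction values at 280 nm (M-1 cm-1)
EXT_TYR = 1280
EXT_TRP = 5690
EXT_CYSTINE = 120

def extinction_280(n_tyr, n_trp, n_cystine=0):
    """Estimate the molar extinction coefficient at 280 nm from residue counts."""
    return EXT_TYR * n_tyr + EXT_TRP * n_trp + EXT_CYSTINE * n_cystine

MOL_WEIGHT = 50935.5  # molecular weight reported above

# 4 Cys residues can form at most 2 cystines
all_half_cystines = extinction_280(15, 11, n_cystine=2)
no_cystines = extinction_280(15, 11, n_cystine=0)

print(all_half_cystines, round(all_half_cystines / MOL_WEIGHT, 3))  # 82030 1.61
print(no_cystines, round(no_cystines / MOL_WEIGHT, 3))              # 81790 1.606
```

Dividing the molar coefficient by the molecular weight gives the absorbance of a 1 g/l solution, which is the "Abs 0.1%" row.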


Estimated half-life:

The N-terminal residue of the sequence considered is M (Met).

The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
                            >20 hours (yeast, in vivo).
                            >10 hours (Escherichia coli, in vivo).


Instability index:

The instability index (II) is computed to be 37.56
This classifies the protein as stable.



Aliphatic index: 124.91

Grand average of hydropathicity (GRAVY): 0.590
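
Both values can be recomputed from the amino acid composition listed above. The sketch below assumes the usual definitions: the aliphatic index as X(Ala) + 2.9 X(Val) + 3.9 (X(Ile) + X(Leu)), with X in mole percent, and GRAVY as the mean Kyte-Doolittle hydropathy over all residues. The variable names are hypothetical, and the code illustrates the arithmetic rather than the ExPASy tool itself.

```python
# Residue counts copied from the amino acid composition table above
COMPOSITION = {
    "A": 16, "R": 12, "N": 29, "D": 4, "C": 4, "Q": 17, "E": 17, "G": 16,
    "H": 6, "I": 56, "L": 60, "K": 24, "M": 10, "F": 39, "P": 12, "S": 39,
    "T": 25, "W": 11, "Y": 15, "V": 28,
}

# Kyte-Doolittle hydropathy scale
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5,
    "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9,
    "M": 1.9, "F": 2.8, "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9,
    "Y": -1.3, "V": 4.2,
}

total = sum(COMPOSITION.values())  # 440 residues

def mole_percent(residue):
    """Percentage of all residues that are the given amino acid."""
    return 100 * COMPOSITION[residue] / total

aliphatic_index = (
    mole_percent("A")
    + 2.9 * mole_percent("V")
    + 3.9 * (mole_percent("I") + mole_percent("L"))
)

gravy = sum(HYDROPATHY[aa] * n for aa, n in COMPOSITION.items()) / total

print(total)                      # 440
print(round(aliphatic_index, 2))  # 124.91
print(round(gravy, 3))            # 0.59
```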



Predicted properties of MP070

Model                                       Percentage of MP Gene
Coiled Coil (CCP)                           6
Disordered (SEG)                            18
Transmembrane (PHDhtm)                      57
Transmembrane (TMHMM)                       45
Homologous to known structure (PSIBLAST)    0
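
These percentages appear to be the share of residues flagged by each predictor, rounded to a whole number. That reading is an assumption, not something the page states. Under it, a per-residue track such as those in the Prediction Details section can be summarized as follows; the function name and the example track are hypothetical.

```python
def percent_flagged(track, blank="-"):
    """Percentage of positions in a prediction track that are not blank."""
    flagged = sum(1 for ch in track if ch != blank)
    return round(100 * flagged / len(track))

# Hypothetical 440-residue track with a single 26-residue predicted segment
example_track = "-" * 36 + "C" * 26 + "-" * 378
print(len(example_track))              # 440
print(percent_flagged(example_track))  # 6
```

To apply this to a real track, the 80-column rows for one predictor would first be joined into a single 440-character string.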



MP070 Prediction Details

Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary

                 10        20        30        40        50        60        70        80
                  |         |         |         |         |         |         |         |
Sequence MQLTTSQQMQTNFLTTNKHHLLKYTNGLIWCWWLFVISLVLASSTFRGFFLGTINIVNFVFWILALIFGVAVAFINGVLS
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-
TMHMM    ----------------------------HHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHH
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ------HHHHHH-------HHHH-----HEHHHHHHHHHHH-------EEE--EEEHHHHHHHHHHHHHHHHHHHH----

                 90       100       110       120       130       140       150       160
                  |         |         |         |         |         |         |         |
Sequence SELKENSVFQEEQKRFFLGFFFPQMAFCNALWLKLKLSYLNSERENLLEKIKQKLKKLTLSVFVVWGIYCVLATSIYLPN
CCP      ------------------------------------CCCCCCCCCCCCCCCCCCCCCCCCCC------------------
SEG      ----------------------------------------------DDDDDDDDDDDDDD--------------------
PHDhtm   --------------------HHHHHHHHHHHHHH--------------------------HHHHHHHHHHHHHHHHHHHH
TMHMM    --------------HHHHHHHHHHHHHHHHHHH--------------------------HHHHHHHHHHHHHHHHHHHHH
PSIBLAST --------------------------------------------------------------------------------
Pred2ary --------HHHHHHHHHH---HHHHHHHHHHHHHH------HHHHHHHHHHHHHH-----EEEEEEEEEEEEEEE---HH

                170       180       190       200       210       220       230       240
                  |         |         |         |         |         |         |         |
Sequence ALRILNIYQIPNLIALINNRLSEILPDGNRYFLGHSSFAFHYYEIVSRIPFLVFFIIPTITLITLGCYLFAYLRFINSNK
CCP      --------------------------------------------------------------------------------
SEG      ------------------------------------------------DDDDDDDDDDDDDDDDD---------------
PHDhtm   HHHHHHHHHHHHHHHHH--------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----
TMHMM    HH----------------------------------------------HHHHHHHHHHHHHHHHHHHHHHH---------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary HHHHHH-----HHHHHHH------------EEE----EEEEEEEEE----EEEEEEE---EEEHH-HHHHHHHHHH----

                250       260       270       280       290       300       310       320
                  |         |         |         |         |         |         |         |
Sequence LRKPLSTLSIVIMLTDVVGIIQWIIIDILLIWLNVPFVIFVIFWVIKLVLPLAMIGTFVSSLTIYKKVTSKEWLAIKEEQ
CCP      --------------------------------------------------------------------------------
SEG      -------------------DDDDDDDDDDDDDD---DDDDDDDDDD----------------------------------
PHDhtm   ------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------------
TMHMM    ---HHHHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHHHHHHH-----------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary --------EEEEEE--HHHHHHHHHHHHHHHHH---EEEEEEHHHHHHH----HH--EEEEEEEEEE---HHHHHHHHH-

                330       340       350       360       370       380       390       400
                  |         |         |         |         |         |         |         |
Sequence INLTTMNININMGEQSSKNMNSFENHESNERNSLQIYQQHSSMMSETKRKQSSLSYDARILLPKSPYNTKKTLFLIIFFS
CCP      --------------------------------------------------------------------------------
SEG      DDDDDDDDDDDD------------------------------------------------------------DDDDDDDD
PHDhtm   -----------------------------------------------------------------------HHHHHHHHH
TMHMM    ------------------------------------------------------------------------HHHHHHHH
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ---EEEEEE-------------------HHHHHHHHHH---HHHHHHHH---------EEE----------EEEEEEHHH

                410       420       430       440
                  |         |         |         |
Sequence IISLILATIGSVFISFAIVQISIPFYVIGGVIWFFTFISL
CCP      ----------------------------------------
SEG      DDDDDD----------------------------------
PHDhtm   HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-
TMHMM    HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH--
PSIBLAST ----------------------------------------
Pred2ary HHHHHHHH---EEEEEEEEEEEE-EEEE--EEEEEEEE--
