Berkeley Structural Genomics Center

Info for MP005

This gene is gi number 1673650 from Mycoplasma pneumonia.

Available sections:


Potential Target Homologues of MP005

Targetlog10(PSIBLAST E)% identical% coverage (of MP gene)Latest Status
1390B-24.7043.0626.47Selected
1392B-78.0551.2592.49Selected
1411B-200.00100.00100.00Selected
1416B-30.7094.5216.40Selected
1419B-32.1038.7830.43Selected

Back to Top


MP005 DNA sequence

ATGGCTGTTGGTATTTTTATTCTTTCACTAAACCCCTCATACGAACTCGTGGACTGAAAGCGGGTGGGTGATACCAAGTT
GGTGGCGCTGGTCCGCTCAGCGTTGGTCAGGGTGAAGTTTAATGATGGGACGAGTTCGGATTCTAATAACCAAGATACGA
ATCAAAATGCCTTGAGTTTTGATACCCAAGAATCACAGAAGGCACTTAATGGCTCGCAGAGTGGATCTTCTGACACTTCC
GGGTCTAACTCCCAAGACTTCGCCAGCTACATCCTCATCTTCCAAGCCGCGCCCAGGGCCACGTGGGTGTTTGAACGGAA
GATTAAGTTAGAGTTGCCCTACGTTAAAAACGAATCGGGAGCAGGGGACTCCACCACAACTAACAGTGGCTCCCTCTACA
CCACCCTCCAAGACCTCCTCGTCGAACAACCCGTGACCCCTTACACGCCCAATGCGGGGTTAGCCCGGGTGAATGGGGCT
GCTCAGGATACGGTTCATTTTGGTTCGGGTCAAGAATCGAGTTGGAATTCCCAACGTTCCCAAAAAGTCCTTAAAAACAA
CCCCGGACCCAAAGCCGTCACCGGCTTTAAGCTCGATAAGGGCCGCGCGTACCGGAAGCTAAATGAAGCTTGACCGGTGT
ATGAACCCCTGGATTCGACCAAGGATGGCAAGGGGAAGGATGAGAGCTCTTGGAACAGTTCGGAAAAAACAACGGCGGAA
AGTGATGCCCCGTTGGTGGGATCAACCGGGAGTCAAATGGCTGCAGTCACAGACAGTCAGCAATCAGGTGATAACAATGG
ATTGGTATCGCTTGCCCAGCGCTCCACTACTGTTGCTGTCCAAAAAAGTGACTCTTCTGGTTCTCAAGGCCAAGGCACAA
CCGATAACAAATTCCAAAAATACCTCAACACCACCCAGGCCCTGCACCAGATGGGGGTGATAGTGCCATCTTTGGAAACG
TGGGCGGGGGAAAACAAGTACTGGAATCGCTACCCGTGCTGCTGGTGGTGTTTCAGTCCAAGCAGCGACCCGGCAATCGT
CTTCCACAAATGAAGATCTACCGAATGTAATCACCCAGTTATACCATACTTCAACCGCCCAACTCGCTTACTTAAATGGC
CAGATCGTTGTGATGGGTTCGAACGCGGTACCGAGTTTGTGGTATTGGGTCGTGGATGAGCGAACCACGTCCGGAAGGGG
TACTTGATGAGCCCACACCGAGTTGAACTGGGGCACCGACAAGCAGAAGGAGTTTGTGGAGAATCAGCTTGGTTTCAATG
AAACCAGCGCCACCGACTCGCATAA

Back to Top


MP005 Codon Usage

(435 codons)
fields: [triplet] [frequency: per thousand] ([number])

UUU18.4(   8)  UCU23.0(  10)  UAU2.3(   1)  UGU9.2(   4)  
UUC16.1(   7)  UCC16.1(   7)  UAC25.3(  11)  UGC4.6(   2)  
UUA6.9(   3)  UCA16.1(   7)  UAA2.3(   1)  UGA11.5(   5)  
UUG20.7(   9)  UCG18.4(   8)  UAG0.0(   0)  UGG20.7(   9)  

CUU11.5(   5)  CCU2.3(   1)  CAU2.3(   1)  CGU6.9(   3)  
CUC20.7(   9)  CCC18.4(   8)  CAC16.1(   7)  CGC18.4(   8)  
CUA4.6(   2)  CCA16.1(   7)  CAA41.4(  18)  CGA6.9(   3)  
CUG9.2(   4)  CCG9.2(   4)  CAG18.4(   8)  CGG11.5(   5)  

AUU6.9(   3)  ACU11.5(   5)  AAU27.6(  12)  AGU25.3(  11)  
AUC6.9(   3)  ACC32.2(  14)  AAC34.5(  15)  AGC11.5(   5)  
AUA4.6(   2)  ACA9.2(   4)  AAA23.0(  10)  AGA2.3(   1)  
AUG9.2(   4)  ACG16.1(   7)  AAG32.2(  14)  AGG4.6(   2)  

GUU16.1(   7)  GCU16.1(   7)  GAU34.5(  15)  GGU18.4(   8)  
GUC20.7(   9)  GCC20.7(   9)  GAC18.4(   8)  GGC16.1(   7)  
GUA4.6(   2)  GCA13.8(   6)  GAA39.1(  17)  GGA18.4(   8)  
GUG27.6(  12)  GCG18.4(   8)  GAG6.9(   3)  GGG27.6(  12)  

Back to Top


MP005 Translation

Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")

atggctgttggtatttttattctttcactaaacccctcatacgaactcgtggactgaaag
 M  A  V  G  I  F  I  L  S  L  N  P  S  Y  E  L  V  D  *  K 
cgggtgggtgataccaagttggtggcgctggtccgctcagcgttggtcagggtgaagttt
 R  V  G  D  T  K  L  V  A  L  V  R  S  A  L  V  R  V  K  F 
aatgatgggacgagttcggattctaataaccaagatacgaatcaaaatgccttgagtttt
 N  D  G  T  S  S  D  S  N  N  Q  D  T  N  Q  N  A  L  S  F 
gatacccaagaatcacagaaggcacttaatggctcgcagagtggatcttctgacacttcc
 D  T  Q  E  S  Q  K  A  L  N  G  S  Q  S  G  S  S  D  T  S 
gggtctaactcccaagacttcgccagctacatcctcatcttccaagccgcgcccagggcc
 G  S  N  S  Q  D  F  A  S  Y  I  L  I  F  Q  A  A  P  R  A 
acgtgggtgtttgaacggaagattaagttagagttgccctacgttaaaaacgaatcggga
 T  W  V  F  E  R  K  I  K  L  E  L  P  Y  V  K  N  E  S  G 
gcaggggactccaccacaactaacagtggctccctctacaccaccctccaagacctcctc
 A  G  D  S  T  T  T  N  S  G  S  L  Y  T  T  L  Q  D  L  L 
gtcgaacaacccgtgaccccttacacgcccaatgcggggttagcccgggtgaatggggct
 V  E  Q  P  V  T  P  Y  T  P  N  A  G  L  A  R  V  N  G  A 
gctcaggatacggttcattttggttcgggtcaagaatcgagttggaattcccaacgttcc
 A  Q  D  T  V  H  F  G  S  G  Q  E  S  S  W  N  S  Q  R  S 
caaaaagtccttaaaaacaaccccggacccaaagccgtcaccggctttaagctcgataag
 Q  K  V  L  K  N  N  P  G  P  K  A  V  T  G  F  K  L  D  K 
ggccgcgcgtaccggaagctaaatgaagcttgaccggtgtatgaacccctggattcgacc
 G  R  A  Y  R  K  L  N  E  A  *  P  V  Y  E  P  L  D  S  T 
aaggatggcaaggggaaggatgagagctcttggaacagttcggaaaaaacaacggcggaa
 K  D  G  K  G  K  D  E  S  S  W  N  S  S  E  K  T  T  A  E 
agtgatgccccgttggtgggatcaaccgggagtcaaatggctgcagtcacagacagtcag
 S  D  A  P  L  V  G  S  T  G  S  Q  M  A  A  V  T  D  S  Q 
caatcaggtgataacaatggattggtatcgcttgcccagcgctccactactgttgctgtc
 Q  S  G  D  N  N  G  L  V  S  L  A  Q  R  S  T  T  V  A  V 
caaaaaagtgactcttctggttctcaaggccaaggcacaaccgataacaaattccaaaaa
 Q  K  S  D  S  S  G  S  Q  G  Q  G  T  T  D  N  K  F  Q  K 
tacctcaacaccacccaggccctgcaccagatgggggtgatagtgccatctttggaaacg
 Y  L  N  T  T  Q  A  L  H  Q  M  G  V  I  V  P  S  L  E  T 
tgggcgggggaaaacaagtactggaatcgctacccgtgctgctggtggtgtttcagtcca
 W  A  G  E  N  K  Y  W  N  R  Y  P  C  C  W  W  C  F  S  P 
agcagcgacccggcaatcgtcttccacaaatgaagatctaccgaatgtaatcacccagtt
 S  S  D  P  A  I  V  F  H  K  *  R  S  T  E  C  N  H  P  V 
ataccatacttcaaccgcccaactcgcttacttaaatggccagatcgttgtgatgggttc
 I  P  Y  F  N  R  P  T  R  L  L  K  W  P  D  R  C  D  G  F 
gaacgcggtaccgagtttgtggtattgggtcgtggatgagcgaaccacgtccggaagggg
 E  R  G  T  E  F  V  V  L  G  R  G  *  A  N  H  V  R  K  G 
tacttgatgagcccacaccgagttgaactggggcaccgacaagcagaaggagtttgtgga
 Y  L  M  S  P  H  R  V  E  L  G  H  R  Q  A  E  G  V  C  G 
gaatcagcttggtttcaatgaaaccagcgccaccgactcgcataa
 E  S  A  W  F  Q  *  N  Q  R  H  R  L  A  * 

Back to Top


MP005 AA Sequence

MAVGIFILSLNPSYELVDWKRVGDTKLVALVRSALVRVKFNDGTSSDSNNQDTNQNALSFDTQESQKALNGSQSGSSDTS
GSNSQDFASYILIFQAAPRATWVFERKIKLELPYVKNESGAGDSTTTNSGSLYTTLQDLLVEQPVTPYTPNAGLARVNGA
AQDTVHFGSGQESSWNSQRSQKVLKNNPGPKAVTGFKLDKGRAYRKLNEAWPVYEPLDSTKDGKGKDESSWNSSEKTTAE
SDAPLVGSTGSQMAAVTDSQQSGDNNGLVSLAQRSTTVAVQKSDSSGSQGQGTTDNKFQKYLNTTQALHQMGVIVPSLET
WAGENKYWNRYPCCWWCFSPSSDPAIVFHKWRSTECNHPVIPYFNRPTRLLKWPDRCDGFERGTEFVVLGRGWANHVRKG
YLMSPHRVELGHRQAEGVCGESAWFQWNQRHRLA

Back to Top


MP005 Protein Parameters

This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.

Number of amino acids: 434

Molecular weight: 47970.1

Theoretical pI: 8.40

Amino acid composition:

Ala (A)  30	  6.9%
Arg (R)  22	  5.1%
Asn (N)  27	  6.2%
Asp (D)  23	  5.3%
Cys (C)   6	  1.4%
Gln (Q)  26	  6.0%
Glu (E)  20	  4.6%
Gly (G)  35	  8.1%
His (H)   8	  1.8%
Ile (I)   8	  1.8%
Leu (L)  32	  7.4%
Lys (K)  24	  5.5%
Met (M)   4	  0.9%
Phe (F)  15	  3.5%
Pro (P)  20	  4.6%
Ser (S)  48	 11.1%
Thr (T)  30	  6.9%
Trp (W)  14	  3.2%
Tyr (Y)  12	  2.8%
Val (V)  30	  6.9%

Asx (B)   0	  0.0%
Glx (Z)   0	  0.0%
Xaa (X)   0	  0.0%

Total number of negatively charged residues (Asp + Glu): 43
Total number of positively charged residues (Arg + Lys): 46

Atomic composition:

Carbon      C	      2103
Hydrogen    H	      3239
Nitrogen    N	       607
Oxygen      O	       664
Sulfur      S	        10

Formula: C2103H3239N607O664S10
Total number of atoms: 6623

Extinction coefficients:

Conditions: 6.0 M guanidium hydrochloride
            0.02 M phosphate buffer
            pH 6.5

Extinction coefficients are in units of  M-1 cm-1 .

The first table lists values computed assuming ALL Cys 
residues appear as half cystines, whereas the second table 
assumes that NONE do. 

                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    93435   95581   95740   95380   93160
Abs 0.1% (=1 g/l)   1.948   1.993   1.996   1.988   1.942



                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    93000   95200   95380   95020   92800
Abs 0.1% (=1 g/l)   1.939   1.985   1.988   1.981   1.935


Estimated half-life:

The N-terminal of the sequence considered is M (Met).

The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
                            >20 hours (yeast, in vivo).
                            >10 hours (Escherichia coli, in vivo).


Instability index:

The instability index (II) is computed to be 34.18
This classifies the protein as stable.



Aliphatic index: 62.90

Grand average of hydropathicity (GRAVY): -0.658

Back to Top


Predicted properties of MP005

ModelPercentage of MP Gene
Coiled Coil (CCP)0
Disordered (SEG)13
Transmembrane (PHDhtm)0
Transmembrane (TMHMM)0
Homologous to known structure (PSIBLAST)0

Back to Top


MP005 Prediction Details

Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from
SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary

                 10        20        30        40        50        60        70        80
                  |         |         |         |         |         |         |         |
Sequence MAVGIFILSLNPSYELVDWKRVGDTKLVALVRSALVRVKFNDGTSSDSNNQDTNQNALSFDTQESQKALNGSQSGSSDTS
CCP      --------------------------------------------------------------------------------
SEG      --------------------------DDDDDDDDDDDD--DDDDDDDDDDDDDDDD-------------DDDDDDDDDDD
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary -EEEEE---------EH-HHHH----EEEE------EEE----------------------HHHHHHH------------

                 90       100       110       120       130       140       150       160
                  |         |         |         |         |         |         |         |
Sequence GSNSQDFASYILIFQAAPRATWVFERKIKLELPYVKNESGAGDSTTTNSGSLYTTLQDLLVEQPVTPYTPNAGLARVNGA
CCP      --------------------------------------------------------------------------------
SEG      DDDDDD--------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary -------HHHHHHH------EEEEEEEEEE--------------------HHHHHHHHHH-----------HHHHHH---

                170       180       190       200       210       220       230       240
                  |         |         |         |         |         |         |         |
Sequence AQDTVHFGSGQESSWNSQRSQKVLKNNPGPKAVTGFKLDKGRAYRKLNEAWPVYEPLDSTKDGKGKDESSWNSSEKTTAE
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ---HE----------------EEE---------------HHHHHHHH---------------------------------

                250       260       270       280       290       300       310       320
                  |         |         |         |         |         |         |         |
Sequence SDAPLVGSTGSQMAAVTDSQQSGDNNGLVSLAQRSTTVAVQKSDSSGSQGQGTTDNKFQKYLNTTQALHQMGVIVPSLET
CCP      --------------------------------------------------------------------------------
SEG      ------------------------------------------DDDDDDDDDDDDD-------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ------------EEEEE----------EEEE-----EEEEEE----------------HHH-HHHHHHHH------EEEE

                330       340       350       360       370       380       390       400
                  |         |         |         |         |         |         |         |
Sequence WAGENKYWNRYPCCWWCFSPSSDPAIVFHKWRSTECNHPVIPYFNRPTRLLKWPDRCDGFERGTEFVVLGRGWANHVRKG
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary EE-----------------------EEEE----------------------------------EEEEEE-----------

                410       420       430
                  |         |         |
Sequence YLMSPHRVELGHRQAEGVCGESAWFQWNQRHRLA
CCP      ----------------------------------
SEG      ----------------------------------
PHDhtm   ----------------------------------
TMHMM    ----------------------------------
PSIBLAST ----------------------------------
Pred2ary ----------------------------------

Back to Top