Berkeley Structural Genomics Center

Info for MP041

This gene is gi number 1673690 from Mycoplasma pneumonia.

Available sections:


Potential Target Homologues of MP041

No target homologues identified for this gene.

Back to Top


MP041 DNA sequence

ATGAATGTGTACCAAACGATTGCTAACTTTGGTACTAACATTTTGGAATCGCTAGGACTAGAGGCAGCACTCTCCCCAAC
TTTAAGTAATACCTATATCTATGGGATTCCGGTGTTAGGGGCAATCTTTTCTGGTTTGATTACCAAAAAGTTAACTAAAT
CAACCGCTAAAAGTCTGATTGTGCAAGCCTTGTTTGTCATTTTGGGTACCTTAGCCTTTTTAGTAATAACACTAGCTTCC
CCAGCCAAACCAGAAACTGGTGCATTTAACCAAGCTACCTTGTGAATAGCATTTGTGGTGATTTGCTTCATCATGTTCTT
TATTGGCGCGAACCGTTCCATTTTTTGGAGTACGATCACAGAACTGAAAGTCAATAAGGAAATAGTAGGACTGGCAGTTG
GTTTTATTTCGATTATTGGTTTTTCCAAAGATGTGTGGTTAAGTCCCTTACTGACTGGTACTACTAACCAGTTCATTGTC
AAAAACAGTCAGGGTACCTCGTTTTACAGCCAGCAAGCCTTGGTAGCCTGAGCCATTTTTGCCCTAATTAATGCTTGTTT
AGCACTACTAGTGACTTACATGATAGTGCGCAAAGTGAAGTATGGCAAAGTGTGAGTTAACCCTAAATTTAAAAAAGTCT
TTGCACTAGGAGAACAACAATATGGACACTAA

Back to Top


MP041 Codon Usage

(224 codons)
fields: [triplet] [frequency: per thousand] ([number])

UUU62.5(  14)  UCU4.5(   1)  UAU17.9(   4)  UGU4.5(   1)  
UUC13.4(   3)  UCC17.9(   4)  UAC13.4(   3)  UGC4.5(   1)  
UUA35.7(   8)  UCA4.5(   1)  UAA4.5(   1)  UGA13.4(   3)  
UUG26.8(   6)  UCG13.4(   3)  UAG0.0(   0)  UGG8.9(   2)  

CUU0.0(   0)  CCU4.5(   1)  CAU0.0(   0)  CGU4.5(   1)  
CUC4.5(   1)  CCC4.5(   1)  CAC4.5(   1)  CGC4.5(   1)  
CUA31.2(   7)  CCA13.4(   3)  CAA26.8(   6)  CGA0.0(   0)  
CUG17.9(   4)  CCG4.5(   1)  CAG13.4(   3)  CGG0.0(   0)  

AUU67.0(  15)  ACU35.7(   8)  AAU17.9(   4)  AGU22.3(   5)  
AUC17.9(   4)  ACC26.8(   6)  AAC31.2(   7)  AGC4.5(   1)  
AUA17.9(   4)  ACA8.9(   2)  AAA53.6(  12)  AGA0.0(   0)  
AUG13.4(   3)  ACG8.9(   2)  AAG13.4(   3)  AGG0.0(   0)  

GUU8.9(   2)  GCU22.3(   5)  GAU4.5(   1)  GGU35.7(   8)  
GUC17.9(   4)  GCC31.2(   7)  GAC0.0(   0)  GGC8.9(   2)  
GUA13.4(   3)  GCA35.7(   8)  GAA22.3(   5)  GGA17.9(   4)  
GUG44.6(  10)  GCG4.5(   1)  GAG4.5(   1)  GGG8.9(   2)  

Back to Top


MP041 Translation

Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")

atgaatgtgtaccaaacgattgctaactttggtactaacattttggaatcgctaggacta
 M  N  V  Y  Q  T  I  A  N  F  G  T  N  I  L  E  S  L  G  L 
gaggcagcactctccccaactttaagtaatacctatatctatgggattccggtgttaggg
 E  A  A  L  S  P  T  L  S  N  T  Y  I  Y  G  I  P  V  L  G 
gcaatcttttctggtttgattaccaaaaagttaactaaatcaaccgctaaaagtctgatt
 A  I  F  S  G  L  I  T  K  K  L  T  K  S  T  A  K  S  L  I 
gtgcaagccttgtttgtcattttgggtaccttagcctttttagtaataacactagcttcc
 V  Q  A  L  F  V  I  L  G  T  L  A  F  L  V  I  T  L  A  S 
ccagccaaaccagaaactggtgcatttaaccaagctaccttgtgaatagcatttgtggtg
 P  A  K  P  E  T  G  A  F  N  Q  A  T  L  *  I  A  F  V  V 
atttgcttcatcatgttctttattggcgcgaaccgttccattttttggagtacgatcaca
 I  C  F  I  M  F  F  I  G  A  N  R  S  I  F  W  S  T  I  T 
gaactgaaagtcaataaggaaatagtaggactggcagttggttttatttcgattattggt
 E  L  K  V  N  K  E  I  V  G  L  A  V  G  F  I  S  I  I  G 
ttttccaaagatgtgtggttaagtcccttactgactggtactactaaccagttcattgtc
 F  S  K  D  V  W  L  S  P  L  L  T  G  T  T  N  Q  F  I  V 
aaaaacagtcagggtacctcgttttacagccagcaagccttggtagcctgagccattttt
 K  N  S  Q  G  T  S  F  Y  S  Q  Q  A  L  V  A  *  A  I  F 
gccctaattaatgcttgtttagcactactagtgacttacatgatagtgcgcaaagtgaag
 A  L  I  N  A  C  L  A  L  L  V  T  Y  M  I  V  R  K  V  K 
tatggcaaagtgtgagttaaccctaaatttaaaaaagtctttgcactaggagaacaacaa
 Y  G  K  V  *  V  N  P  K  F  K  K  V  F  A  L  G  E  Q  Q 
tatggacactaa
 Y  G  H  * 

Back to Top


MP041 AA Sequence

MNVYQTIANFGTNILESLGLEAALSPTLSNTYIYGIPVLGAIFSGLITKKLTKSTAKSLIVQALFVILGTLAFLVITLAS
PAKPETGAFNQATLWIAFVVICFIMFFIGANRSIFWSTITELKVNKEIVGLAVGFISIIGFSKDVWLSPLLTGTTNQFIV
KNSQGTSFYSQQALVAWAIFALINACLALLVTYMIVRKVKYGKVWVNPKFKKVFALGEQQYGH

Back to Top


MP041 Protein Parameters

This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.

Number of amino acids: 223

Molecular weight: 24405.9

Theoretical pI: 9.70

Amino acid composition:

Ala (A)  21	  9.4%
Arg (R)   2	  0.9%
Asn (N)  11	  4.9%
Asp (D)   1	  0.4%
Cys (C)   2	  0.9%
Gln (Q)   9	  4.0%
Glu (E)   6	  2.7%
Gly (G)  16	  7.2%
His (H)   1	  0.4%
Ile (I)  23	 10.3%
Leu (L)  26	 11.7%
Lys (K)  15	  6.7%
Met (M)   3	  1.3%
Phe (F)  17	  7.6%
Pro (P)   6	  2.7%
Ser (S)  15	  6.7%
Thr (T)  18	  8.1%
Trp (W)   5	  2.2%
Tyr (Y)   7	  3.1%
Val (V)  19	  8.5%

Asx (B)   0	  0.0%
Glx (Z)   0	  0.0%
Xaa (X)   0	  0.0%

Total number of negatively charged residues (Asp + Glu): 7
Total number of positively charged residues (Arg + Lys): 17

Atomic composition:

Carbon      C	      1154
Hydrogen    H	      1807
Nitrogen    N	       271
Oxygen      O	       298
Sulfur      S	         5

Formula: C1154H1807N271O298S5
Total number of atoms: 3535

Extinction coefficients:

Conditions: 6.0 M guanidium hydrochloride
            0.02 M phosphate buffer
            pH 6.5

Extinction coefficients are in units of  M-1 cm-1 .

The first table lists values computed assuming ALL Cys 
residues appear as half cystines, whereas the second table 
assumes that NONE do. 

                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    37295   37927   37835   37530   36520
Abs 0.1% (=1 g/l)   1.528   1.554   1.550   1.538   1.496



                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    37150   37800   37715   37410   36400
Abs 0.1% (=1 g/l)   1.522   1.549   1.545   1.533   1.491


Estimated half-life:

The N-terminal of the sequence considered is M (Met).

The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
                            >20 hours (yeast, in vivo).
                            >10 hours (Escherichia coli, in vivo).


Instability index:

The instability index (II) is computed to be 22.60
This classifies the protein as stable.



Aliphatic index: 119.82

Grand average of hydropathicity (GRAVY): 0.712

Back to Top


Predicted properties of MP041

ModelPercentage of MP Gene
Coiled Coil (CCP)0
Disordered (SEG)7
Transmembrane (PHDhtm)55
Transmembrane (TMHMM)49
Homologous to known structure (PSIBLAST)0

Back to Top


MP041 Prediction Details

Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from
SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary

                 10        20        30        40        50        60        70        80
                  |         |         |         |         |         |         |         |
Sequence MNVYQTIANFGTNILESLGLEAALSPTLSNTYIYGIPVLGAIFSGLITKKLTKSTAKSLIVQALFVILGTLAFLVITLAS
CCP      --------------------------------------------------------------------------------
SEG      ---------------------------------------------DDDDDDDDDDDDDDD--------------------
PHDhtm   ----------------------------HHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHH
TMHMM    ---------------------------HHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHH--
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ---EEEH-HHHHHHHHHH------------EEEE---HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHEHEEE---

                 90       100       110       120       130       140       150       160
                  |         |         |         |         |         |         |         |
Sequence PAKPETGAFNQATLWIAFVVICFIMFFIGANRSIFWSTITELKVNKEIVGLAVGFISIIGFSKDVWLSPLLTGTTNQFIV
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   -----------HHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHH------
TMHMM    ------------HHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHHHHHHHHHHHH----------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary --------HHHHHHHHHHHHHHHHEEH-----EEEEEEE-EEE----EEEEEEEEEEEEE-----EEEEE------EEEE

                170       180       190       200       210       220
                  |         |         |         |         |         |
Sequence KNSQGTSFYSQQALVAWAIFALINACLALLVTYMIVRKVKYGKVWVNPKFKKVFALGEQQYGH
CCP      ---------------------------------------------------------------
SEG      ---------------------------------------------------------------
PHDhtm   -----------HHHHHHHHHHHHHHHHHHHHHHHHHHH-------------------------
TMHMM    -------------HHHHHHHHHHHHHHHHHHHHHHH---------------------------
PSIBLAST ---------------------------------------------------------------
Pred2ary E------HHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEEEEEEEEE----EEEEE--------

Back to Top