Berkeley Structural Genomics Center

Info for MP043

This gene is gi number 1673692 from Mycoplasma pneumonia.

Available sections:


Potential Target Homologues of MP043

Targetlog10(PSIBLAST E)% identical% coverage (of MP gene)Latest Status
1012B-12.1014.6692.43Override suggest work stop (low sequencial homology to the known structures)
1330B-5.4013.2629.38Purified

Back to Top


MP043 DNA sequence

ATGAACAAAAATAGTGGTAAAGACTACGAATCGATTCAAGGTAAGATTAAGAGCAAAGCTGAAATAGAAGATTCAGTTAT
ATTGGATCAAGACACTTTTCAAGGCAAGGGTGAAGAAAAAACTAAATTAGATGAGTTGATTAGGTGAGAGGCCGAGAAAA
ATGACTTACTCAAGTTAATTGACAACGTCGGTAATGACGAAGTTTTCACTCCCGTAGAAACTTGTCAAAGAATGCTCGAT
GAGCTCTTTCCTGAAGATCATGAAGTTTGATCAAATCCCGATCTCAAGTGATTAAATCCGTGTGATAAAAACGGAGTTTT
CTTTCGCGAAATTGCGCTAAGACTTGACAAAGGATTAACCAAAATTATTCCCGACGAATATGCCCGCAAGAAGCATATTA
TGACCAAGATGCTTTTTAGCATTGGCTTAACTAAATTTACTAGTTTGATGGTGCGTCGCACGCTTTACTATTGCATTAAA
GCTAACAAGCGTAAAACCAGCGAAGATGAGGGTTGTGCAATTGCAAATGGCGCACGATTTAATAATGAATTTGGCAATGT
TGTTACCCCTTATAAAGAACACTATTTTGGAAAAGAATCGAAAAGCAAAAACTGTCACTTTTGTGGGACAAATAAGAATT
CTAAATATGTAAATTCTATGACTGAAGAAAAGTATGCCTATGACTTCATTCATTTAAATTCGGATGAATATGAAAACTAC
TTTAAAACTAATTTTGGTGTTATGAAATTTGATGTAATTATTGGAAATCCGCCTTACCAGCTATCTAATGGTAGCGGTTC
TGATGGCAATGGAGCCAAAGCAATTTTTCAAGATTTTGTTTTAAAAGCAATTGACTTAGAACCAAAATATTTAGCGATGA
TTATCCCAGCAAAATGAATGATTTCTGCTGAAAATATTTTTTTAAATCTAAGAGATAAATTAAAGAAAAATAAAGATATA
AAAGAAATTAATATTTTTTTCGACTCTAAAGACTGTTTTCCAAAGAGAGAAATAAAAGGTGGCATTTGTTACTTTATCTG
GCAGAATAACTATCAAGGTAAGACTCTTATAAATACCAAATTTTCAAAGAAAGGAAAAAGCACAGAGATCGATAGTTCTA
AAAGAAACTTATTTGTTTCCGTTTTAAACAACACTGAACAAATCATTTTTCGCAAAAGGATTTGACAAGACATTTACACA
AAGATTGATTCCATCTCTGCTAGTGAAATGAAGAACACACACACAGAGAGAGAGCAATTTAGAAGTTAG

Back to Top


MP043 Codon Usage

(423 codons)
fields: [triplet] [frequency: per thousand] ([number])

UUU52.0(  22)  UCU18.9(   8)  UAU23.6(  10)  UGU16.5(   7)  
UUC9.5(   4)  UCC4.7(   2)  UAC14.2(   6)  UGC2.4(   1)  
UUA33.1(  14)  UCA7.1(   3)  UAA0.0(   0)  UGA11.8(   5)  
UUG7.1(   3)  UCG7.1(   3)  UAG2.4(   1)  UGG2.4(   1)  

CUU9.5(   4)  CCU7.1(   3)  CAU7.1(   3)  CGU4.7(   2)  
CUC9.5(   4)  CCC7.1(   3)  CAC7.1(   3)  CGC9.5(   4)  
CUA7.1(   3)  CCA7.1(   3)  CAA21.3(   9)  CGA2.4(   1)  
CUG0.0(   0)  CCG4.7(   2)  CAG4.7(   2)  CGG0.0(   0)  

AUU61.5(  26)  ACU23.6(  10)  AAU54.4(  23)  AGU11.8(   5)  
AUC11.8(   5)  ACC11.8(   5)  AAC26.0(  11)  AGC14.2(   6)  
AUA11.8(   5)  ACA11.8(   5)  AAA80.4(  34)  AGA16.5(   7)  
AUG23.6(  10)  ACG2.4(   1)  AAG40.2(  17)  AGG4.7(   2)  

GUU23.6(  10)  GCU9.5(   4)  GAU37.8(  16)  GGU23.6(  10)  
GUC2.4(   1)  GCC9.5(   4)  GAC28.4(  12)  GGC14.2(   6)  
GUA7.1(   3)  GCA14.2(   6)  GAA59.1(  25)  GGA14.2(   6)  
GUG2.4(   1)  GCG4.7(   2)  GAG18.9(   8)  GGG2.4(   1)  

Back to Top


MP043 Translation

Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")

atgaacaaaaatagtggtaaagactacgaatcgattcaaggtaagattaagagcaaagct
 M  N  K  N  S  G  K  D  Y  E  S  I  Q  G  K  I  K  S  K  A 
gaaatagaagattcagttatattggatcaagacacttttcaaggcaagggtgaagaaaaa
 E  I  E  D  S  V  I  L  D  Q  D  T  F  Q  G  K  G  E  E  K 
actaaattagatgagttgattaggtgagaggccgagaaaaatgacttactcaagttaatt
 T  K  L  D  E  L  I  R  *  E  A  E  K  N  D  L  L  K  L  I 
gacaacgtcggtaatgacgaagttttcactcccgtagaaacttgtcaaagaatgctcgat
 D  N  V  G  N  D  E  V  F  T  P  V  E  T  C  Q  R  M  L  D 
gagctctttcctgaagatcatgaagtttgatcaaatcccgatctcaagtgattaaatccg
 E  L  F  P  E  D  H  E  V  *  S  N  P  D  L  K  *  L  N  P 
tgtgataaaaacggagttttctttcgcgaaattgcgctaagacttgacaaaggattaacc
 C  D  K  N  G  V  F  F  R  E  I  A  L  R  L  D  K  G  L  T 
aaaattattcccgacgaatatgcccgcaagaagcatattatgaccaagatgctttttagc
 K  I  I  P  D  E  Y  A  R  K  K  H  I  M  T  K  M  L  F  S 
attggcttaactaaatttactagtttgatggtgcgtcgcacgctttactattgcattaaa
 I  G  L  T  K  F  T  S  L  M  V  R  R  T  L  Y  Y  C  I  K 
gctaacaagcgtaaaaccagcgaagatgagggttgtgcaattgcaaatggcgcacgattt
 A  N  K  R  K  T  S  E  D  E  G  C  A  I  A  N  G  A  R  F 
aataatgaatttggcaatgttgttaccccttataaagaacactattttggaaaagaatcg
 N  N  E  F  G  N  V  V  T  P  Y  K  E  H  Y  F  G  K  E  S 
aaaagcaaaaactgtcacttttgtgggacaaataagaattctaaatatgtaaattctatg
 K  S  K  N  C  H  F  C  G  T  N  K  N  S  K  Y  V  N  S  M 
actgaagaaaagtatgcctatgacttcattcatttaaattcggatgaatatgaaaactac
 T  E  E  K  Y  A  Y  D  F  I  H  L  N  S  D  E  Y  E  N  Y 
tttaaaactaattttggtgttatgaaatttgatgtaattattggaaatccgccttaccag
 F  K  T  N  F  G  V  M  K  F  D  V  I  I  G  N  P  P  Y  Q 
ctatctaatggtagcggttctgatggcaatggagccaaagcaatttttcaagattttgtt
 L  S  N  G  S  G  S  D  G  N  G  A  K  A  I  F  Q  D  F  V 
ttaaaagcaattgacttagaaccaaaatatttagcgatgattatcccagcaaaatgaatg
 L  K  A  I  D  L  E  P  K  Y  L  A  M  I  I  P  A  K  *  M 
atttctgctgaaaatatttttttaaatctaagagataaattaaagaaaaataaagatata
 I  S  A  E  N  I  F  L  N  L  R  D  K  L  K  K  N  K  D  I 
aaagaaattaatatttttttcgactctaaagactgttttccaaagagagaaataaaaggt
 K  E  I  N  I  F  F  D  S  K  D  C  F  P  K  R  E  I  K  G 
ggcatttgttactttatctggcagaataactatcaaggtaagactcttataaataccaaa
 G  I  C  Y  F  I  W  Q  N  N  Y  Q  G  K  T  L  I  N  T  K 
ttttcaaagaaaggaaaaagcacagagatcgatagttctaaaagaaacttatttgtttcc
 F  S  K  K  G  K  S  T  E  I  D  S  S  K  R  N  L  F  V  S 
gttttaaacaacactgaacaaatcatttttcgcaaaaggatttgacaagacatttacaca
 V  L  N  N  T  E  Q  I  I  F  R  K  R  I  *  Q  D  I  Y  T 
aagattgattccatctctgctagtgaaatgaagaacacacacacagagagagagcaattt
 K  I  D  S  I  S  A  S  E  M  K  N  T  H  T  E  R  E  Q  F 
agaagttag
 R  S  * 

Back to Top


MP043 AA Sequence

MNKNSGKDYESIQGKIKSKAEIEDSVILDQDTFQGKGEEKTKLDELIRWEAEKNDLLKLIDNVGNDEVFTPVETCQRMLD
ELFPEDHEVWSNPDLKWLNPCDKNGVFFREIALRLDKGLTKIIPDEYARKKHIMTKMLFSIGLTKFTSLMVRRTLYYCIK
ANKRKTSEDEGCAIANGARFNNEFGNVVTPYKEHYFGKESKSKNCHFCGTNKNSKYVNSMTEEKYAYDFIHLNSDEYENY
FKTNFGVMKFDVIIGNPPYQLSNGSGSDGNGAKAIFQDFVLKAIDLEPKYLAMIIPAKWMISAENIFLNLRDKLKKNKDI
KEINIFFDSKDCFPKREIKGGICYFIWQNNYQGKTLINTKFSKKGKSTEIDSSKRNLFVSVLNNTEQIIFRKRIWQDIYT
KIDSISASEMKNTHTEREQFRS

Back to Top


MP043 Protein Parameters

This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.

Number of amino acids: 422

Molecular weight: 49061.9

Theoretical pI: 8.62

Amino acid composition:

Ala (A)  16	  3.8%
Arg (R)  16	  3.8%
Asn (N)  34	  8.1%
Asp (D)  28	  6.6%
Cys (C)   8	  1.9%
Gln (Q)  11	  2.6%
Glu (E)  33	  7.8%
Gly (G)  23	  5.5%
His (H)   6	  1.4%
Ile (I)  36	  8.5%
Leu (L)  28	  6.6%
Lys (K)  51	 12.1%
Met (M)  10	  2.4%
Phe (F)  26	  6.2%
Pro (P)  11	  2.6%
Ser (S)  27	  6.4%
Thr (T)  21	  5.0%
Trp (W)   6	  1.4%
Tyr (Y)  16	  3.8%
Val (V)  15	  3.6%

Asx (B)   0	  0.0%
Glx (Z)   0	  0.0%
Xaa (X)   0	  0.0%

Total number of negatively charged residues (Asp + Glu): 61
Total number of positively charged residues (Arg + Lys): 67

Atomic composition:

Carbon      C	      2197
Hydrogen    H	      3426
Nitrogen    N	       584
Oxygen      O	       654
Sulfur      S	        18

Formula: C2197H3426N584O654S18
Total number of atoms: 6879

Extinction coefficients:

Conditions: 6.0 M guanidium hydrochloride
            0.02 M phosphate buffer
            pH 6.5

Extinction coefficients are in units of  M-1 cm-1 .

The first table lists values computed assuming ALL Cys 
residues appear as half cystines, whereas the second table 
assumes that NONE do. 

                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    56180   56508   55960   55100   53280
Abs 0.1% (=1 g/l)   1.145   1.152   1.141   1.123   1.086



                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    55600   56000   55480   54620   52800
Abs 0.1% (=1 g/l)   1.133   1.141   1.131   1.113   1.076


Estimated half-life:

The N-terminal of the sequence considered is M (Met).

The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
                            >20 hours (yeast, in vivo).
                            >10 hours (Escherichia coli, in vivo).


Instability index:

The instability index (II) is computed to be 34.40
This classifies the protein as stable.



Aliphatic index: 73.25

Grand average of hydropathicity (GRAVY): -0.660

Back to Top


Predicted properties of MP043

ModelPercentage of MP Gene
Coiled Coil (CCP)0
Disordered (SEG)5
Transmembrane (PHDhtm)0
Transmembrane (TMHMM)0
Homologous to known structure (PSIBLAST)0

Back to Top


MP043 Prediction Details

Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from
SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary

                 10        20        30        40        50        60        70        80
                  |         |         |         |         |         |         |         |
Sequence MNKNSGKDYESIQGKIKSKAEIEDSVILDQDTFQGKGEEKTKLDELIRWEAEKNDLLKLIDNVGNDEVFTPVETCQRMLD
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ---------------EE---E----EEE-------------HHHHHHHHHH---HHHHHHH----------HHHHHHHHH

                 90       100       110       120       130       140       150       160
                  |         |         |         |         |         |         |         |
Sequence ELFPEDHEVWSNPDLKWLNPCDKNGVFFREIALRLDKGLTKIIPDEYARKKHIMTKMLFSIGLTKFTSLMVRRTLYYCIK
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary H------------------------HHHHHHHHHHHH-------HHHHHHHHHHHHHHEEE----HHHHHHHHHHHHHHH

                170       180       190       200       210       220       230       240
                  |         |         |         |         |         |         |         |
Sequence ANKRKTSEDEGCAIANGARFNNEFGNVVTPYKEHYFGKESKSKNCHFCGTNKNSKYVNSMTEEKYAYDFIHLNSDEYENY
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary H------------E--------------------EE---------EE-------EEEE-----------------HHHHH

                250       260       270       280       290       300       310       320
                  |         |         |         |         |         |         |         |
Sequence FKTNFGVMKFDVIIGNPPYQLSNGSGSDGNGAKAIFQDFVLKAIDLEPKYLAMIIPAKWMISAENIFLNLRDKLKKNKDI
CCP      --------------------------------------------------------------------------------
SEG      ------------------------------------------------------------------DDDDDDDDDDDDDD
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary HH----EEEEEEEE-------------------HHHHHHHHHHHH-----EEEEE--------HHHHHHHHHHHHHH---

                330       340       350       360       370       380       390       400
                  |         |         |         |         |         |         |         |
Sequence KEINIFFDSKDCFPKREIKGGICYFIWQNNYQGKTLINTKFSKKGKSTEIDSSKRNLFVSVLNNTEQIIFRKRIWQDIYT
CCP      --------------------------------------------------------------------------------
SEG      DDDDDDD-------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary EEEEEE---------------EEEEEEE------EEE------------H---H----HHHHHHHHHHHHHHHHHHHHHH

                410       420
                  |         |
Sequence KIDSISASEMKNTHTEREQFRS
CCP      ----------------------
SEG      ----------------------
PHDhtm   ----------------------
TMHMM    ----------------------
PSIBLAST ----------------------
Pred2ary ----------------------

Back to Top