Berkeley Structural Genomics Center

Info for MP066

This gene is gi number 1673717 from Mycoplasma pneumonia.

Available sections:


Potential Target Homologues of MP066

Targetlog10(PSIBLAST E)% identical% coverage (of MP gene)Latest Status
1112B-39.7020.6295.31Suggested work stop (64705 64705 64705 64705 64705 64705)
1132B-32.5216.5278.47Suggested work stop (64705 64705 64705 64705 66344 66344 66344)
1134B-36.7018.9979.29Suggested work stop (64705 64705)
1136B-40.4018.8280.00Suggested work stop (64705 64705)
1137B-26.7013.7476.38Suggested work stop (64705 64705 64705 64705 64705 64705)
1159B-34.0516.1579.12Suggested work stop (64705 64705 64705 64705)
1307B-19.1013.1994.74Suggested work stop (64705 64705 64705 64705)
1322B-39.4015.2278.82In PDB (1YF2)
1335B-37.7015.7774.74Override suggest work stop (Stop unless crystallization is working (if precipitated -> OS))
1360B-26.0097.0192.41Override suggest work stop (Stop unless crystallization is working (if precipitated -> OS))
1361B-25.0068.5376.47Suggested work stop (64705 64705)
1372B-39.1011.2779.95Suggested work stop (64705 64705 64705 64705 64705 64705)
1379B-45.1019.2483.33Override suggest work stop (Continue, but check again when 1322B is solved.)
1450T-13.5226.4491.58Suggested work stop (64705 64705 64705 64705 64705)
1451T-13.7020.2598.75Suggested work stop (64705 64705)
1521T-12.3050.0092.31Suggested work stop (64705 64705)
1526T-18.2219.6789.71Override suggest work stop (false positive)
1527T-14.1523.3099.04Suggested work stop (64705 64705)
1528T-19.1016.5699.39Override suggest work stop (a false positive)
1543T-16.0021.9997.24Suggested work stop (64705 64705 66344 66344)
1545T-17.0034.9171.14Suggested work stop (64705 64705)
1548T-17.0035.8571.62Suggested work stop (64705 64705)
1549T-17.5215.7298.15Suggested work stop (64705 64705 64705 64705)
1555T-16.0028.0098.04Suggested work stop (64705 64705 64705 64705)
1556T-15.7016.2297.37Suggested work stop (64705 64705 64705 64705)
1558T-17.0018.3098.71Suggested work stop (64705 64705 64705 64705)
1559T-15.0521.78100.00Suggested work stop (64705 64705 64705 64705 64705)
1560T-16.5220.1398.76Suggested work stop (64705 64705 64705 64705)
1564T-17.0027.3671.14Suggested work stop (64705 64705 64705 64705)
1565T-16.0518.47100.00Suggested work stop (64705 64705 64705 64705 64705)
1566T-15.2223.00100.00Suggested work stop (64705 64705 64705 64705)
1631T-14.7020.1498.63Suggested work stop (64705 64705)
1632T-16.5220.0072.41Suggested work stop (64705 64705)
1652T-19.52100.00100.00Suggested work stop (64705 64705)
1686T-15.7080.3985.00Override suggest work stop (false positive)
1694T-27.0077.70100.00Suggested work stop (64705 64705 64705)
1695T-20.0081.25100.00Suggested work stop (64705 64705)
1735T-25.30100.00100.00Suggested work stop (64705 64705)
1761T-14.3020.8398.63Suggested work stop (64705 64705 64705 64705 64705 64705 66344 66344 66344 66344 66344 66344)
1762T-15.1515.7473.47Suggested work stop (64705 64705)
1795T-20.0099.00100.00Suggested work stop (64705 64705)
1809T-13.5281.6387.50Work stopped (uncertainty in predicting domain boundaries)

Back to Top


MP066 DNA sequence

ATGGGCAGAATCAAAACTTACGATTTTGATGGTGAATATGTTACCTGAACTACACGTTGAAGTTATGCCGGTAGCATTTA
CTATCGAAATGGTAAATTTAGTGCTTCCTCAAATTGTGGAATTTTAAAAGTTTTAAATAAGGAAATAAATCCTAAATTTT
TAGCTTATGCTTTAAAAAAAGAAGCAAAAAAATTTGTTAATACAACTTCTGCTATTCCTATCCTTAGAACGCAAAAAGTT
GTCGAAATTCCCATTGATTTCCCACCCTTGCAAATTCAAGAAAAAATTGCTACTATCTTAGATACTTTTACCGAGCTAAG
CGCCGAGCTAAGCGCCGAGCTAAGCGCCGAGCTAAGCGCCGAGCTAAGCGCCGAGCTAAGAGAGCGAAAGAAGCAGTATG
CCTTCTATCGCGATTATTTACTTAACCTAAAAAATTGAAAAGAGGAAAATAAATATTACAAGCTGGGAGAAATAGCTCAA
AAAGTTTTAGTAGGTGGTGAAAAGCCTGCTGATTTTAGTAAGGAAAAGAACGAAGTTTATAAATATCCAATTCTTTCTAA
CAACAGTAAAGCTGAGGAATTTTTGGTTTACAGTAAAACTTTTCGAGTTGAAGAAAAATCCATTACGGTAAGTGCTAGAG
GCACAATTGGTGCCGTTTTCTATCGTGATTTTGCTTATTTGCCTGCTGTTAGTTTAATTTGCTTTGTACCTAAAGAAGAA
TTTGATATAAGGTTTTTATTCCACGCATTAAGAGCCATTAAATTTAAAAAGCAAGGTTCTGCAACAGGCCAGTTAACAGT
GGCACAATTTAAGGAGTATGGTATTCATGTACCTTCGCTGAAAAAACAGAAAGAAATCGCTGCTATCTTAGATCCACTTT
ACAGCTTTTTTACTGATCTAAACGAAGGTATTCCTGCTGAAATTGAACTGCGCAAAAAGCAATTGGATTACTATCAGAAC
TTCTTATTTAATTGAGTGCAAAACCAAAAAGCAGCATCAATCCTGTAA

Back to Top


MP066 Codon Usage

(336 codons)
fields: [triplet] [frequency: per thousand] ([number])

UUU50.6(  17)  UCU8.9(   3)  UAU41.7(  14)  UGU3.0(   1)  
UUC14.9(   5)  UCC6.0(   2)  UAC17.9(   6)  UGC3.0(   1)  
UUA38.7(  13)  UCA6.0(   2)  UAA3.0(   1)  UGA11.9(   4)  
UUG11.9(   4)  UCG3.0(   1)  UAG0.0(   0)  UGG0.0(   0)  

CUU11.9(   4)  CCU20.8(   7)  CAU3.0(   1)  CGU6.0(   2)  
CUC0.0(   0)  CCC6.0(   2)  CAC3.0(   1)  CGC6.0(   2)  
CUA23.8(   8)  CCA8.9(   3)  CAA26.8(   9)  CGA8.9(   3)  
CUG11.9(   4)  CCG0.0(   0)  CAG11.9(   4)  CGG0.0(   0)  

AUU44.6(  15)  ACU20.8(   7)  AAU23.8(   8)  AGU20.8(   7)  
AUC17.9(   6)  ACC6.0(   2)  AAC20.8(   7)  AGC20.8(   7)  
AUA8.9(   3)  ACA14.9(   5)  AAA77.4(  26)  AGA14.9(   5)  
AUG3.0(   1)  ACG6.0(   2)  AAG29.8(  10)  AGG3.0(   1)  

GUU29.8(  10)  GCU41.7(  14)  GAU32.7(  11)  GGU26.8(   9)  
GUC3.0(   1)  GCC26.8(   9)  GAC0.0(   0)  GGC8.9(   3)  
GUA11.9(   4)  GCA17.9(   6)  GAA56.5(  19)  GGA6.0(   2)  
GUG6.0(   2)  GCG0.0(   0)  GAG29.8(  10)  GGG0.0(   0)  

Back to Top


MP066 Translation

Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")

atgggcagaatcaaaacttacgattttgatggtgaatatgttacctgaactacacgttga
 M  G  R  I  K  T  Y  D  F  D  G  E  Y  V  T  *  T  T  R  * 
agttatgccggtagcatttactatcgaaatggtaaatttagtgcttcctcaaattgtgga
 S  Y  A  G  S  I  Y  Y  R  N  G  K  F  S  A  S  S  N  C  G 
attttaaaagttttaaataaggaaataaatcctaaatttttagcttatgctttaaaaaaa
 I  L  K  V  L  N  K  E  I  N  P  K  F  L  A  Y  A  L  K  K 
gaagcaaaaaaatttgttaatacaacttctgctattcctatccttagaacgcaaaaagtt
 E  A  K  K  F  V  N  T  T  S  A  I  P  I  L  R  T  Q  K  V 
gtcgaaattcccattgatttcccacccttgcaaattcaagaaaaaattgctactatctta
 V  E  I  P  I  D  F  P  P  L  Q  I  Q  E  K  I  A  T  I  L 
gatacttttaccgagctaagcgccgagctaagcgccgagctaagcgccgagctaagcgcc
 D  T  F  T  E  L  S  A  E  L  S  A  E  L  S  A  E  L  S  A 
gagctaagcgccgagctaagagagcgaaagaagcagtatgccttctatcgcgattattta
 E  L  S  A  E  L  R  E  R  K  K  Q  Y  A  F  Y  R  D  Y  L 
cttaacctaaaaaattgaaaagaggaaaataaatattacaagctgggagaaatagctcaa
 L  N  L  K  N  *  K  E  E  N  K  Y  Y  K  L  G  E  I  A  Q 
aaagttttagtaggtggtgaaaagcctgctgattttagtaaggaaaagaacgaagtttat
 K  V  L  V  G  G  E  K  P  A  D  F  S  K  E  K  N  E  V  Y 
aaatatccaattctttctaacaacagtaaagctgaggaatttttggtttacagtaaaact
 K  Y  P  I  L  S  N  N  S  K  A  E  E  F  L  V  Y  S  K  T 
tttcgagttgaagaaaaatccattacggtaagtgctagaggcacaattggtgccgttttc
 F  R  V  E  E  K  S  I  T  V  S  A  R  G  T  I  G  A  V  F 
tatcgtgattttgcttatttgcctgctgttagtttaatttgctttgtacctaaagaagaa
 Y  R  D  F  A  Y  L  P  A  V  S  L  I  C  F  V  P  K  E  E 
tttgatataaggtttttattccacgcattaagagccattaaatttaaaaagcaaggttct
 F  D  I  R  F  L  F  H  A  L  R  A  I  K  F  K  K  Q  G  S 
gcaacaggccagttaacagtggcacaatttaaggagtatggtattcatgtaccttcgctg
 A  T  G  Q  L  T  V  A  Q  F  K  E  Y  G  I  H  V  P  S  L 
aaaaaacagaaagaaatcgctgctatcttagatccactttacagcttttttactgatcta
 K  K  Q  K  E  I  A  A  I  L  D  P  L  Y  S  F  F  T  D  L 
aacgaaggtattcctgctgaaattgaactgcgcaaaaagcaattggattactatcagaac
 N  E  G  I  P  A  E  I  E  L  R  K  K  Q  L  D  Y  Y  Q  N 
ttcttatttaattgagtgcaaaaccaaaaagcagcatcaatcctgtaa
 F  L  F  N  *  V  Q  N  Q  K  A  A  S  I  L  * 

Back to Top


MP066 AA Sequence

MGRIKTYDFDGEYVTWTTRWSYAGSIYYRNGKFSASSNCGILKVLNKEINPKFLAYALKKEAKKFVNTTSAIPILRTQKV
VEIPIDFPPLQIQEKIATILDTFTELSAELSAELSAELSAELSAELRERKKQYAFYRDYLLNLKNWKEENKYYKLGEIAQ
KVLVGGEKPADFSKEKNEVYKYPILSNNSKAEEFLVYSKTFRVEEKSITVSARGTIGAVFYRDFAYLPAVSLICFVPKEE
FDIRFLFHALRAIKFKKQGSATGQLTVAQFKEYGIHVPSLKKQKEIAAILDPLYSFFTDLNEGIPAEIELRKKQLDYYQN
FLFNWVQNQKAASIL

Back to Top


MP066 Protein Parameters

This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.

Number of amino acids: 335

Molecular weight: 38602.4

Theoretical pI: 9.14

Amino acid composition:

Ala (A)  29	  8.7%
Arg (R)  13	  3.9%
Asn (N)  15	  4.5%
Asp (D)  11	  3.3%
Cys (C)   2	  0.6%
Gln (Q)  13	  3.9%
Glu (E)  29	  8.7%
Gly (G)  14	  4.2%
His (H)   2	  0.6%
Ile (I)  24	  7.2%
Leu (L)  33	  9.9%
Lys (K)  36	 10.7%
Met (M)   1	  0.3%
Phe (F)  22	  6.6%
Pro (P)  12	  3.6%
Ser (S)  22	  6.6%
Thr (T)  16	  4.8%
Trp (W)   4	  1.2%
Tyr (Y)  20	  6.0%
Val (V)  17	  5.1%

Asx (B)   0	  0.0%
Glx (Z)   0	  0.0%
Xaa (X)   0	  0.0%

Total number of negatively charged residues (Asp + Glu): 40
Total number of positively charged residues (Arg + Lys): 49

Atomic composition:

Carbon      C	      1785
Hydrogen    H	      2766
Nitrogen    N	       446
Oxygen      O	       502
Sulfur      S	         3

Formula: C1785H2766N446O502S3
Total number of atoms: 5502

Extinction coefficients:

Conditions: 6.0 M guanidium hydrochloride
            0.02 M phosphate buffer
            pH 6.5

Extinction coefficients are in units of  M-1 cm-1 .

The first table lists values computed assuming ALL Cys 
residues appear as half cystines, whereas the second table 
assumes that NONE do. 

                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    50745   50527   49660   48480   46520
Abs 0.1% (=1 g/l)   1.315   1.309   1.286   1.256   1.205



                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    50600   50400   49540   48360   46400
Abs 0.1% (=1 g/l)   1.311   1.306   1.283   1.253   1.202


Estimated half-life:

The N-terminal of the sequence considered is M (Met).

The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
                            >20 hours (yeast, in vivo).
                            >10 hours (Escherichia coli, in vivo).


Instability index:

The instability index (II) is computed to be 31.26
This classifies the protein as stable.



Aliphatic index: 89.73

Grand average of hydropathicity (GRAVY): -0.301

Back to Top


Predicted properties of MP066

ModelPercentage of MP Gene
Coiled Coil (CCP)0
Disordered (SEG)7
Transmembrane (PHDhtm)0
Transmembrane (TMHMM)0
Homologous to known structure (PSIBLAST)0

Back to Top


MP066 Prediction Details

Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from
SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary

                 10        20        30        40        50        60        70        80
                  |         |         |         |         |         |         |         |
Sequence MGRIKTYDFDGEYVTWTTRWSYAGSIYYRNGKFSASSNCGILKVLNKEINPKFLAYALKKEAKKFVNTTSAIPILRTQKV
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ------------EEEEEE-----EEEEEE---EE----EEEEEE-----HHHHHHHHHHHHHHHH---------------

                 90       100       110       120       130       140       150       160
                  |         |         |         |         |         |         |         |
Sequence VEIPIDFPPLQIQEKIATILDTFTELSAELSAELSAELSAELSAELRERKKQYAFYRDYLLNLKNWKEENKYYKLGEIAQ
CCP      --------------------------------------------------------------------------------
SEG      ------------------------DDDDDDDDDDDDDDDDDDDDDD----------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary --EE-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------HHHHHHHHHHH

                170       180       190       200       210       220       230       240
                  |         |         |         |         |         |         |         |
Sequence KVLVGGEKPADFSKEKNEVYKYPILSNNSKAEEFLVYSKTFRVEEKSITVSARGTIGAVFYRDFAYLPAVSLICFVPKEE
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary HH---------------------EE---------------------EEEEEE-----EEEEE---EE----EEEEE----

                250       260       270       280       290       300       310       320
                  |         |         |         |         |         |         |         |
Sequence FDIRFLFHALRAIKFKKQGSATGQLTVAQFKEYGIHVPSLKKQKEIAAILDPLYSFFTDLNEGIPAEIELRKKQLDYYQN
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary HHHHHHHHHHHHHHH-----------------EE-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH

                330
                  |
Sequence FLFNWVQNQKAASIL
CCP      ---------------
SEG      ---------------
PHDhtm   ---------------
TMHMM    ---------------
PSIBLAST ---------------
Pred2ary HH----HHHHHH---

Back to Top