Berkeley Structural Genomics Center

Info for MP050

This gene is gi number 1673700 from Mycoplasma pneumonia.

Available sections:


Potential Target Homologues of MP050

Targetlog10(PSIBLAST E)% identical% coverage (of MP gene)Latest Status
1076B-9.5245.5736.57Solubility tested (S)
1139B-8.7027.4576.69Suggested work stop (77186 77186)
1142B-7.2262.7543.97Work stopped (Homology to 1636T)
1143B-5.0024.5656.44Override suggest work stop (phasing diffraction data)
1155B-15.1542.0690.68HSQC
1178B-17.5225.7461.96Solubility tested (S)
1179B-7.0040.0050.00Purified
1181B-7.5240.0050.00Purified
1184B-8.7058.9055.73Suggested work stop (77186 77186)
1192B-16.7044.0962.84Purified
1246B-15.3039.3652.51Suggested work stop (21701 21701 77186 77186)
1247B-9.3033.9646.49Override suggest work stop (a false positive)
1252B-24.2251.1135.02Work on hold
1357B-6.0032.2649.21NMR characterization
1636T-6.7057.3871.76In PDB (2BA2)
1653T-6.5234.7494.06Suggested work stop (77186 77186 77186 78218 78218)
1655T-6.0034.7495.96Work stopped (uncertainty in predicting domain boundaries)
1656T-8.1035.7886.51Work stopped (uncertainty in predicting domain boundaries)
1657T-10.0533.3373.88Suggested work stop (77186 77186)
1658T-12.7026.0479.34Work stopped (uncertainty in predicting domain boundaries)
1659T-12.1540.2273.02Override suggest work stop (No sequence homology to the known crystal structures )
1663T-10.0030.6993.52Suggested work stop (77186 77186 77186 78218 78218)
1681T-10.0070.2774.00Work stopped (uncertainty in predicting domain boundaries)
1687T-7.7027.5086.33Override suggest work stop (false positive)
1691T-8.5238.7193.00Work stopped (uncertainty in predicting domain boundaries)
1728T-6.7041.7798.75Work stopped (uncertainty in predicting domain boundaries)
1745T-7.3043.3795.40Work stopped (uncertainty in predicting domain boundaries)

Back to Top


MP050 DNA sequence

ATGGTAGAACTTGATGGTAAGTTTGCTACTAAAGCTGATTTAAAACGGGTTGAAGATAAAGTTGATGTTTTGTTTGAATT
GCAAAAAACGCAAGGTGAACAAATCAAGGTTCAAGGAAAACAAATCGAACAGTTAACTGAAACTGTTCAAAAACAGGGCG
AACAAATAAAGGAACTTCAAGTTCAGGTAAAAGCCCAAGGTGAAGAAATAAAAGAAATTAAAGTTGAACAAAAAGCTCAA
GGCCAAACCTTGCAGTTAATTCTGAAAGCACTTGAAGGAATAAACAAGCGACTAGATAATTTGGAATCGAAATAA

Back to Top


MP050 Codon Usage

(105 codons)
fields: [triplet] [frequency: per thousand] ([number])

UUU19.0(   2)  UCU0.0(   0)  UAU0.0(   0)  UGU0.0(   0)  
UUC0.0(   0)  UCC0.0(   0)  UAC0.0(   0)  UGC0.0(   0)  
UUA28.6(   3)  UCA0.0(   0)  UAA9.5(   1)  UGA0.0(   0)  
UUG38.1(   4)  UCG9.5(   1)  UAG0.0(   0)  UGG0.0(   0)  

CUU28.6(   3)  CCU0.0(   0)  CAU0.0(   0)  CGU0.0(   0)  
CUC0.0(   0)  CCC0.0(   0)  CAC0.0(   0)  CGC0.0(   0)  
CUA9.5(   1)  CCA0.0(   0)  CAA114.3(  12)  CGA9.5(   1)  
CUG9.5(   1)  CCG0.0(   0)  CAG38.1(   4)  CGG9.5(   1)  

AUU19.0(   2)  ACU28.6(   3)  AAU9.5(   1)  AGU0.0(   0)  
AUC19.0(   2)  ACC9.5(   1)  AAC9.5(   1)  AGC0.0(   0)  
AUA28.6(   3)  ACA0.0(   0)  AAA114.3(  12)  AGA0.0(   0)  
AUG9.5(   1)  ACG9.5(   1)  AAG38.1(   4)  AGG0.0(   0)  

GUU66.7(   7)  GCU28.6(   3)  GAU47.6(   5)  GGU28.6(   3)  
GUC0.0(   0)  GCC9.5(   1)  GAC0.0(   0)  GGC19.0(   2)  
GUA19.0(   2)  GCA9.5(   1)  GAA133.3(  14)  GGA19.0(   2)  
GUG0.0(   0)  GCG0.0(   0)  GAG0.0(   0)  GGG0.0(   0)  

Back to Top


MP050 Translation

Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")

atggtagaacttgatggtaagtttgctactaaagctgatttaaaacgggttgaagataaa
 M  V  E  L  D  G  K  F  A  T  K  A  D  L  K  R  V  E  D  K 
gttgatgttttgtttgaattgcaaaaaacgcaaggtgaacaaatcaaggttcaaggaaaa
 V  D  V  L  F  E  L  Q  K  T  Q  G  E  Q  I  K  V  Q  G  K 
caaatcgaacagttaactgaaactgttcaaaaacagggcgaacaaataaaggaacttcaa
 Q  I  E  Q  L  T  E  T  V  Q  K  Q  G  E  Q  I  K  E  L  Q 
gttcaggtaaaagcccaaggtgaagaaataaaagaaattaaagttgaacaaaaagctcaa
 V  Q  V  K  A  Q  G  E  E  I  K  E  I  K  V  E  Q  K  A  Q 
ggccaaaccttgcagttaattctgaaagcacttgaaggaataaacaagcgactagataat
 G  Q  T  L  Q  L  I  L  K  A  L  E  G  I  N  K  R  L  D  N 
ttggaatcgaaataa
 L  E  S  K  * 

Back to Top


MP050 AA Sequence

MVELDGKFATKADLKRVEDKVDVLFELQKTQGEQIKVQGKQIEQLTETVQKQGEQIKELQVQVKAQGEEIKEIKVEQKAQ
GQTLQLILKALEGINKRLDNLESK

Back to Top


MP050 Protein Parameters

This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.

Number of amino acids: 104

Molecular weight: 11857.6

Theoretical pI: 5.61

Amino acid composition:

Ala (A)   5	  4.8%
Arg (R)   2	  1.9%
Asn (N)   2	  1.9%
Asp (D)   5	  4.8%
Cys (C)   0	  0.0%
Gln (Q)  16	 15.4%
Glu (E)  14	 13.5%
Gly (G)   7	  6.7%
His (H)   0	  0.0%
Ile (I)   7	  6.7%
Leu (L)  12	 11.5%
Lys (K)  16	 15.4%
Met (M)   1	  1.0%
Phe (F)   2	  1.9%
Pro (P)   0	  0.0%
Ser (S)   1	  1.0%
Thr (T)   5	  4.8%
Trp (W)   0	  0.0%
Tyr (Y)   0	  0.0%
Val (V)   9	  8.7%

Asx (B)   0	  0.0%
Glx (Z)   0	  0.0%
Xaa (X)   0	  0.0%

Total number of negatively charged residues (Asp + Glu): 19
Total number of positively charged residues (Arg + Lys): 18

Atomic composition:

Carbon      C	       520
Hydrogen    H	       884
Nitrogen    N	       144
Oxygen      O	       167
Sulfur      S	         1

Formula: C520H884N144O167S1
Total number of atoms: 1716

Extinction coefficients:

As there are no Trp, Tyr or Cys in the region considered, your protein
should not be visible by UV spectrophotometry.


Estimated half-life:

The N-terminal of the sequence considered is M (Met).

The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
                            >20 hours (yeast, in vivo).
                            >10 hours (Escherichia coli, in vivo).


Instability index:

The instability index (II) is computed to be 40.79
This classifies the protein as unstable.



Aliphatic index: 101.15

Grand average of hydropathicity (GRAVY): -0.737

Back to Top


Predicted properties of MP050

ModelPercentage of MP Gene
Coiled Coil (CCP)41
Disordered (SEG)0
Transmembrane (PHDhtm)0
Transmembrane (TMHMM)0
Homologous to known structure (PSIBLAST)0

Back to Top


MP050 Prediction Details

Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from
SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary

                 10        20        30        40        50        60        70        80
                  |         |         |         |         |         |         |         |
Sequence MVELDGKFATKADLKRVEDKVDVLFELQKTQGEQIKVQGKQIEQLTETVQKQGEQIKELQVQVKAQGEEIKEIKVEQKAQ
CCP      ----------------------------------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary --------H-HHH-------HHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH

                 90       100
                  |         |
Sequence GQTLQLILKALEGINKRLDNLESK
CCP      ------------------------
SEG      ------------------------
PHDhtm   ------------------------
TMHMM    ------------------------
PSIBLAST ------------------------
Pred2ary HHHHHHHHHHHHHHHHHH------

Back to Top