Berkeley Structural Genomics Center

Info for MP074

This gene is gi number 1673727 from Mycoplasma pneumonia.

Available sections:


Potential Target Homologues of MP074

Targetlog10(PSIBLAST E)% identical% coverage (of MP gene)Latest Status
1001B-45.7032.0469.36Suggested work stop (1192 1192 4957 4981 4981 5931 5931 6021 6021 6116 6116 7498 7960 7960 8131 8131 9084 9084 13286 13286 13830 13830)
1741T-111.00100.00100.00Work stopped (uncertainty in predicting domain boundaries)

Back to Top


MP074 DNA sequence

ATGCGATACTTTGCCCTCACTGATTTAACCAAGCCCACTGCAAAATTTACCACTGATGCCATGGAACTGCGTCAGATATC
CAATGCTTACCTCAACCAAGCCCAAGCATATTTGCAAAAGGGTTTAAAGCAACTCAAAAAGGATTACAAGAATGCCATTT
TGTATACCCCCAAAACGGAGTACAAGCGCTTTTTGAAATGGAAGCAAACTTTTCTTCAAGATTTAAACCAAACCCAAAAG
CGCTTCTTTATTGTTCGTGCCCAACATTTTTCTTACGTGTTGTTGTTTAACTTATTGGATGAACAGGTGGAAGCGGTGAT
TGCGACCTTTAACAACTTCTTGGATGAACACCGCTTGGAGGCACAATCCAAGCAGTTTGATTTAGACACAGCTGTCAATG
AACTGCATGATTACTTTGACCAACTGCAAAAGAACACCGCTTACAGCGAGGATTTACCAACTCACCTCACGCAAAAAACC
GAGCAATTAATTAACGCTCGTAATACCCAGTTAACGAATTTGTTAAACAAGATTGCTACTACCAAACCACCCCTAAACAA
ACAGCAACGTTTGTTAGCCAGTTTTCGCAATTACCACGAACACCTCTTTTTAAAGAACGAGGTCAAAAAGGTGACTTGGT
TAAACGAACCACGCGCCAAAAAAGAGTCTGTTACCCCTGATGAAGAACACATTATTGAACTTAAAAACGTTTACAAGTAC
ATTACTAATGGGGTGACTACCAATGCCGTTTTAAAGGGCATAGATCTCAAACTCAAAGCCCATGATTTCATTGTGATTTT
AGGTCCCTCTGGCTCGGGGAAAACGACCCTGCTCAACATTATTTCCGGGATGGACCGGCCATCAAGCGGATCTGTTGTCG
TTAATGGTCAGGAAATGATTTGCATGAACGACCGTCAGTTAACTAACTTTCGCCGCAACTATGTCGGTTACATCTTCCAG
CAGTACGGGCTTTTACCCAACCTAACTGTAAGGGAAAACGTTGAAGTGGGTGCTAACTTGCAACGTAACCCTGATAAGCG
GATTAACATTGATGAACTGTTAGAAGCAGTAGGGATGAAACACCTCCAAAAGAAACTCCCCAACGAGTTGAGTGGTGGAC
AACAACAACGGGTCTCGATTGCACGGGCGTTTGCTAAAAATCCCTTACTCATTTTTGGTGATGAACCAACTGGGGCTTTG
GATTTGGAAATGACCCAAATTGTGCTCAAACAGTTTTTAGCCATTAAACAACGCTATAAAACCACGATGGTCATAGTGAC
GCACAATAACCTAATTGCCCAACTAGCCGATTTAGTCATTTATGTGGCAGATGGCAAAATTCAAGCACTGCAAGCAAACC
CTAATCCCAAACAGGTTGAGGACATTAACTGAATTTAG

Back to Top


MP074 Codon Usage

(466 codons)
fields: [triplet] [frequency: per thousand] ([number])

UUU34.3(  16)  UCU8.6(   4)  UAU10.7(   5)  UGU0.0(   0)  
UUC8.6(   4)  UCC6.4(   3)  UAC25.8(  12)  UGC2.1(   1)  
UUA42.9(  20)  UCA2.1(   1)  UAA0.0(   0)  UGA2.1(   1)  
UUG30.0(  14)  UCG4.3(   2)  UAG2.1(   1)  UGG4.3(   2)  

CUU6.4(   3)  CCU6.4(   3)  CAU6.4(   3)  CGU12.9(   6)  
CUC25.8(  12)  CCC17.2(   8)  CAC15.0(   7)  CGC17.2(   8)  
CUA8.6(   4)  CCA10.7(   5)  CAA53.6(  25)  CGA2.1(   1)  
CUG12.9(   6)  CCG0.0(   0)  CAG23.6(  11)  CGG8.6(   4)  

AUU51.5(  24)  ACU25.8(  12)  AAU25.8(  12)  AGU4.3(   2)  
AUC2.1(   1)  ACC30.0(  14)  AAC53.6(  25)  AGC4.3(   2)  
AUA6.4(   3)  ACA2.1(   1)  AAA47.2(  22)  AGA0.0(   0)  
AUG17.2(   8)  ACG12.9(   6)  AAG36.5(  17)  AGG2.1(   1)  

GUU17.2(   8)  GCU17.2(   8)  GAU38.6(  18)  GGU15.0(   7)  
GUC15.0(   7)  GCC25.8(  12)  GAC10.7(   5)  GGC6.4(   3)  
GUA4.3(   2)  GCA17.2(   8)  GAA36.5(  17)  GGA4.3(   2)  
GUG21.5(  10)  GCG6.4(   3)  GAG17.2(   8)  GGG12.9(   6)  

Back to Top


MP074 Translation

Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")

atgcgatactttgccctcactgatttaaccaagcccactgcaaaatttaccactgatgcc
 M  R  Y  F  A  L  T  D  L  T  K  P  T  A  K  F  T  T  D  A 
atggaactgcgtcagatatccaatgcttacctcaaccaagcccaagcatatttgcaaaag
 M  E  L  R  Q  I  S  N  A  Y  L  N  Q  A  Q  A  Y  L  Q  K 
ggtttaaagcaactcaaaaaggattacaagaatgccattttgtatacccccaaaacggag
 G  L  K  Q  L  K  K  D  Y  K  N  A  I  L  Y  T  P  K  T  E 
tacaagcgctttttgaaatggaagcaaacttttcttcaagatttaaaccaaacccaaaag
 Y  K  R  F  L  K  W  K  Q  T  F  L  Q  D  L  N  Q  T  Q  K 
cgcttctttattgttcgtgcccaacatttttcttacgtgttgttgtttaacttattggat
 R  F  F  I  V  R  A  Q  H  F  S  Y  V  L  L  F  N  L  L  D 
gaacaggtggaagcggtgattgcgacctttaacaacttcttggatgaacaccgcttggag
 E  Q  V  E  A  V  I  A  T  F  N  N  F  L  D  E  H  R  L  E 
gcacaatccaagcagtttgatttagacacagctgtcaatgaactgcatgattactttgac
 A  Q  S  K  Q  F  D  L  D  T  A  V  N  E  L  H  D  Y  F  D 
caactgcaaaagaacaccgcttacagcgaggatttaccaactcacctcacgcaaaaaacc
 Q  L  Q  K  N  T  A  Y  S  E  D  L  P  T  H  L  T  Q  K  T 
gagcaattaattaacgctcgtaatacccagttaacgaatttgttaaacaagattgctact
 E  Q  L  I  N  A  R  N  T  Q  L  T  N  L  L  N  K  I  A  T 
accaaaccacccctaaacaaacagcaacgtttgttagccagttttcgcaattaccacgaa
 T  K  P  P  L  N  K  Q  Q  R  L  L  A  S  F  R  N  Y  H  E 
cacctctttttaaagaacgaggtcaaaaaggtgacttggttaaacgaaccacgcgccaaa
 H  L  F  L  K  N  E  V  K  K  V  T  W  L  N  E  P  R  A  K 
aaagagtctgttacccctgatgaagaacacattattgaacttaaaaacgtttacaagtac
 K  E  S  V  T  P  D  E  E  H  I  I  E  L  K  N  V  Y  K  Y 
attactaatggggtgactaccaatgccgttttaaagggcatagatctcaaactcaaagcc
 I  T  N  G  V  T  T  N  A  V  L  K  G  I  D  L  K  L  K  A 
catgatttcattgtgattttaggtccctctggctcggggaaaacgaccctgctcaacatt
 H  D  F  I  V  I  L  G  P  S  G  S  G  K  T  T  L  L  N  I 
atttccgggatggaccggccatcaagcggatctgttgtcgttaatggtcaggaaatgatt
 I  S  G  M  D  R  P  S  S  G  S  V  V  V  N  G  Q  E  M  I 
tgcatgaacgaccgtcagttaactaactttcgccgcaactatgtcggttacatcttccag
 C  M  N  D  R  Q  L  T  N  F  R  R  N  Y  V  G  Y  I  F  Q 
cagtacgggcttttacccaacctaactgtaagggaaaacgttgaagtgggtgctaacttg
 Q  Y  G  L  L  P  N  L  T  V  R  E  N  V  E  V  G  A  N  L 
caacgtaaccctgataagcggattaacattgatgaactgttagaagcagtagggatgaaa
 Q  R  N  P  D  K  R  I  N  I  D  E  L  L  E  A  V  G  M  K 
cacctccaaaagaaactccccaacgagttgagtggtggacaacaacaacgggtctcgatt
 H  L  Q  K  K  L  P  N  E  L  S  G  G  Q  Q  Q  R  V  S  I 
gcacgggcgtttgctaaaaatcccttactcatttttggtgatgaaccaactggggctttg
 A  R  A  F  A  K  N  P  L  L  I  F  G  D  E  P  T  G  A  L 
gatttggaaatgacccaaattgtgctcaaacagtttttagccattaaacaacgctataaa
 D  L  E  M  T  Q  I  V  L  K  Q  F  L  A  I  K  Q  R  Y  K 
accacgatggtcatagtgacgcacaataacctaattgcccaactagccgatttagtcatt
 T  T  M  V  I  V  T  H  N  N  L  I  A  Q  L  A  D  L  V  I 
tatgtggcagatggcaaaattcaagcactgcaagcaaaccctaatcccaaacaggttgag
 Y  V  A  D  G  K  I  Q  A  L  Q  A  N  P  N  P  K  Q  V  E 
gacattaactgaatttag
 D  I  N  *  I  * 

Back to Top


MP074 AA Sequence

MRYFALTDLTKPTAKFTTDAMELRQISNAYLNQAQAYLQKGLKQLKKDYKNAILYTPKTEYKRFLKWKQTFLQDLNQTQK
RFFIVRAQHFSYVLLFNLLDEQVEAVIATFNNFLDEHRLEAQSKQFDLDTAVNELHDYFDQLQKNTAYSEDLPTHLTQKT
EQLINARNTQLTNLLNKIATTKPPLNKQQRLLASFRNYHEHLFLKNEVKKVTWLNEPRAKKESVTPDEEHIIELKNVYKY
ITNGVTTNAVLKGIDLKLKAHDFIVILGPSGSGKTTLLNIISGMDRPSSGSVVVNGQEMICMNDRQLTNFRRNYVGYIFQ
QYGLLPNLTVRENVEVGANLQRNPDKRINIDELLEAVGMKHLQKKLPNELSGGQQQRVSIARAFAKNPLLIFGDEPTGAL
DLEMTQIVLKQFLAIKQRYKTTMVIVTHNNLIAQLADLVIYVADGKIQALQANPNPKQVEDINWI

Back to Top


MP074 Protein Parameters

This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.

Number of amino acids: 465

Molecular weight: 53511.5

Theoretical pI: 9.30

Amino acid composition:

Ala (A)  31	  6.7%
Arg (R)  20	  4.3%
Asn (N)  37	  8.0%
Asp (D)  23	  4.9%
Cys (C)   1	  0.2%
Gln (Q)  36	  7.7%
Glu (E)  25	  5.4%
Gly (G)  18	  3.9%
His (H)  10	  2.2%
Ile (I)  28	  6.0%
Leu (L)  59	 12.7%
Lys (K)  39	  8.4%
Met (M)   8	  1.7%
Phe (F)  20	  4.3%
Pro (P)  16	  3.4%
Ser (S)  14	  3.0%
Thr (T)  33	  7.1%
Trp (W)   3	  0.6%
Tyr (Y)  17	  3.7%
Val (V)  27	  5.8%

Asx (B)   0	  0.0%
Glx (Z)   0	  0.0%
Xaa (X)   0	  0.0%

Total number of negatively charged residues (Asp + Glu): 48
Total number of positively charged residues (Arg + Lys): 59

Atomic composition:

Carbon      C	      2408
Hydrogen    H	      3842
Nitrogen    N	       660
Oxygen      O	       699
Sulfur      S	         9

Formula: C2408H3842N660O699S9
Total number of atoms: 7618

Extinction coefficients:

Conditions: 6.0 M guanidium hydrochloride
            0.02 M phosphate buffer
            pH 6.5

Extinction coefficients are in units of  M-1 cm-1 .

The first table lists values computed assuming ALL Cys 
residues appear as half cystines, whereas the second table 
assumes that NONE do. 

                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    40850   40600   39845   38830   37200
Abs 0.1% (=1 g/l)   0.763   0.759   0.745   0.726   0.695



                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    40850   40600   39845   38830   37200
Abs 0.1% (=1 g/l)   0.763   0.759   0.745   0.726   0.695


Estimated half-life:

The N-terminal of the sequence considered is M (Met).

The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
                            >20 hours (yeast, in vivo).
                            >10 hours (Escherichia coli, in vivo).


Instability index:

The instability index (II) is computed to be 27.55
This classifies the protein as stable.



Aliphatic index: 96.47

Grand average of hydropathicity (GRAVY): -0.422

Back to Top


Predicted properties of MP074

ModelPercentage of MP Gene
Coiled Coil (CCP)0
Disordered (SEG)0
Transmembrane (PHDhtm)0
Transmembrane (TMHMM)0
Homologous to known structure (PSIBLAST)98

Back to Top


MP074 Prediction Details

Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from
SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary

                 10        20        30        40        50        60        70        80
                  |         |         |         |         |         |         |         |
Sequence MRYFALTDLTKPTAKFTTDAMELRQISNAYLNQAQAYLQKGLKQLKKDYKNAILYTPKTEYKRFLKWKQTFLQDLNQTQK
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST ----3333333333333333333333333333333333333333333333333333333333333333333333333333
Pred2ary --HHHHHHHHH----------EEEE-------------HHHHHHH----EEEEE-----------HHHHHHHHHHHH---

                 90       100       110       120       130       140       150       160
                  |         |         |         |         |         |         |         |
Sequence RFFIVRAQHFSYVLLFNLLDEQVEAVIATFNNFLDEHRLEAQSKQFDLDTAVNELHDYFDQLQKNTAYSEDLPTHLTQKT
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST 33333333333333333333333333333333333333333333333333333333333333333333333333333333
Pred2ary ---EEE-----HHHHHHHHHHHHHHH------------------HHHHHHHHHHHH------------HHHHHHHHHHHH

                170       180       190       200       210       220       230       240
                  |         |         |         |         |         |         |         |
Sequence EQLINARNTQLTNLLNKIATTKPPLNKQQRLLASFRNYHEHLFLKNEVKKVTWLNEPRAKKESVTPDEEHIIELKNVYKY
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST 33333333333333333333333333333333333333333333333333333333333333333333333333333333
Pred2ary HHHHHH---EEEEEE--------EEEEE---EEEE---------------EE------------------EEEEEEEEEE

                250       260       270       280       290       300       310       320
                  |         |         |         |         |         |         |         |
Sequence ITNGVTTNAVLKGIDLKLKAHDFIVILGPSGSGKTTLLNIISGMDRPSSGSVVVNGQEMICMNDRQLTNFRRNYVGYIFQ
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST 33333333333333333333333333333333333333333333333333333333333333333333333333333333
Pred2ary E------EEEEE-EEEEE----EEEEE-------HHHHHHHHH-------EEEE--------------H-----EEEEEE

                330       340       350       360       370       380       390       400
                  |         |         |         |         |         |         |         |
Sequence QYGLLPNLTVRENVEVGANLQRNPDKRINIDELLEAVGMKHLQKKLPNELSGGQQQRVSIARAFAKNPLLIFGDEPTGAL
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST 33333333333333333333333333333333333333333333333333333333333333333333333333333333
Pred2ary --------HHHHHHHH-------HHHHHHHHHHHHHH---HHH----------HHHHHHHHHHHH----EEEE-------

                410       420       430       440       450       460
                  |         |         |         |         |         |
Sequence DLEMTQIVLKQFLAIKQRYKTTMVIVTHNNLIAQLADLVIYVADGKIQALQANPNPKQVEDINWI
CCP      -----------------------------------------------------------------
SEG      -----------------------------------------------------------------
PHDhtm   -----------------------------------------------------------------
TMHMM    -----------------------------------------------------------------
PSIBLAST 33333333333333333333333333333333-----------3333333333333333333333
Pred2ary HHHHHHHHHHHHHHHHHH---EEEEE--HHHHHHH---EEEE---EEEE----------------

Back to Top