Berkeley Structural Genomics Center

Info for MP012

This gene is gi number 1673658 from Mycoplasma pneumonia.

Available sections:


Potential Target Homologues of MP012

Targetlog10(PSIBLAST E)% identical% coverage (of MP gene)Latest Status
1390B-108.0063.4695.59Selected
1392B-18.7060.2645.09Selected
1406B-105.0083.6787.50Selected
1411B-139.0072.3578.34Selected
1416B-200.0086.5683.60Selected
1419B-142.0097.5099.38Selected

Back to Top


MP012 DNA sequence

ATGAAATCGAAGCTAAAGTTAAAACGTTATTTACTGTTTTTACCACTTTTACCGCTAGGGACGTTGTCACTAGCCAACAC
CTACCTCCTCCAAGACCACAACACCCTCACCCCCTACACGCCCTTTACGACACCGCTCAATGGGGGGCTGGATGTCGTGC
GCGCCGCCCATTTACACCCCTCATACGAACTCGTGGACTGAAAGCGGGTGGGGGATACCAAGTTGGTGGCGCTGGTCCGC
TCAGCGTTGGTCAGGGTGAAATTCCAGGACACAACGAGTTCGGATCAAAGTAATACCAACCAAAATGCCTTGAGTTTTGA
TACCCAAGAATCACAGAAGGCACTTAATGGCTCGCAGAGTGGATCTTCTGACACTTCCGGGTCTAACTCCCAAGACTTCG
CCAGCTATGTCCTCATCTTTAAAGCCGCGCCCAGGGCCACGTGGGTGTTTGAACGCAAGATTAAGTTGGCGTTGCCCTAC
GTTAAGCAGGAAAGTCAGGGTTCCGGCGATCAAGGTTCCAATGGTAAGGGCTCCCTCTACAAAACCCTCCAAGACCTCCT
CGTCGAACAACCCGTGACCCCTTACACCCCGAATGCGGGGTTAGCCCGGGTGAATGGGGTTGCTCAGGATACGGTTCATT
TTGGTTCGGGTCAAGAATCGAGTTGGAATTCCCAACGTTCCCAAAAAGGCCTTAAAAACAACCCCGGACCCAAAGCCGTC
ACCGGCTTTAAGCTCGATAAGGGCCGCGCGTACCGGAAGCTGAATGAAAGTTGACCGGTGTATGAACCCCTGGATTCGAC
CAAGGAGGGGAAGGGGAAGGATGAGAGCTCTTGGAAAAATTCGGAAAAAACAACAGCGGAAAATGATGCCCCGTTGGTGG
GGATGGTTGGAAGTGGTGCGGCTGGAAGTGCTTCTAGTTTACAAGGCAATGGCTCGAACAGTTCGGGGTTAAAATCGCTC
TTGAGATCAGCACCTGTCAGTGTTCCACCAAGCAGTACAAGTAATCAAACTTTAAGCTTATCTAACCCCGCTCCTGTGGG
CCCACAAGCGGTTGTAAGCCAACCCGCGGGGGGTGCTACGGCAGCAGTGTCCGTCAATCGCACAGCGAGTGACACCGCCA
CCTTTAGCAAGTACCTCAACACCGCCCAGGCCTTGCACCAGATGGGGGTGATTGTTCCGGGGTTGGAAAAATGAGGTGGT
AACAACGGTACGGGTGTAGTGGCTAGCCGACAGGATGCTACTTCCACTAACCTGCCCCATGCGGCAGGTGCTTCCCAAAC
GGGTTTGGGAACTGGTTCGCCCCGCGAACCAGCTTTAACCGCAACGTCACAGCGTGCCGTCACGGTGGTTGCTGGCCCCC
TTCGTGCGGGCAATAGCAGTGAAACTGATGCCCTACCGAATGTCATCACCCAGCTCTATCATACTTCAACCGCCCAACTC
GCTTACTTAAATGGCCAGATCGTTGTGATGGGTTCCGACCGGGTACCGAGTCTTTGGTATTGAGTTGTCGGGGAGGACCA
GGAATCGGGCAAAGCGACCTGATGAGCGAAAACCGAGCTCAACTGGGGCACCGACAAGCAGAAGCAGTTTGTCGAAAACC
AGTTGGGGTTTAAAGATGACTCAAATTCGGATTCCAAAAATTCGAATTTGAAGGCCCAAGGCCTCACCCAACCCGCCTAC
CTCATCGCCGGTCTTGACGTTGTGGCCGACCACCTCGTCTTTGCGGCCTTTAAAGCGGGCGCGGTGGGGTATGATATGAC
GACTGATTCGAGCGCTTCGACCTACAACCAAGCACTCGCCTGGTCGACCACGGCCGGGTTGGACAGTGATGGGGGGTACA
AGGCCTTGGTGGAAAACACGGCCGGGCTCAACGGCCCGATTAATGGCTTGTTTACCCTGCTCGACACCTTTGCGTATGTG
ACCCCCGTGAGTGGGATGAAAGGGGGGAGTCAGAATAATGAAGAAGTGCAAACGACTTACCCGGTCAAGTCCGACCAAAA
GGCCACCGCCAAAATTGCCTCCTTAATTAATGCCAGCCCACTCAACAGTTATGGGGATGATGGGGTGACCGTGTTTGATG
CCCTGGGCCTTAACTTTAACTTTAAGTTGAACGAGGAGCGCTTGCCATCGCGCACCGACCAACTGCTTGTGTATGGGATT
GTAAACGAAAGTGAACTGAAGTCCGCACGGGAAAATGCCCAGTCGACCTCCGATGATAATTCAAACACCAAAGTCAAGTG
AACCAACACCGCCTCGCACTACCTCCCCGTGCCGTATTACTACAGTGCCAATTTCCCCGAAGCGGGTAACAGAAGGCGAG
CGGAGCAGCGGAATGGGGTGAAGATTAGCACCTTGGAATCGCAAGCCACTGATGGCTTTGCCAACTCGTTACTTAACTTT
GGTACCGGTCTTAAAGCCGGTGTTGACCCAGCTCCAGTAGCACGGGGTCATAAACCGAACTATAGTGCAGTACTACTAGT
GCGTGGTGGCGTTGTAAGGTTAAACTTTAACCCCGATACTGATAAACTGTTGGATTCTACTGACAAAAACAGTGAACCTA
TCTCCTTCTCCTATACCCCATTTGGGTCTGCTGAAAGTGCCGTAGACCTCACCACGTTGAAGGATGTGACCTATATTGCT
GAAAGTGGTCTGTGGTTCTATACCTTTGACAATGGTGAAAAACCAACGTACGATGGTAAACAACAACAGGTCAAAAACCG
CAAGGGTTATGCTGTGATTACCGTATCACGTACCGGAATTGAATTTAACGAGGACGCTAATACCACAACCTTAAGCCAAG
CCCCAGCTGCTTTGGCTGTCCAAAACGGGATTGCTTCCAGTCAGGACGACCTCACAGGCATCCTACCGTTATCCGATGAG
TTCTCCGCTGTGATTACCAAGGATCAAACATGGACCGGTAAGGTTGATATCTATAAGAACACCAACGGGTTGTTTGAAAA
GGATGATCAGCTATCGGAAAACGTGAAGAGGCGTGACAACGGTTTGGTCCCTATTTACAACGAAGGTATCGTCGATATTT
GGGGCAGAGTGGATTTTGCTGCCAACAGTGTTTTGCAAGCGCGTAACCTCACTGATAAAACGGTTGATGAGGTGATCAAT
AACCCCGATATCCTCCAAAGCTTCTTTAAGTTTACCCCAGCCTTTGATAACCAAAGAGCAATGCTAGTGGGGGAAAAGAC
ATCGGATACTACCTTAACGGTTAAACCGAAGATTGAGTACTTGGATGGTAACTTCTATGGTGAGGATTCCAAGATTGCTG
GAATTCCGCTCAACATTGATTTCCCTTCCCGGATTTTTGCTGGCTTTGCTGCTTTACCGTCCTGGGTCATTCCGGTATCA
GTCGGTTCATCGGTGGGCATTCTCTTAATCCTGCTCATCTTAGGCCTTGGTATTGGAATTCCAATGTATAAGGTCCGCAA
GCTTCAAGACTCCAGCTTTGTTGATGTGTTTAAAAAGGTGGATACGTTGACAACCGCTGTGGGTAGCGTGTACAAGAAGA
TTATCACCCAAACGAGTGTGATCAAAAAAGCTCCTAGTGCGTTGAAAGCTGCTAATAACGCTGCTCCTAAAGCACCAGTT
AAACCAGCTGCTCCAACAGCTCCAAGACCACCAGTCCAACCACCTAAAAAGGCTTAA

Back to Top


MP012 Codon Usage

(1219 codons)
fields: [triplet] [frequency: per thousand] ([number])

UUU26.3(  32)  UCU6.6(   8)  UAU14.8(  18)  UGU0.0(   0)  
UUC7.4(   9)  UCC20.5(  25)  UAC16.4(  20)  UGC0.0(   0)  
UUA17.2(  21)  UCA9.8(  12)  UAA0.8(   1)  UGA5.7(   7)  
UUG23.0(  28)  UCG20.5(  25)  UAG0.0(   0)  UGG8.2(  10)  

CUU9.8(  12)  CCU7.4(   9)  CAU4.1(   5)  CGU6.6(   8)  
CUC25.4(  31)  CCC16.4(  20)  CAC4.1(   5)  CGC8.2(  10)  
CUA7.4(   9)  CCA17.2(  21)  CAA28.7(  35)  CGA1.6(   2)  
CUG10.7(  13)  CCG13.9(  17)  CAG18.0(  22)  CGG6.6(   8)  

AUU19.7(  24)  ACU11.5(  14)  AAU26.3(  32)  AGU25.4(  31)  
AUC11.5(  14)  ACC41.0(  50)  AAC37.7(  46)  AGC12.3(  15)  
AUA0.0(   0)  ACA9.8(  12)  AAA29.5(  36)  AGA4.1(   5)  
AUG6.6(   8)  ACG17.2(  21)  AAG35.3(  43)  AGG4.1(   5)  

GUU15.6(  19)  GCU30.4(  37)  GAU36.9(  45)  GGU27.9(  34)  
GUC18.9(  23)  GCC32.8(  40)  GAC21.3(  26)  GGC21.3(  26)  
GUA8.2(  10)  GCA9.8(  12)  GAA26.3(  32)  GGA6.6(   8)  
GUG33.6(  41)  GCG18.9(  23)  GAG9.8(  12)  GGG26.3(  32)  

Back to Top


MP012 Translation

Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")

atgaaatcgaagctaaagttaaaacgttatttactgtttttaccacttttaccgctaggg
 M  K  S  K  L  K  L  K  R  Y  L  L  F  L  P  L  L  P  L  G 
acgttgtcactagccaacacctacctcctccaagaccacaacaccctcaccccctacacg
 T  L  S  L  A  N  T  Y  L  L  Q  D  H  N  T  L  T  P  Y  T 
ccctttacgacaccgctcaatggggggctggatgtcgtgcgcgccgcccatttacacccc
 P  F  T  T  P  L  N  G  G  L  D  V  V  R  A  A  H  L  H  P 
tcatacgaactcgtggactgaaagcgggtgggggataccaagttggtggcgctggtccgc
 S  Y  E  L  V  D  *  K  R  V  G  D  T  K  L  V  A  L  V  R 
tcagcgttggtcagggtgaaattccaggacacaacgagttcggatcaaagtaataccaac
 S  A  L  V  R  V  K  F  Q  D  T  T  S  S  D  Q  S  N  T  N 
caaaatgccttgagttttgatacccaagaatcacagaaggcacttaatggctcgcagagt
 Q  N  A  L  S  F  D  T  Q  E  S  Q  K  A  L  N  G  S  Q  S 
ggatcttctgacacttccgggtctaactcccaagacttcgccagctatgtcctcatcttt
 G  S  S  D  T  S  G  S  N  S  Q  D  F  A  S  Y  V  L  I  F 
aaagccgcgcccagggccacgtgggtgtttgaacgcaagattaagttggcgttgccctac
 K  A  A  P  R  A  T  W  V  F  E  R  K  I  K  L  A  L  P  Y 
gttaagcaggaaagtcagggttccggcgatcaaggttccaatggtaagggctccctctac
 V  K  Q  E  S  Q  G  S  G  D  Q  G  S  N  G  K  G  S  L  Y 
aaaaccctccaagacctcctcgtcgaacaacccgtgaccccttacaccccgaatgcgggg
 K  T  L  Q  D  L  L  V  E  Q  P  V  T  P  Y  T  P  N  A  G 
ttagcccgggtgaatggggttgctcaggatacggttcattttggttcgggtcaagaatcg
 L  A  R  V  N  G  V  A  Q  D  T  V  H  F  G  S  G  Q  E  S 
agttggaattcccaacgttcccaaaaaggccttaaaaacaaccccggacccaaagccgtc
 S  W  N  S  Q  R  S  Q  K  G  L  K  N  N  P  G  P  K  A  V 
accggctttaagctcgataagggccgcgcgtaccggaagctgaatgaaagttgaccggtg
 T  G  F  K  L  D  K  G  R  A  Y  R  K  L  N  E  S  *  P  V 
tatgaacccctggattcgaccaaggaggggaaggggaaggatgagagctcttggaaaaat
 Y  E  P  L  D  S  T  K  E  G  K  G  K  D  E  S  S  W  K  N 
tcggaaaaaacaacagcggaaaatgatgccccgttggtggggatggttggaagtggtgcg
 S  E  K  T  T  A  E  N  D  A  P  L  V  G  M  V  G  S  G  A 
gctggaagtgcttctagtttacaaggcaatggctcgaacagttcggggttaaaatcgctc
 A  G  S  A  S  S  L  Q  G  N  G  S  N  S  S  G  L  K  S  L 
ttgagatcagcacctgtcagtgttccaccaagcagtacaagtaatcaaactttaagctta
 L  R  S  A  P  V  S  V  P  P  S  S  T  S  N  Q  T  L  S  L 
tctaaccccgctcctgtgggcccacaagcggttgtaagccaacccgcggggggtgctacg
 S  N  P  A  P  V  G  P  Q  A  V  V  S  Q  P  A  G  G  A  T 
gcagcagtgtccgtcaatcgcacagcgagtgacaccgccacctttagcaagtacctcaac
 A  A  V  S  V  N  R  T  A  S  D  T  A  T  F  S  K  Y  L  N 
accgcccaggccttgcaccagatgggggtgattgttccggggttggaaaaatgaggtggt
 T  A  Q  A  L  H  Q  M  G  V  I  V  P  G  L  E  K  *  G  G 
aacaacggtacgggtgtagtggctagccgacaggatgctacttccactaacctgccccat
 N  N  G  T  G  V  V  A  S  R  Q  D  A  T  S  T  N  L  P  H 
gcggcaggtgcttcccaaacgggtttgggaactggttcgccccgcgaaccagctttaacc
 A  A  G  A  S  Q  T  G  L  G  T  G  S  P  R  E  P  A  L  T 
gcaacgtcacagcgtgccgtcacggtggttgctggcccccttcgtgcgggcaatagcagt
 A  T  S  Q  R  A  V  T  V  V  A  G  P  L  R  A  G  N  S  S 
gaaactgatgccctaccgaatgtcatcacccagctctatcatacttcaaccgcccaactc
 E  T  D  A  L  P  N  V  I  T  Q  L  Y  H  T  S  T  A  Q  L 
gcttacttaaatggccagatcgttgtgatgggttccgaccgggtaccgagtctttggtat
 A  Y  L  N  G  Q  I  V  V  M  G  S  D  R  V  P  S  L  W  Y 
tgagttgtcggggaggaccaggaatcgggcaaagcgacctgatgagcgaaaaccgagctc
 *  V  V  G  E  D  Q  E  S  G  K  A  T  *  *  A  K  T  E  L 
aactggggcaccgacaagcagaagcagtttgtcgaaaaccagttggggtttaaagatgac
 N  W  G  T  D  K  Q  K  Q  F  V  E  N  Q  L  G  F  K  D  D 
tcaaattcggattccaaaaattcgaatttgaaggcccaaggcctcacccaacccgcctac
 S  N  S  D  S  K  N  S  N  L  K  A  Q  G  L  T  Q  P  A  Y 
ctcatcgccggtcttgacgttgtggccgaccacctcgtctttgcggcctttaaagcgggc
 L  I  A  G  L  D  V  V  A  D  H  L  V  F  A  A  F  K  A  G 
gcggtggggtatgatatgacgactgattcgagcgcttcgacctacaaccaagcactcgcc
 A  V  G  Y  D  M  T  T  D  S  S  A  S  T  Y  N  Q  A  L  A 
tggtcgaccacggccgggttggacagtgatggggggtacaaggccttggtggaaaacacg
 W  S  T  T  A  G  L  D  S  D  G  G  Y  K  A  L  V  E  N  T 
gccgggctcaacggcccgattaatggcttgtttaccctgctcgacacctttgcgtatgtg
 A  G  L  N  G  P  I  N  G  L  F  T  L  L  D  T  F  A  Y  V 
acccccgtgagtgggatgaaaggggggagtcagaataatgaagaagtgcaaacgacttac
 T  P  V  S  G  M  K  G  G  S  Q  N  N  E  E  V  Q  T  T  Y 
ccggtcaagtccgaccaaaaggccaccgccaaaattgcctccttaattaatgccagccca
 P  V  K  S  D  Q  K  A  T  A  K  I  A  S  L  I  N  A  S  P 
ctcaacagttatggggatgatggggtgaccgtgtttgatgccctgggccttaactttaac
 L  N  S  Y  G  D  D  G  V  T  V  F  D  A  L  G  L  N  F  N 
tttaagttgaacgaggagcgcttgccatcgcgcaccgaccaactgcttgtgtatgggatt
 F  K  L  N  E  E  R  L  P  S  R  T  D  Q  L  L  V  Y  G  I 
gtaaacgaaagtgaactgaagtccgcacgggaaaatgcccagtcgacctccgatgataat
 V  N  E  S  E  L  K  S  A  R  E  N  A  Q  S  T  S  D  D  N 
tcaaacaccaaagtcaagtgaaccaacaccgcctcgcactacctccccgtgccgtattac
 S  N  T  K  V  K  *  T  N  T  A  S  H  Y  L  P  V  P  Y  Y 
tacagtgccaatttccccgaagcgggtaacagaaggcgagcggagcagcggaatggggtg
 Y  S  A  N  F  P  E  A  G  N  R  R  R  A  E  Q  R  N  G  V 
aagattagcaccttggaatcgcaagccactgatggctttgccaactcgttacttaacttt
 K  I  S  T  L  E  S  Q  A  T  D  G  F  A  N  S  L  L  N  F 
ggtaccggtcttaaagccggtgttgacccagctccagtagcacggggtcataaaccgaac
 G  T  G  L  K  A  G  V  D  P  A  P  V  A  R  G  H  K  P  N 
tatagtgcagtactactagtgcgtggtggcgttgtaaggttaaactttaaccccgatact
 Y  S  A  V  L  L  V  R  G  G  V  V  R  L  N  F  N  P  D  T 
gataaactgttggattctactgacaaaaacagtgaacctatctccttctcctatacccca
 D  K  L  L  D  S  T  D  K  N  S  E  P  I  S  F  S  Y  T  P 
tttgggtctgctgaaagtgccgtagacctcaccacgttgaaggatgtgacctatattgct
 F  G  S  A  E  S  A  V  D  L  T  T  L  K  D  V  T  Y  I  A 
gaaagtggtctgtggttctatacctttgacaatggtgaaaaaccaacgtacgatggtaaa
 E  S  G  L  W  F  Y  T  F  D  N  G  E  K  P  T  Y  D  G  K 
caacaacaggtcaaaaaccgcaagggttatgctgtgattaccgtatcacgtaccggaatt
 Q  Q  Q  V  K  N  R  K  G  Y  A  V  I  T  V  S  R  T  G  I 
gaatttaacgaggacgctaataccacaaccttaagccaagccccagctgctttggctgtc
 E  F  N  E  D  A  N  T  T  T  L  S  Q  A  P  A  A  L  A  V 
caaaacgggattgcttccagtcaggacgacctcacaggcatcctaccgttatccgatgag
 Q  N  G  I  A  S  S  Q  D  D  L  T  G  I  L  P  L  S  D  E 
ttctccgctgtgattaccaaggatcaaacatggaccggtaaggttgatatctataagaac
 F  S  A  V  I  T  K  D  Q  T  W  T  G  K  V  D  I  Y  K  N 
accaacgggttgtttgaaaaggatgatcagctatcggaaaacgtgaagaggcgtgacaac
 T  N  G  L  F  E  K  D  D  Q  L  S  E  N  V  K  R  R  D  N 
ggtttggtccctatttacaacgaaggtatcgtcgatatttggggcagagtggattttgct
 G  L  V  P  I  Y  N  E  G  I  V  D  I  W  G  R  V  D  F  A 
gccaacagtgttttgcaagcgcgtaacctcactgataaaacggttgatgaggtgatcaat
 A  N  S  V  L  Q  A  R  N  L  T  D  K  T  V  D  E  V  I  N 
aaccccgatatcctccaaagcttctttaagtttaccccagcctttgataaccaaagagca
 N  P  D  I  L  Q  S  F  F  K  F  T  P  A  F  D  N  Q  R  A 
atgctagtgggggaaaagacatcggatactaccttaacggttaaaccgaagattgagtac
 M  L  V  G  E  K  T  S  D  T  T  L  T  V  K  P  K  I  E  Y 
ttggatggtaacttctatggtgaggattccaagattgctggaattccgctcaacattgat
 L  D  G  N  F  Y  G  E  D  S  K  I  A  G  I  P  L  N  I  D 
ttcccttcccggatttttgctggctttgctgctttaccgtcctgggtcattccggtatca
 F  P  S  R  I  F  A  G  F  A  A  L  P  S  W  V  I  P  V  S 
gtcggttcatcggtgggcattctcttaatcctgctcatcttaggccttggtattggaatt
 V  G  S  S  V  G  I  L  L  I  L  L  I  L  G  L  G  I  G  I 
ccaatgtataaggtccgcaagcttcaagactccagctttgttgatgtgtttaaaaaggtg
 P  M  Y  K  V  R  K  L  Q  D  S  S  F  V  D  V  F  K  K  V 
gatacgttgacaaccgctgtgggtagcgtgtacaagaagattatcacccaaacgagtgtg
 D  T  L  T  T  A  V  G  S  V  Y  K  K  I  I  T  Q  T  S  V 
atcaaaaaagctcctagtgcgttgaaagctgctaataacgctgctcctaaagcaccagtt
 I  K  K  A  P  S  A  L  K  A  A  N  N  A  A  P  K  A  P  V 
aaaccagctgctccaacagctccaagaccaccagtccaaccacctaaaaaggcttaa
 K  P  A  A  P  T  A  P  R  P  P  V  Q  P  P  K  K  A  * 

Back to Top


MP012 AA Sequence

MKSKLKLKRYLLFLPLLPLGTLSLANTYLLQDHNTLTPYTPFTTPLNGGLDVVRAAHLHPSYELVDWKRVGDTKLVALVR
SALVRVKFQDTTSSDQSNTNQNALSFDTQESQKALNGSQSGSSDTSGSNSQDFASYVLIFKAAPRATWVFERKIKLALPY
VKQESQGSGDQGSNGKGSLYKTLQDLLVEQPVTPYTPNAGLARVNGVAQDTVHFGSGQESSWNSQRSQKGLKNNPGPKAV
TGFKLDKGRAYRKLNESWPVYEPLDSTKEGKGKDESSWKNSEKTTAENDAPLVGMVGSGAAGSASSLQGNGSNSSGLKSL
LRSAPVSVPPSSTSNQTLSLSNPAPVGPQAVVSQPAGGATAAVSVNRTASDTATFSKYLNTAQALHQMGVIVPGLEKWGG
NNGTGVVASRQDATSTNLPHAAGASQTGLGTGSPREPALTATSQRAVTVVAGPLRAGNSSETDALPNVITQLYHTSTAQL
AYLNGQIVVMGSDRVPSLWYWVVGEDQESGKATWWAKTELNWGTDKQKQFVENQLGFKDDSNSDSKNSNLKAQGLTQPAY
LIAGLDVVADHLVFAAFKAGAVGYDMTTDSSASTYNQALAWSTTAGLDSDGGYKALVENTAGLNGPINGLFTLLDTFAYV
TPVSGMKGGSQNNEEVQTTYPVKSDQKATAKIASLINASPLNSYGDDGVTVFDALGLNFNFKLNEERLPSRTDQLLVYGI
VNESELKSARENAQSTSDDNSNTKVKWTNTASHYLPVPYYYSANFPEAGNRRRAEQRNGVKISTLESQATDGFANSLLNF
GTGLKAGVDPAPVARGHKPNYSAVLLVRGGVVRLNFNPDTDKLLDSTDKNSEPISFSYTPFGSAESAVDLTTLKDVTYIA
ESGLWFYTFDNGEKPTYDGKQQQVKNRKGYAVITVSRTGIEFNEDANTTTLSQAPAALAVQNGIASSQDDLTGILPLSDE
FSAVITKDQTWTGKVDIYKNTNGLFEKDDQLSENVKRRDNGLVPIYNEGIVDIWGRVDFAANSVLQARNLTDKTVDEVIN
NPDILQSFFKFTPAFDNQRAMLVGEKTSDTTLTVKPKIEYLDGNFYGEDSKIAGIPLNIDFPSRIFAGFAALPSWVIPVS
VGSSVGILLILLILGLGIGIPMYKVRKLQDSSFVDVFKKVDTLTTAVGSVYKKIITQTSVIKKAPSALKAANNAAPKAPV
KPAAPTAPRPPVQPPKKA

Back to Top


MP012 Protein Parameters

This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.

Number of amino acids: 1218

Molecular weight: 130456.9

Theoretical pI: 8.01

Amino acid composition:

Ala (A) 112	  9.2%
Arg (R)  38	  3.1%
Asn (N)  78	  6.4%
Asp (D)  71	  5.8%
Cys (C)   0	  0.0%
Gln (Q)  57	  4.7%
Glu (E)  44	  3.6%
Gly (G) 100	  8.2%
His (H)  10	  0.8%
Ile (I)  38	  3.1%
Leu (L) 114	  9.4%
Lys (K)  79	  6.5%
Met (M)   8	  0.7%
Phe (F)  41	  3.4%
Pro (P)  67	  5.5%
Ser (S) 116	  9.5%
Thr (T)  97	  8.0%
Trp (W)  17	  1.4%
Tyr (Y)  38	  3.1%
Val (V)  93	  7.6%

Asx (B)   0	  0.0%
Glx (Z)   0	  0.0%
Xaa (X)   0	  0.0%

Total number of negatively charged residues (Asp + Glu): 115
Total number of positively charged residues (Arg + Lys): 117

Atomic composition:

Carbon      C	      5785
Hydrogen    H	      9113
Nitrogen    N	      1583
Oxygen      O	      1835
Sulfur      S	         8

Formula: C5785H9113N1583O1835S8
Total number of atoms: 18324

Extinction coefficients:

Conditions: 6.0 M guanidium hydrochloride
            0.02 M phosphate buffer
            pH 6.5

Extinction coefficients are in units of  M-1 cm-1 .

                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient   146900  148400  147330  145370  140800
Abs 0.1% (=1 g/l)   1.126   1.138   1.129   1.114   1.079


Estimated half-life:

The N-terminal of the sequence considered is M (Met).

The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
                            >20 hours (yeast, in vivo).
                            >10 hours (Escherichia coli, in vivo).


Instability index:

The instability index (II) is computed to be 28.46
This classifies the protein as stable.



Aliphatic index: 80.01

Grand average of hydropathicity (GRAVY): -0.355

Back to Top


Predicted properties of MP012

ModelPercentage of MP Gene
Coiled Coil (CCP)0
Disordered (SEG)16
Transmembrane (PHDhtm)2
Transmembrane (TMHMM)2
Homologous to known structure (PSIBLAST)0

Back to Top


MP012 Prediction Details

Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from
SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary

                 10        20        30        40        50        60        70        80
                  |         |         |         |         |         |         |         |
Sequence MKSKLKLKRYLLFLPLLPLGTLSLANTYLLQDHNTLTPYTPFTTPLNGGLDVVRAAHLHPSYELVDWKRVGDTKLVALVR
CCP      --------------------------------------------------------------------------------
SEG      -DDDDDDDDDDDDDDDDDDDDDDD----------DDDDDDDDDDDD----------------------------DDDDDD
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ---HHHHHHHHHHH-------------------------------------EEEEE------EEEEEEE-----EEEEEE

                 90       100       110       120       130       140       150       160
                  |         |         |         |         |         |         |         |
Sequence SALVRVKFQDTTSSDQSNTNQNALSFDTQESQKALNGSQSGSSDTSGSNSQDFASYVLIFKAAPRATWVFERKIKLALPY
CCP      --------------------------------------------------------------------------------
SEG      DDDDDD-----------------------------DDDDDDDDDDDDDDDDD----------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ---EEEEEE-------E----------HHHHHHHH-------------------EEEEEEE----HHHHHHHHHH-----

                170       180       190       200       210       220       230       240
                  |         |         |         |         |         |         |         |
Sequence VKQESQGSGDQGSNGKGSLYKTLQDLLVEQPVTPYTPNAGLARVNGVAQDTVHFGSGQESSWNSQRSQKGLKNNPGPKAV
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary -E----------------HHHHHHHHHH-----------HHHHH------EEE---------------------------

                250       260       270       280       290       300       310       320
                  |         |         |         |         |         |         |         |
Sequence TGFKLDKGRAYRKLNESWPVYEPLDSTKEGKGKDESSWKNSEKTTAENDAPLVGMVGSGAAGSASSLQGNGSNSSGLKSL
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------DDDDDDDDDDDDDDDDDDDDDDDD
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ---------HHHHHH--------------------------------------------EEEEE----------HHHHHH

                330       340       350       360       370       380       390       400
                  |         |         |         |         |         |         |         |
Sequence LRSAPVSVPPSSTSNQTLSLSNPAPVGPQAVVSQPAGGATAAVSVNRTASDTATFSKYLNTAQALHQMGVIVPGLEKWGG
CCP      --------------------------------------------------------------------------------
SEG      DDDDDDDDDDDDDDDDDDDDDD----------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ----EEEEE---------------------EEE------EEEEE-------HHHHHHHHHHHHHHHHH------------

                410       420       430       440       450       460       470       480
                  |         |         |         |         |         |         |         |
Sequence NNGTGVVASRQDATSTNLPHAAGASQTGLGTGSPREPALTATSQRAVTVVAGPLRAGNSSETDALPNVITQLYHTSTAQL
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary -----EEEE---------------------------HHHHH----EEEEE----------------HHHHHHHH-HHHHH

                490       500       510       520       530       540       550       560
                  |         |         |         |         |         |         |         |
Sequence AYLNGQIVVMGSDRVPSLWYWVVGEDQESGKATWWAKTELNWGTDKQKQFVENQLGFKDDSNSDSKNSNLKAQGLTQPAY
CCP      --------------------------------------------------------------------------------
SEG      ---------------------------------------------------------DDDDDDDDDDDDDD---------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary HH----EEEEE-----EEEEEEE---------EEEEE--------HHHHHHHH-------------------------HH

                570       580       590       600       610       620       630       640
                  |         |         |         |         |         |         |         |
Sequence LIAGLDVVADHLVFAAFKAGAVGYDMTTDSSASTYNQALAWSTTAGLDSDGGYKALVENTAGLNGPINGLFTLLDTFAYV
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary HHH--HHHHHHHHHHHHH---------------HHHHHHHHHHH---------EEEEE------------EEE---EEEE

                650       660       670       680       690       700       710       720
                  |         |         |         |         |         |         |         |
Sequence TPVSGMKGGSQNNEEVQTTYPVKSDQKATAKIASLINASPLNSYGDDGVTVFDALGLNFNFKLNEERLPSRTDQLLVYGI
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary EE--------------EEE----------EEEEEEE-------------EEEE-----E---------------EEEEEE

                730       740       750       760       770       780       790       800
                  |         |         |         |         |         |         |         |
Sequence VNESELKSARENAQSTSDDNSNTKVKWTNTASHYLPVPYYYSANFPEAGNRRRAEQRNGVKISTLESQATDGFANSLLNF
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary E----------------------EEEE---------------------------------EEEEE--------HHHHHHH

                810       820       830       840       850       860       870       880
                  |         |         |         |         |         |         |         |
Sequence GTGLKAGVDPAPVARGHKPNYSAVLLVRGGVVRLNFNPDTDKLLDSTDKNSEPISFSYTPFGSAESAVDLTTLKDVTYIA
CCP      --------------------------------------------------------------------------------
SEG      -----------------------DDDDDDDDDDD----------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary H--------------------EEEEEE---EEEEE------------------EEEE----------EE------EEEEE

                890       900       910       920       930       940       950       960
                  |         |         |         |         |         |         |         |
Sequence ESGLWFYTFDNGEKPTYDGKQQQVKNRKGYAVITVSRTGIEFNEDANTTTLSQAPAALAVQNGIASSQDDLTGILPLSDE
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ---EEEEEE--------------------EEEEEEE---EEE----------HHHHHHHHHH------------------

                970       980       990      1000      1010      1020      1030      1040
                  |         |         |         |         |         |         |         |
Sequence FSAVITKDQTWTGKVDIYKNTNGLFEKDDQLSENVKRRDNGLVPIYNEGIVDIWGRVDFAANSVLQARNLTDKTVDEVIN
CCP      --------------------------------------------------------------------------------
SEG      --------------------------------------------------------------------------------
PHDhtm   --------------------------------------------------------------------------------
TMHMM    --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary -EEEEE--------EEEEE---------HHHH----------------------EEEEE---------------HHH---

               1050      1060      1070      1080      1090      1100      1110      1120
                  |         |         |         |         |         |         |         |
Sequence NPDILQSFFKFTPAFDNQRAMLVGEKTSDTTLTVKPKIEYLDGNFYGEDSKIAGIPLNIDFPSRIFAGFAALPSWVIPVS
CCP      --------------------------------------------------------------------------------
SEG      ------------------------------------------------------------------------------DD
PHDhtm   ----------------------------------------------------------------------------HHHH
TMHMM    --------------------------------------------------------------------------HHHHHH
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ---HHHHH----------EEEEEEE-----------EEEEE----------EEEE--------------------EEE--

               1130      1140      1150      1160      1170      1180      1190      1200
                  |         |         |         |         |         |         |         |
Sequence VGSSVGILLILLILGLGIGIPMYKVRKLQDSSFVDVFKKVDTLTTAVGSVYKKIITQTSVIKKAPSALKAANNAAPKAPV
CCP      --------------------------------------------------------------------------------
SEG      DDDDDDDDDDDDDDDDDDDD-----------------------------------------DDDDDDDDDDDDDDDDDDD
PHDhtm   HHHHHHHHHHHHHHHHHHHH------------------------------------------------------------
TMHMM    HHHHHHHHHHHHHHHHH---------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ---HHHHHHHHHHHH------H-HHHHHHHH--H-------EEEHHHHHHHHHHHHH---------H-------------

               1210
                  |
Sequence KPAAPTAPRPPVQPPKKA
CCP      ------------------
SEG      DDDDDDDDDDDDDDDDDD
PHDhtm   ------------------
TMHMM    ------------------
PSIBLAST ------------------
Pred2ary ------------------

Back to Top