|
|
Available sections:
| Target | log10(PSIBLAST E) | % identical | % coverage (of MP gene) | Latest Status |
|---|---|---|---|---|
| 1043B | -81.70 | 36.47 | 99.43 | Purified |
| 1044B | -37.00 | 34.35 | 93.57 | Purified |
| 1123B | -110.00 | 40.13 | 98.31 | Purified |
| 1189B | -43.10 | 36.14 | 93.09 | Expression tested |
| 1190B | -126.00 | 37.09 | 98.49 | Selected |
| 1579T | -26.30 | 49.21 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1580T | -30.52 | 71.08 | 98.81 | Work stopped (uncertainty in predicting domain boundaries) |
| 1582T | -52.40 | 37.00 | 98.70 | Work stopped (uncertainty in predicting domain boundaries) |
| 1583T | -41.70 | 45.56 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1584T | -37.52 | 38.71 | 96.88 | Work stopped (uncertainty in predicting domain boundaries) |
| 1585T | -25.70 | 54.65 | 94.51 | Work stopped (uncertainty in predicting domain boundaries) |
| 1586T | -34.70 | 38.97 | 98.55 | Work stopped (uncertainty in predicting domain boundaries) |
| 1587T | -27.70 | 44.58 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1588T | -42.00 | 37.14 | 98.31 | Work stopped (uncertainty in predicting domain boundaries) |
| 1627T | -15.22 | 52.17 | 92.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1628T | -30.70 | 44.05 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1650T | -25.30 | 59.04 | 98.81 | Work stopped (uncertainty in predicting domain boundaries) |
| 1651T | -27.52 | 57.61 | 97.87 | Work stopped (uncertainty in predicting domain boundaries) |
| 1697T | -48.40 | 33.33 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1698T | -42.05 | 37.10 | 93.94 | Work stopped (uncertainty in predicting domain boundaries) |
| 1699T | -23.10 | 30.43 | 88.46 | Work stopped (uncertainty in predicting domain boundaries) |
| 1700T | -48.30 | 34.73 | 92.27 | Work stopped (uncertainty in predicting domain boundaries) |
| 1701T | -25.40 | 31.94 | 94.74 | Work stopped (uncertainty in predicting domain boundaries) |
| 1702T | -41.70 | 35.71 | 85.14 | Work stopped (uncertainty in predicting domain boundaries) |
| 1703T | -42.15 | 35.71 | 97.67 | Work stopped (uncertainty in predicting domain boundaries) |
| 1705T | -20.40 | 52.24 | 97.10 | Work stopped (uncertainty in predicting domain boundaries) |
| 1706T | -21.22 | 33.87 | 87.32 | Work stopped (uncertainty in predicting domain boundaries) |
| 1707T | -19.70 | 52.24 | 70.53 | Work stopped (uncertainty in predicting domain boundaries) |
| 1709T | -44.70 | 33.08 | 96.30 | Work stopped (uncertainty in predicting domain boundaries) |
| 1710T | -52.15 | 32.35 | 95.51 | Work stopped (uncertainty in predicting domain boundaries) |
| 1711T | -45.52 | 31.47 | 96.62 | Work stopped (uncertainty in predicting domain boundaries) |
| 1712T | -36.00 | 50.51 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1713T | -56.00 | 42.71 | 99.50 | Work stopped (uncertainty in predicting domain boundaries) |
| 1714T | -27.70 | 44.74 | 98.70 | Work stopped (uncertainty in predicting domain boundaries) |
| 1715T | -40.70 | 41.54 | 95.59 | Work stopped (uncertainty in predicting domain boundaries) |
| 1716T | -17.70 | 51.72 | 95.08 | Override suggest work stop (False positive) |
| 1717T | -25.00 | 48.53 | 95.77 | Override suggest work stop (low sequence homology to the known structures) |
| 1718T | -25.00 | 48.53 | 95.77 | Work stopped (uncertainty in predicting domain boundaries) |
| 1719T | -39.70 | 39.20 | 89.93 | Work stopped (uncertainty in predicting domain boundaries) |
| 1720T | -49.15 | 32.74 | 92.31 | Work stopped (uncertainty in predicting domain boundaries) |
| 1721T | -38.52 | 41.67 | 99.17 | Work stopped (uncertainty in predicting domain boundaries) |
| 1722T | -22.00 | 53.33 | 97.40 | Work stopped (uncertainty in predicting domain boundaries) |
| 1723T | -23.70 | 54.41 | 98.55 | Work stopped (uncertainty in predicting domain boundaries) |
| 1737T | -28.00 | 45.88 | 98.84 | Work stopped (uncertainty in predicting domain boundaries) |
| 1738T | -17.10 | 46.43 | 70.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1739T | -28.22 | 100.00 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1740T | -19.40 | 100.00 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1767T | -15.70 | 44.00 | 89.29 | Work stopped (uncertainty in predicting domain boundaries) |
| 1768T | -62.00 | 32.09 | 94.71 | Work stopped (uncertainty in predicting domain boundaries) |
| 1769T | -15.00 | 50.00 | 85.19 | Work stopped (uncertainty in predicting domain boundaries) |
| 1771T | -42.70 | 33.85 | 83.87 | Work stopped (uncertainty in predicting domain boundaries) |
| 1772T | -14.40 | 47.83 | 86.79 | Work stopped (uncertainty in predicting domain boundaries) |
| 1773T | -14.00 | 45.65 | 80.70 | Work stopped (uncertainty in predicting domain boundaries) |
ATGGTTAAGGTCTGAAAAATTGGGTTTGGGGTCTTTCTACCAACAGCGCTTTTATTCTCGGCCTGTTCGTTTAAGGACTA TATTCCTACACCAAGCTTTAGGAAGGATTTCAGTACGGAGAACAATTTTGTTAAAAATAAGGTTCCCGGGAAGGATGACA TTTACAGTAAATTTTACGATCTAACCTTTTCCTTAAACTTTGTCAACAACCAAGCCCAAGAGTTTGGTACCGGTTGGTTA ATTGACTGGAAGGGTGATGAGAATAAGAATCTAAGCAAAAACAAAGAAGGACAAACGGCATCTCAAACCAGAAGTTCCTC TGAACAAACTACCGATCAAGACGCTAACCTCTTTACTGCCTATATCGCCACTAACCTCCATGTCGCGGATGGTTTAAAGA ATGATCAGGACTACGCTCCTTACAACAAGGATGGTTGGGGTCAACCATATCCTTACCAGCAAAAAACCCAATCCTTCTTA TTAGGTAAGTACACCAAACCTAATGTCCAGTTAGTCAAGACTAACTATGAAAAGCCAGAGGATGCCGTGATTGAGCAGAA GCTCAAGGAAGATTCGCTGTTGTTTATCCAAACGAGTACGTTACCTAAAACAGCATACGCAGCAATTGATCCGGTGAACT TTAGCTATAACCCAACCCGTACCAACGGTTTTTGAACAGCCGGTAAATACAATGTTTACAATGGTGGTAACAGCATTGGT AACTATGCTGATTTTGCTGTGATTGAAGTACCGTTAGTACTATCCAACCCCAATGATGCCAAGATCTACCAAGAGTGGAT TAGACCAGCTACCCAAGCCTACAAATATTTAGGTGATGTAGAAGGTTTGTTTGCTAAGAAGGGCTATCGCAGTTACATCC AAGATTTCTACCACTTATTGGGTTATCCGGTAACTAAGAACACTAAATCAGAGTTTATTTTGGGACAAAGTCAAGGTACT GTGAACCACATGAGTTTTAGTAATGATGAGTCCAATACTACTAATACTATTACTAAAGCTAGCTTAACTCAGCAACCCCA CAAGGAACAATCAGCTTACGTGGTACGTGAAAGTGGACTGCCAACCCTAACAATGAATGTGGACAAGTACACTGGTGCAA AGGGAACTCACCTAGTTAACGTTGACCAAATTACTGACCTTAGTTTGGGTGATGGTTTAATTGACTTTGGTGGATTATCA CGCTTTATCTTGCAGTACCACAATGTGAACTATAAGCAATTTGGTTATGGCACGATCTTGTGAGATACTAACTTTGGCGG TGGTTCTTCTGGTAGTGCCATCTTTAACCAAAACAAGCAGATTAACAGTATTTACTTTGGAGCATTAGTTAATGTAACTA CCGACAGAAACGAAAATGTGGGTTTAGGCTTAGGGCAAATTCTCCGTGCTCCCAATACCTTCAACTCTTCCCACGAAGTG CCCTATTCTTATGACCTCATCTTTGGTGATGTTAACACCACCAATTTTTACGCCCAGTTTGCGAAAAAGCACAACACCCA CATGTGGAGCAAAATTCAAAGTACGCAGAATGGAGAAATTGGTTTCCACAAAAATTCAAAAACCGGTCAACAGCGGCATT AA
(534 codons) fields: [triplet] [frequency: per thousand] ([number]) UUU48.7( 26) UCU11.2( 6) UAU24.3( 13) UGU1.9( 1) UUC11.2( 6) UCC11.2( 6) UAC33.7( 18) UGC0.0( 0) UUA31.8( 17) UCA7.5( 4) UAA1.9( 1) UGA5.6( 3) UUG13.1( 7) UCG5.6( 3) UAG0.0( 0) UGG9.4( 5) CUU3.7( 2) CCU9.4( 5) CAU3.7( 2) CGU5.6( 3) CUC9.4( 5) CCC9.4( 5) CAC16.9( 9) CGC3.7( 2) CUA11.2( 6) CCA13.1( 7) CAA43.1( 23) CGA0.0( 0) CUG3.7( 2) CCG5.6( 3) CAG18.7( 10) CGG1.9( 1) AUU33.7( 18) ACU31.8( 17) AAU37.5( 20) AGU24.3( 13) AUC15.0( 8) ACC30.0( 16) AAC50.6( 27) AGC11.2( 6) AUA0.0( 0) ACA9.4( 5) AAA30.0( 16) AGA5.6( 3) AUG7.5( 4) ACG11.2( 6) AAG44.9( 24) AGG1.9( 1) GUU15.0( 8) GCU16.9( 9) GAU35.6( 19) GGU52.4( 28) GUC11.2( 6) GCC18.7( 10) GAC20.6( 11) GGC7.5( 4) GUA11.2( 6) GCA11.2( 6) GAA20.6( 11) GGA13.1( 7) GUG16.9( 9) GCG5.6( 3) GAG15.0( 8) GGG7.5( 4)
Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")
atggttaaggtctgaaaaattgggtttggggtctttctaccaacagcgcttttattctcg M V K V * K I G F G V F L P T A L L F S gcctgttcgtttaaggactatattcctacaccaagctttaggaaggatttcagtacggag A C S F K D Y I P T P S F R K D F S T E aacaattttgttaaaaataaggttcccgggaaggatgacatttacagtaaattttacgat N N F V K N K V P G K D D I Y S K F Y D ctaaccttttccttaaactttgtcaacaaccaagcccaagagtttggtaccggttggtta L T F S L N F V N N Q A Q E F G T G W L attgactggaagggtgatgagaataagaatctaagcaaaaacaaagaaggacaaacggca I D W K G D E N K N L S K N K E G Q T A tctcaaaccagaagttcctctgaacaaactaccgatcaagacgctaacctctttactgcc S Q T R S S S E Q T T D Q D A N L F T A tatatcgccactaacctccatgtcgcggatggtttaaagaatgatcaggactacgctcct Y I A T N L H V A D G L K N D Q D Y A P tacaacaaggatggttggggtcaaccatatccttaccagcaaaaaacccaatccttctta Y N K D G W G Q P Y P Y Q Q K T Q S F L ttaggtaagtacaccaaacctaatgtccagttagtcaagactaactatgaaaagccagag L G K Y T K P N V Q L V K T N Y E K P E gatgccgtgattgagcagaagctcaaggaagattcgctgttgtttatccaaacgagtacg D A V I E Q K L K E D S L L F I Q T S T ttacctaaaacagcatacgcagcaattgatccggtgaactttagctataacccaacccgt L P K T A Y A A I D P V N F S Y N P T R accaacggtttttgaacagccggtaaatacaatgtttacaatggtggtaacagcattggt T N G F * T A G K Y N V Y N G G N S I G aactatgctgattttgctgtgattgaagtaccgttagtactatccaaccccaatgatgcc N Y A D F A V I E V P L V L S N P N D A aagatctaccaagagtggattagaccagctacccaagcctacaaatatttaggtgatgta K I Y Q E W I R P A T Q A Y K Y L G D V gaaggtttgtttgctaagaagggctatcgcagttacatccaagatttctaccacttattg E G L F A K K G Y R S Y I Q D F Y H L L ggttatccggtaactaagaacactaaatcagagtttattttgggacaaagtcaaggtact G Y P V T K N T K S E F I L G Q S Q G T gtgaaccacatgagttttagtaatgatgagtccaatactactaatactattactaaagct V N H M S F S N D E S N T T N T I T K A agcttaactcagcaaccccacaaggaacaatcagcttacgtggtacgtgaaagtggactg S L T Q Q P H K E Q S A Y V V R E S G L ccaaccctaacaatgaatgtggacaagtacactggtgcaaagggaactcacctagttaac P T L T M N V D K Y T G A K G T H L V N gttgaccaaattactgaccttagtttgggtgatggtttaattgactttggtggattatca V D Q I T D L S L G D G L I D F G G L S cgctttatcttgcagtaccacaatgtgaactataagcaatttggttatggcacgatcttg R F I L Q Y H N V N Y K Q F G Y G T I L tgagatactaactttggcggtggttcttctggtagtgccatctttaaccaaaacaagcag * D T N F G G G S S G S A I F N Q N K Q attaacagtatttactttggagcattagttaatgtaactaccgacagaaacgaaaatgtg I N S I Y F G A L V N V T T D R N E N V ggtttaggcttagggcaaattctccgtgctcccaataccttcaactcttcccacgaagtg G L G L G Q I L R A P N T F N S S H E V ccctattcttatgacctcatctttggtgatgttaacaccaccaatttttacgcccagttt P Y S Y D L I F G D V N T T N F Y A Q F gcgaaaaagcacaacacccacatgtggagcaaaattcaaagtacgcagaatggagaaatt A K K H N T H M W S K I Q S T Q N G E I ggtttccacaaaaattcaaaaaccggtcaacagcggcattaa G F H K N S K T G Q Q R H *
MVKVWKIGFGVFLPTALLFSACSFKDYIPTPSFRKDFSTENNFVKNKVPGKDDIYSKFYDLTFSLNFVNNQAQEFGTGWL IDWKGDENKNLSKNKEGQTASQTRSSSEQTTDQDANLFTAYIATNLHVADGLKNDQDYAPYNKDGWGQPYPYQQKTQSFL LGKYTKPNVQLVKTNYEKPEDAVIEQKLKEDSLLFIQTSTLPKTAYAAIDPVNFSYNPTRTNGFWTAGKYNVYNGGNSIG NYADFAVIEVPLVLSNPNDAKIYQEWIRPATQAYKYLGDVEGLFAKKGYRSYIQDFYHLLGYPVTKNTKSEFILGQSQGT VNHMSFSNDESNTTNTITKASLTQQPHKEQSAYVVRESGLPTLTMNVDKYTGAKGTHLVNVDQITDLSLGDGLIDFGGLS RFILQYHNVNYKQFGYGTILWDTNFGGGSSGSAIFNQNKQINSIYFGALVNVTTDRNENVGLGLGQILRAPNTFNSSHEV PYSYDLIFGDVNTTNFYAQFAKKHNTHMWSKIQSTQNGEIGFHKNSKTGQQRH
This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.
Number of amino acids: 533
Molecular weight: 59971.7
Theoretical pI: 7.69
Amino acid composition:
Ala (A) 28 5.3%
Arg (R) 10 1.9%
Asn (N) 47 8.8%
Asp (D) 30 5.6%
Cys (C) 1 0.2%
Gln (Q) 33 6.2%
Glu (E) 19 3.6%
Gly (G) 43 8.1%
His (H) 11 2.1%
Ile (I) 26 4.9%
Leu (L) 39 7.3%
Lys (K) 40 7.5%
Met (M) 4 0.8%
Phe (F) 32 6.0%
Pro (P) 20 3.8%
Ser (S) 38 7.1%
Thr (T) 44 8.3%
Trp (W) 8 1.5%
Tyr (Y) 31 5.8%
Val (V) 29 5.4%
Asx (B) 0 0.0%
Glx (Z) 0 0.0%
Xaa (X) 0 0.0%
Total number of negatively charged residues (Asp + Glu): 49
Total number of positively charged residues (Arg + Lys): 50
Atomic composition:
Carbon C 2707
Hydrogen H 4079
Nitrogen N 713
Oxygen O 825
Sulfur S 5
Formula: C2707H4079N713O825S5
Total number of atoms: 8329
Extinction coefficients:
Conditions: 6.0 M guanidium hydrochloride
0.02 M phosphate buffer
pH 6.5
Extinction coefficients are in units of M-1 cm-1 .
The first table lists values computed assuming ALL Cys
residues appear as half cystines, whereas the second table
assumes that NONE do.
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 88150 88200 86975 85200 82000
Abs 0.1% (=1 g/l) 1.470 1.471 1.450 1.421 1.367
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 88150 88200 86975 85200 82000
Abs 0.1% (=1 g/l) 1.470 1.471 1.450 1.421 1.367
Estimated half-life:
The N-terminal of the sequence considered is M (Met).
The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
>20 hours (yeast, in vivo).
>10 hours (Escherichia coli, in vivo).
Instability index:
The instability index (II) is computed to be 33.57
This classifies the protein as stable.
Aliphatic index: 68.59
Grand average of hydropathicity (GRAVY): -0.579| Model | Percentage of MP Gene |
|---|---|
| Coiled Coil (CCP) | 0 |
| Disordered (SEG) | 0 |
| Transmembrane (PHDhtm) | 0 |
| Transmembrane (TMHMM) | 0 |
| Homologous to known structure (PSIBLAST) | 0 |
Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary
10 20 30 40 50 60 70 80
| | | | | | | |
Sequence MVKVWKIGFGVFLPTALLFSACSFKDYIPTPSFRKDFSTENNFVKNKVPGKDDIYSKFYDLTFSLNFVNNQAQEFGTGWL
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary -EEEE-HHHHHHHHHHHHHH-------------E-------EEE-----------------EEEEEEE---------EEE
90 100 110 120 130 140 150 160
| | | | | | | |
Sequence IDWKGDENKNLSKNKEGQTASQTRSSSEQTTDQDANLFTAYIATNLHVADGLKNDQDYAPYNKDGWGQPYPYQQKTQSFL
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary EE-----------------EEEEEE------------EEEEEE--HHHHH--------------------------EEEE
170 180 190 200 210 220 230 240
| | | | | | | |
Sequence LGKYTKPNVQLVKTNYEKPEDAVIEQKLKEDSLLFIQTSTLPKTAYAAIDPVNFSYNPTRTNGFWTAGKYNVYNGGNSIG
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary E------------------HHHHHH-------EEEE-------EEEE---------------------------------
250 260 270 280 290 300 310 320
| | | | | | | |
Sequence NYADFAVIEVPLVLSNPNDAKIYQEWIRPATQAYKYLGDVEGLFAKKGYRSYIQDFYHLLGYPVTKNTKSEFILGQSQGT
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ----EEEEEE---------HHHHHHHHHHHHHHHHHH-------------------EEEE-----------EEE------
330 340 350 360 370 380 390 400
| | | | | | | |
Sequence VNHMSFSNDESNTTNTITKASLTQQPHKEQSAYVVRESGLPTLTMNVDKYTGAKGTHLVNVDQITDLSLGDGLIDFGGLS
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary --------------------------------------------------------EEE-------------EEEE----
410 420 430 440 450 460 470 480
| | | | | | | |
Sequence RFILQYHNVNYKQFGYGTILWDTNFGGGSSGSAIFNQNKQINSIYFGALVNVTTDRNENVGLGLGQILRAPNTFNSSHEV
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary --EEE---HHHHH----EEEE-------------E-----EEEEEEEEEE-------------HHHHE------------
490 500 510 520 530
| | | | |
Sequence PYSYDLIFGDVNTTNFYAQFAKKHNTHMWSKIQSTQNGEIGFHKNSKTGQQRH
CCP -----------------------------------------------------
SEG -----------------------------------------------------
PHDhtm -----------------------------------------------------
TMHMM -----------------------------------------------------
PSIBLAST -----------------------------------------------------
Pred2ary ---EEEEE-----HHHHHHHHHHHHHHH-----------EEEE----------