|
|
Available sections:
| Target | log10(PSIBLAST E) | % identical | % coverage (of MP gene) | Latest Status |
|---|---|---|---|---|
| 1047B | -54.70 | 15.89 | 98.37 | Suggested work stop (7248 7248 7248 8230 8230 8230 8231 8231) |
| 1435T | -11.30 | 31.91 | 78.33 | Override suggest work stop (No sequence homology to the known crystal structures ) |
| 1484T | -16.00 | 42.31 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1485T | -21.70 | 22.99 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1502T | -15.52 | 45.28 | 96.36 | Work stopped (uncertainty in predicting domain boundaries) |
| 1607T | -18.00 | 60.71 | 98.25 | Work stopped (uncertainty in predicting domain boundaries) |
| 1608T | -22.52 | 58.82 | 98.84 | Work stopped (uncertainty in predicting domain boundaries) |
| 1730T | -23.70 | 100.00 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1731T | -17.52 | 100.00 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1847T | -12.40 | 38.00 | 94.34 | Work stopped (uncertainty in predicting domain boundaries) |
| 1852T | -17.70 | 38.89 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1861T | -17.22 | 37.74 | 98.15 | Work stopped (uncertainty in predicting domain boundaries) |
| 1864T | -17.30 | 37.74 | 98.15 | Work stopped (uncertainty in predicting domain boundaries) |
| 1868T | -17.70 | 32.73 | 98.21 | Work stopped (uncertainty in predicting domain boundaries) |
| 1915T | -13.15 | 37.50 | 88.89 | Work stopped (uncertainty in predicting domain boundaries) |
| 1916T | -19.05 | 19.10 | 98.89 | Work stopped (uncertainty in predicting domain boundaries) |
| 1929T | -15.30 | 35.09 | 98.28 | Work stopped (uncertainty in predicting domain boundaries) |
ATGAATAGCCAAGTACACCGCAAGGGCTCTATTGCAGAAGCAGTTAGTGCCATCCAAGCTCATGATAAGATCGTGATCTT CCACCACATTCGTCCTGATGGCGATTGTTTGGGCGCACAACACGGCTTAGCGCGTTTAATCCAAACTAACTTTCCCCACA AGCAGGTCTTCTGTGTTGGTGATCCCAAACACAACTTTCCCTGATTGGAGATGGTTTTCACTCCAAAGGAACAGATTACC CCGGAGTTAATGCAACAAGCCTTAGCCGTTATTGTTGATGCCAACTATAAGGAACGGATTGAGTGCCGGGACTTATTAGA CCAAAACCAGTTTAAGGCAGTATTGCGGATTGACCACCACCCCAATGAGGACGATCTCAATACGACCCATAACTTCGTTG ATGCGTCTTACATTGCCGCGGCTGAACAAGTGGTAGATCTAGCGGTGCAGGCCAAATGGAAGCTTAGCCCCCCAGCGGCT ACGGCGCTGTATCTAGGTATTTATACAGATAGTAATAGGTTTCTATATAGTAATACATCATGAAGAACACTATATCTAGG ATCTATGCTATATAGAGCTCAAGCTAATATAGCTAAGATCCATGATGAGTTAAACCACACTTCCTTAAAGGACATCCAGT TTAAACAATATGTCTTTAAAAACTTTCAGACCTTTCAGAATGTTATTTACTTTGTGGCCGATAAGAAGTTCCAAAAGAAA TTAAAGGTAACACCCTTAGAATGTGCACGGGTAAATATCCTAGCTAACATTGAACAATTCCACATTTGGCTGTTCTTTAT AGAAGAGGGTAAGAACCACTATCGGGTCGAATTCCGTAGTAACGGAATTAACGTACGCGAAGTAGCTTTAAAGTATGGTG GCGGGGGTCATATTCAGGCCAGCGGTGCAGTTCTTAAAAGCAAGCGCGACATAATTCGTGTAGTTCAAGATTGCCAAAAG CAAATTGCTGTATAA
(325 codons) fields: [triplet] [frequency: per thousand] ([number]) UUU30.8( 10) UCU9.2( 3) UAU27.7( 9) UGU9.2( 3) UUC24.6( 8) UCC3.1( 1) UAC6.2( 2) UGC6.2( 2) UUA33.8( 11) UCA3.1( 1) UAA3.1( 1) UGA6.2( 2) UUG9.2( 3) UCG0.0( 0) UAG0.0( 0) UGG6.2( 2) CUU6.2( 2) CCU3.1( 1) CAU12.3( 4) CGU12.3( 4) CUC3.1( 1) CCC18.5( 6) CAC33.8( 11) CGC9.2( 3) CUA21.5( 7) CCA6.2( 2) CAA46.2( 15) CGA0.0( 0) CUG6.2( 2) CCG3.1( 1) CAG24.6( 8) CGG15.4( 5) AUU46.2( 15) ACU9.2( 3) AAU24.6( 8) AGU12.3( 4) AUC21.5( 7) ACC9.2( 3) AAC33.8( 11) AGC12.3( 4) AUA9.2( 3) ACA12.3( 4) AAA18.5( 6) AGA6.2( 2) AUG12.3( 4) ACG6.2( 2) AAG52.3( 17) AGG3.1( 1) GUU27.7( 9) GCU27.7( 9) GAU36.9( 12) GGU18.5( 6) GUC9.2( 3) GCC24.6( 8) GAC18.5( 6) GGC15.4( 5) GUA27.7( 9) GCA18.5( 6) GAA27.7( 9) GGA6.2( 2) GUG12.3( 4) GCG18.5( 6) GAG18.5( 6) GGG3.1( 1)
Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")
atgaatagccaagtacaccgcaagggctctattgcagaagcagttagtgccatccaagct M N S Q V H R K G S I A E A V S A I Q A catgataagatcgtgatcttccaccacattcgtcctgatggcgattgtttgggcgcacaa H D K I V I F H H I R P D G D C L G A Q cacggcttagcgcgtttaatccaaactaactttccccacaagcaggtcttctgtgttggt H G L A R L I Q T N F P H K Q V F C V G gatcccaaacacaactttccctgattggagatggttttcactccaaaggaacagattacc D P K H N F P * L E M V F T P K E Q I T ccggagttaatgcaacaagccttagccgttattgttgatgccaactataaggaacggatt P E L M Q Q A L A V I V D A N Y K E R I gagtgccgggacttattagaccaaaaccagtttaaggcagtattgcggattgaccaccac E C R D L L D Q N Q F K A V L R I D H H cccaatgaggacgatctcaatacgacccataacttcgttgatgcgtcttacattgccgcg P N E D D L N T T H N F V D A S Y I A A gctgaacaagtggtagatctagcggtgcaggccaaatggaagcttagccccccagcggct A E Q V V D L A V Q A K W K L S P P A A acggcgctgtatctaggtatttatacagatagtaataggtttctatatagtaatacatca T A L Y L G I Y T D S N R F L Y S N T S tgaagaacactatatctaggatctatgctatatagagctcaagctaatatagctaagatc * R T L Y L G S M L Y R A Q A N I A K I catgatgagttaaaccacacttccttaaaggacatccagtttaaacaatatgtctttaaa H D E L N H T S L K D I Q F K Q Y V F K aactttcagacctttcagaatgttatttactttgtggccgataagaagttccaaaagaaa N F Q T F Q N V I Y F V A D K K F Q K K ttaaaggtaacacccttagaatgtgcacgggtaaatatcctagctaacattgaacaattc L K V T P L E C A R V N I L A N I E Q F cacatttggctgttctttatagaagagggtaagaaccactatcgggtcgaattccgtagt H I W L F F I E E G K N H Y R V E F R S aacggaattaacgtacgcgaagtagctttaaagtatggtggcgggggtcatattcaggcc N G I N V R E V A L K Y G G G G H I Q A agcggtgcagttcttaaaagcaagcgcgacataattcgtgtagttcaagattgccaaaag S G A V L K S K R D I I R V V Q D C Q K caaattgctgtataa Q I A V *
MNSQVHRKGSIAEAVSAIQAHDKIVIFHHIRPDGDCLGAQHGLARLIQTNFPHKQVFCVGDPKHNFPWLEMVFTPKEQIT PELMQQALAVIVDANYKERIECRDLLDQNQFKAVLRIDHHPNEDDLNTTHNFVDASYIAAAEQVVDLAVQAKWKLSPPAA TALYLGIYTDSNRFLYSNTSWRTLYLGSMLYRAQANIAKIHDELNHTSLKDIQFKQYVFKNFQTFQNVIYFVADKKFQKK LKVTPLECARVNILANIEQFHIWLFFIEEGKNHYRVEFRSNGINVREVALKYGGGGHIQASGAVLKSKRDIIRVVQDCQK QIAV
This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.
Number of amino acids: 324
Molecular weight: 37144.5
Theoretical pI: 8.81
Amino acid composition:
Ala (A) 29 9.0%
Arg (R) 15 4.6%
Asn (N) 19 5.9%
Asp (D) 18 5.6%
Cys (C) 5 1.5%
Gln (Q) 23 7.1%
Glu (E) 15 4.6%
Gly (G) 14 4.3%
His (H) 15 4.6%
Ile (I) 25 7.7%
Leu (L) 26 8.0%
Lys (K) 23 7.1%
Met (M) 4 1.2%
Phe (F) 18 5.6%
Pro (P) 10 3.1%
Ser (S) 13 4.0%
Thr (T) 12 3.7%
Trp (W) 4 1.2%
Tyr (Y) 11 3.4%
Val (V) 25 7.7%
Asx (B) 0 0.0%
Glx (Z) 0 0.0%
Xaa (X) 0 0.0%
Total number of negatively charged residues (Asp + Glu): 33
Total number of positively charged residues (Arg + Lys): 38
Atomic composition:
Carbon C 1679
Hydrogen H 2610
Nitrogen N 468
Oxygen O 469
Sulfur S 9
Formula: C1679H2610N468O469S9
Total number of atoms: 5235
Extinction coefficients:
Conditions: 6.0 M guanidium hydrochloride
0.02 M phosphate buffer
pH 6.5
Extinction coefficients are in units of M-1 cm-1 .
The first table lists values computed assuming ALL Cys
residues appear as half cystines, whereas the second table
assumes that NONE do.
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 37840 38054 37675 37080 35840
Abs 0.1% (=1 g/l) 1.019 1.024 1.014 0.998 0.965
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 37550 37800 37435 36840 35600
Abs 0.1% (=1 g/l) 1.011 1.018 1.008 0.992 0.958
Estimated half-life:
The N-terminal of the sequence considered is M (Met).
The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
>20 hours (yeast, in vivo).
>10 hours (Escherichia coli, in vivo).
Instability index:
The instability index (II) is computed to be 30.86
This classifies the protein as stable.
Aliphatic index: 92.72
Grand average of hydropathicity (GRAVY): -0.269| Model | Percentage of MP Gene |
|---|---|
| Coiled Coil (CCP) | 0 |
| Disordered (SEG) | 0 |
| Transmembrane (PHDhtm) | 0 |
| Transmembrane (TMHMM) | 0 |
| Homologous to known structure (PSIBLAST) | 72 |
Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary
10 20 30 40 50 60 70 80
| | | | | | | |
Sequence MNSQVHRKGSIAEAVSAIQAHDKIVIFHHIRPDGDCLGAQHGLARLIQTNFPHKQVFCVGDPKHNFPWLEMVFTPKEQIT
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST ----------------------3333333333333333333333333333333333333333333333333333333333
Pred2ary ---HHHHHHHHHHHHHHHHH---EEEEE-------HHHHHHHHHHHHHHH----EEEE----------------------
90 100 110 120 130 140 150 160
| | | | | | | |
Sequence PELMQQALAVIVDANYKERIECRDLLDQNQFKAVLRIDHHPNEDDLNTTHNFVDASYIAAAEQVVDLAVQAKWKLSPPAA
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST 33333333333333333333333333333333333333333333333333333333333333333333333333333333
Pred2ary -------EEEEE-------HHHHHHHHH----EEEEEE------------EE-------HHHHHHHHHHHH-------HH
170 180 190 200 210 220 230 240
| | | | | | | |
Sequence TALYLGIYTDSNRFLYSNTSWRTLYLGSMLYRAQANIAKIHDELNHTSLKDIQFKQYVFKNFQTFQNVIYFVADKKFQKK
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST 33333333333333333333333333333333333333333333333333333333333333333333333333333333
Pred2ary HHHHHHHH-------------HHHHHHHHHHH----HHHHHHHHH----HHHHHHHHHHHHHHHH---EEEEE---HHHH
250 260 270 280 290 300 310 320
| | | | | | | |
Sequence LKVTPLECARVNILANIEQFHIWLFFIEEGKNHYRVEFRSNGINVREVALKYGGGGHIQASGAVLKSKRDIIRVVQDCQK
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST 3333333333333333----------------------------------------------------------------
Pred2ary -----HHHHHHHHHHHH----EEEEEEE----EEEEEE-----HHHHHHHHH-----HHHHH-----HHHHHHHHHHHHH
Sequence QIAV
CCP ----
SEG ----
PHDhtm ----
TMHMM ----
PSIBLAST ----
Pred2ary H---