|
|
Available sections:
| Target | log10(PSIBLAST E) | % identical | % coverage (of MP gene) | Latest Status |
|---|---|---|---|---|
| 1981B | -7.05 | 31.43 | 100.00 | Expression tested |
| 1984B | -8.05 | 38.54 | 64.00 | Expression tested |
ATGCAATTAACTACAAGTCAACAAATGCAAACTAACTTTCTTACTACAAATAAACATCACTTATTGAAATACACCAACGG TTTGATTTGGTGTTGGTGATTATTTGTTATTTCTTTAGTTTTAGCTTCATCGACATTTAGAGGTTTCTTTCTCGGCACTA TCAATATTGTGAATTTTGTGTTTTGGATATTAGCACTGATTTTTGGTGTAGCTGTCGCTTTCATTAATGGTGTTTTGAGT AGTGAATTAAAAGAAAATTCAGTTTTTCAAGAAGAACAAAAACGATTCTTCTTGGGCTTTTTCTTTCCACAAATGGCTTT CTGTAATGCGCTTTGACTGAAACTTAAATTAAGCTATCTAAATTCTGAAAGAGAAAACTTATTAGAAAAAATAAAGCAAA AACTAAAGAAGCTTACCTTGTCTGTTTTTGTTGTTTGAGGAATTTATTGTGTTTTAGCAACCTCAATTTATTTACCAAAT GCATTAAGAATTTTAAACATTTATCAAATACCTAACTTAATCGCTTTAATAAATAATAGATTGAGTGAAATTTTGCCAGA TGGTAACCGTTATTTTTTAGGTCATTCTTCATTCGCTTTTCATTACTATGAAATAGTTTCAAGGATTCCTTTTTTAGTTT TCTTTATTATTCCAACAATAACCTTAATTACTTTAGGTTGCTATTTATTCGCTTACTTAAGGTTTATAAACTCAAATAAG TTAAGAAAACCACTTTCTACTTTATCAATAGTTATCATGCTGACTGATGTTGTGGGAATAATTCAATGAATAATTATTGA TATCTTATTAATTTGGCTTAATGTACCCTTTGTTATATTTGTAATTTTTTGAGTAATAAAACTTGTTTTGCCTTTAGCAA TGATAGGTACGTTTGTCAGCTCCTTAACTATTTACAAAAAAGTAACAAGCAAAGAATGATTAGCAATAAAAGAAGAACAA ATTAATTTAACGACAATGAATATTAATATTAATATGGGAGAGCAATCATCCAAAAACATGAATTCTTTTGAAAATCATGA ATCAAATGAACGAAATAGTCTTCAAATTTATCAACAACATTCATCTATGATGTCAGAAACAAAGAGAAAACAAAGTAGCT TAAGTTATGATGCCAGAATCCTTTTACCTAAAAGTCCCTATAACACAAAGAAAACTTTATTTTTAATTATTTTCTTTTCC ATTATCTCCTTGATATTGGCAACTATAGGTAGTGTGTTCATTTCTTTTGCTATAGTTCAAATTTCTATACCTTTTTACGT TATAGGTGGAGTTATTTGATTTTTTACTTTTATTTCTTTATAG
(441 codons) fields: [triplet] [frequency: per thousand] ([number]) UUU63.5( 28) UCU22.7( 10) UAU22.7( 10) UGU6.8( 3) UUC24.9( 11) UCC9.1( 4) UAC11.3( 5) UGC2.3( 1) UUA79.4( 35) UCA24.9( 11) UAA0.0( 0) UGA15.9( 7) UUG22.7( 10) UCG2.3( 1) UAG2.3( 1) UGG9.1( 4) CUU20.4( 9) CCU11.3( 5) CAU11.3( 5) CGU2.3( 1) CUC2.3( 1) CCC4.5( 2) CAC2.3( 1) CGC0.0( 0) CUA4.5( 2) CCA11.3( 5) CAA38.5( 17) CGA4.5( 2) CUG6.8( 3) CCG0.0( 0) CAG0.0( 0) CGG0.0( 0) AUU70.3( 31) ACU24.9( 11) AAU45.4( 20) AGU20.4( 9) AUC13.6( 6) ACC9.1( 4) AAC20.4( 9) AGC9.1( 4) AUA43.1( 19) ACA18.1( 8) AAA40.8( 18) AGA15.9( 7) AUG22.7( 10) ACG4.5( 2) AAG13.6( 6) AGG4.5( 2) GUU38.5( 17) GCU18.1( 8) GAU9.1( 4) GGU22.7( 10) GUC4.5( 2) GCC2.3( 1) GAC0.0( 0) GGC4.5( 2) GUA11.3( 5) GCA13.6( 6) GAA36.3( 16) GGA9.1( 4) GUG9.1( 4) GCG2.3( 1) GAG2.3( 1) GGG0.0( 0)
Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")
atgcaattaactacaagtcaacaaatgcaaactaactttcttactacaaataaacatcac M Q L T T S Q Q M Q T N F L T T N K H H ttattgaaatacaccaacggtttgatttggtgttggtgattatttgttatttctttagtt L L K Y T N G L I W C W * L F V I S L V ttagcttcatcgacatttagaggtttctttctcggcactatcaatattgtgaattttgtg L A S S T F R G F F L G T I N I V N F V ttttggatattagcactgatttttggtgtagctgtcgctttcattaatggtgttttgagt F W I L A L I F G V A V A F I N G V L S agtgaattaaaagaaaattcagtttttcaagaagaacaaaaacgattcttcttgggcttt S E L K E N S V F Q E E Q K R F F L G F ttctttccacaaatggctttctgtaatgcgctttgactgaaacttaaattaagctatcta F F P Q M A F C N A L * L K L K L S Y L aattctgaaagagaaaacttattagaaaaaataaagcaaaaactaaagaagcttaccttg N S E R E N L L E K I K Q K L K K L T L tctgtttttgttgtttgaggaatttattgtgttttagcaacctcaatttatttaccaaat S V F V V * G I Y C V L A T S I Y L P N gcattaagaattttaaacatttatcaaatacctaacttaatcgctttaataaataataga A L R I L N I Y Q I P N L I A L I N N R ttgagtgaaattttgccagatggtaaccgttattttttaggtcattcttcattcgctttt L S E I L P D G N R Y F L G H S S F A F cattactatgaaatagtttcaaggattccttttttagttttctttattattccaacaata H Y Y E I V S R I P F L V F F I I P T I accttaattactttaggttgctatttattcgcttacttaaggtttataaactcaaataag T L I T L G C Y L F A Y L R F I N S N K ttaagaaaaccactttctactttatcaatagttatcatgctgactgatgttgtgggaata L R K P L S T L S I V I M L T D V V G I attcaatgaataattattgatatcttattaatttggcttaatgtaccctttgttatattt I Q * I I I D I L L I W L N V P F V I F gtaattttttgagtaataaaacttgttttgcctttagcaatgataggtacgtttgtcagc V I F * V I K L V L P L A M I G T F V S tccttaactatttacaaaaaagtaacaagcaaagaatgattagcaataaaagaagaacaa S L T I Y K K V T S K E * L A I K E E Q attaatttaacgacaatgaatattaatattaatatgggagagcaatcatccaaaaacatg I N L T T M N I N I N M G E Q S S K N M aattcttttgaaaatcatgaatcaaatgaacgaaatagtcttcaaatttatcaacaacat N S F E N H E S N E R N S L Q I Y Q Q H tcatctatgatgtcagaaacaaagagaaaacaaagtagcttaagttatgatgccagaatc S S M M S E T K R K Q S S L S Y D A R I cttttacctaaaagtccctataacacaaagaaaactttatttttaattattttcttttcc L L P K S P Y N T K K T L F L I I F F S attatctccttgatattggcaactataggtagtgtgttcatttcttttgctatagttcaa I I S L I L A T I G S V F I S F A I V Q atttctatacctttttacgttataggtggagttatttgattttttacttttatttcttta I S I P F Y V I G G V I * F F T F I S L tag *
MQLTTSQQMQTNFLTTNKHHLLKYTNGLIWCWWLFVISLVLASSTFRGFFLGTINIVNFVFWILALIFGVAVAFINGVLS SELKENSVFQEEQKRFFLGFFFPQMAFCNALWLKLKLSYLNSERENLLEKIKQKLKKLTLSVFVVWGIYCVLATSIYLPN ALRILNIYQIPNLIALINNRLSEILPDGNRYFLGHSSFAFHYYEIVSRIPFLVFFIIPTITLITLGCYLFAYLRFINSNK LRKPLSTLSIVIMLTDVVGIIQWIIIDILLIWLNVPFVIFVIFWVIKLVLPLAMIGTFVSSLTIYKKVTSKEWLAIKEEQ INLTTMNININMGEQSSKNMNSFENHESNERNSLQIYQQHSSMMSETKRKQSSLSYDARILLPKSPYNTKKTLFLIIFFS IISLILATIGSVFISFAIVQISIPFYVIGGVIWFFTFISL
This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.
Number of amino acids: 440
Molecular weight: 50935.5
Theoretical pI: 9.58
Amino acid composition:
Ala (A) 16 3.6%
Arg (R) 12 2.7%
Asn (N) 29 6.6%
Asp (D) 4 0.9%
Cys (C) 4 0.9%
Gln (Q) 17 3.9%
Glu (E) 17 3.9%
Gly (G) 16 3.6%
His (H) 6 1.4%
Ile (I) 56 12.7%
Leu (L) 60 13.6%
Lys (K) 24 5.5%
Met (M) 10 2.3%
Phe (F) 39 8.9%
Pro (P) 12 2.7%
Ser (S) 39 8.9%
Thr (T) 25 5.7%
Trp (W) 11 2.5%
Tyr (Y) 15 3.4%
Val (V) 28 6.4%
Asx (B) 0 0.0%
Glx (Z) 0 0.0%
Xaa (X) 0 0.0%
Total number of negatively charged residues (Asp + Glu): 21
Total number of positively charged residues (Arg + Lys): 36
Atomic composition:
Carbon C 2416
Hydrogen H 3741
Nitrogen N 569
Oxygen O 608
Sulfur S 14
Formula: C2416H3741N569O608S14
Total number of atoms: 7348
Extinction coefficients:
Conditions: 6.0 M guanidium hydrochloride
0.02 M phosphate buffer
pH 6.5
Extinction coefficients are in units of M-1 cm-1 .
The first table lists values computed assuming ALL Cys
residues appear as half cystines, whereas the second table
assumes that NONE do.
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 81440 82854 82675 82030 79840
Abs 0.1% (=1 g/l) 1.599 1.627 1.623 1.610 1.567
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 81150 82600 82435 81790 79600
Abs 0.1% (=1 g/l) 1.593 1.622 1.618 1.606 1.563
Estimated half-life:
The N-terminal of the sequence considered is M (Met).
The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
>20 hours (yeast, in vivo).
>10 hours (Escherichia coli, in vivo).
Instability index:
The instability index (II) is computed to be 37.56
This classifies the protein as stable.
Aliphatic index: 124.91
Grand average of hydropathicity (GRAVY): 0.590| Model | Percentage of MP Gene |
|---|---|
| Coiled Coil (CCP) | 6 |
| Disordered (SEG) | 18 |
| Transmembrane (PHDhtm) | 57 |
| Transmembrane (TMHMM) | 45 |
| Homologous to known structure (PSIBLAST) | 0 |
Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary
10 20 30 40 50 60 70 80
| | | | | | | |
Sequence MQLTTSQQMQTNFLTTNKHHLLKYTNGLIWCWWLFVISLVLASSTFRGFFLGTINIVNFVFWILALIFGVAVAFINGVLS
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-
TMHMM ----------------------------HHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHH
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ------HHHHHH-------HHHH-----HEHHHHHHHHHHH-------EEE--EEEHHHHHHHHHHHHHHHHHHHH----
90 100 110 120 130 140 150 160
| | | | | | | |
Sequence SELKENSVFQEEQKRFFLGFFFPQMAFCNALWLKLKLSYLNSERENLLEKIKQKLKKLTLSVFVVWGIYCVLATSIYLPN
CCP ------------------------------------CCCCCCCCCCCCCCCCCCCCCCCCCC------------------
SEG ----------------------------------------------DDDDDDDDDDDDDD--------------------
PHDhtm --------------------HHHHHHHHHHHHHH--------------------------HHHHHHHHHHHHHHHHHHHH
TMHMM --------------HHHHHHHHHHHHHHHHHHH--------------------------HHHHHHHHHHHHHHHHHHHHH
PSIBLAST --------------------------------------------------------------------------------
Pred2ary --------HHHHHHHHHH---HHHHHHHHHHHHHH------HHHHHHHHHHHHHH-----EEEEEEEEEEEEEEE---HH
170 180 190 200 210 220 230 240
| | | | | | | |
Sequence ALRILNIYQIPNLIALINNRLSEILPDGNRYFLGHSSFAFHYYEIVSRIPFLVFFIIPTITLITLGCYLFAYLRFINSNK
CCP --------------------------------------------------------------------------------
SEG ------------------------------------------------DDDDDDDDDDDDDDDDD---------------
PHDhtm HHHHHHHHHHHHHHHHH--------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----
TMHMM HH----------------------------------------------HHHHHHHHHHHHHHHHHHHHHHH---------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary HHHHHH-----HHHHHHH------------EEE----EEEEEEEEE----EEEEEEE---EEEHH-HHHHHHHHHH----
250 260 270 280 290 300 310 320
| | | | | | | |
Sequence LRKPLSTLSIVIMLTDVVGIIQWIIIDILLIWLNVPFVIFVIFWVIKLVLPLAMIGTFVSSLTIYKKVTSKEWLAIKEEQ
CCP --------------------------------------------------------------------------------
SEG -------------------DDDDDDDDDDDDDD---DDDDDDDDDD----------------------------------
PHDhtm ------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------------
TMHMM ---HHHHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHHHHHHH-----------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary --------EEEEEE--HHHHHHHHHHHHHHHHH---EEEEEEHHHHHHH----HH--EEEEEEEEEE---HHHHHHHHH-
330 340 350 360 370 380 390 400
| | | | | | | |
Sequence INLTTMNININMGEQSSKNMNSFENHESNERNSLQIYQQHSSMMSETKRKQSSLSYDARILLPKSPYNTKKTLFLIIFFS
CCP --------------------------------------------------------------------------------
SEG DDDDDDDDDDDD------------------------------------------------------------DDDDDDDD
PHDhtm -----------------------------------------------------------------------HHHHHHHHH
TMHMM ------------------------------------------------------------------------HHHHHHHH
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ---EEEEEE-------------------HHHHHHHHHH---HHHHHHHH---------EEE----------EEEEEEHHH
410 420 430 440
| | | |
Sequence IISLILATIGSVFISFAIVQISIPFYVIGGVIWFFTFISL
CCP ----------------------------------------
SEG DDDDDD----------------------------------
PHDhtm HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-
TMHMM HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH--
PSIBLAST ----------------------------------------
Pred2ary HHHHHHHH---EEEEEEEEEEEE-EEEE--EEEEEEEE--