|
|
Available sections:
ATGAGAATTAAAGGTGGAAAACAAACCCGCGTACGTCGCAAGAAGTGGTTAAAGCAAGCTAGTGGTAGCTTTGGTACCCG ACATGCCTCTTATAAGGTGGCAAAGCAAACGGTGATCCAAGCCGCTAAGTATGCTTACCGCGATCGTCGTAACAAAAAGC GCGATTTCCGTTCGTTGTGGATCTTGCGCTTAAACGCCGCTTTGCGCGAACAGGGCATGACCTATTCGGTCTTTATCAAT CTATTGAAGAAGCACAACATTGAGATTAACCGCAAAGTGTTGTCGGAACTAGCCATTAAAGAACCAAGTAAATTTAACCT AATTGTGCAAAAGGTTAAAAGTGAACAACCGAAAGCCGCTAAGCCAGCTGCTTTAGGAAACTAG
(128 codons) fields: [triplet] [frequency: per thousand] ([number]) UUU23.4( 3) UCU7.8( 1) UAU23.4( 3) UGU0.0( 0) UUC7.8( 1) UCC0.0( 0) UAC7.8( 1) UGC0.0( 0) UUA23.4( 3) UCA0.0( 0) UAA0.0( 0) UGA0.0( 0) UUG39.1( 5) UCG23.4( 3) UAG7.8( 1) UGG15.6( 2) CUU0.0( 0) CCU0.0( 0) CAU7.8( 1) CGU31.2( 4) CUC0.0( 0) CCC0.0( 0) CAC7.8( 1) CGC54.7( 7) CUA23.4( 3) CCA15.6( 2) CAA46.9( 6) CGA7.8( 1) CUG0.0( 0) CCG7.8( 1) CAG7.8( 1) CGG0.0( 0) AUU39.1( 5) ACU0.0( 0) AAU7.8( 1) AGU23.4( 3) AUC23.4( 3) ACC23.4( 3) AAC46.9( 6) AGC7.8( 1) AUA0.0( 0) ACA0.0( 0) AAA62.5( 8) AGA7.8( 1) AUG15.6( 2) ACG7.8( 1) AAG85.9( 11) AGG0.0( 0) GUU7.8( 1) GCU54.7( 7) GAU15.6( 2) GGU23.4( 3) GUC7.8( 1) GCC39.1( 5) GAC0.0( 0) GGC7.8( 1) GUA7.8( 1) GCA7.8( 1) GAA31.2( 4) GGA15.6( 2) GUG31.2( 4) GCG0.0( 0) GAG7.8( 1) GGG0.0( 0)
Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")
atgagaattaaaggtggaaaacaaacccgcgtacgtcgcaagaagtggttaaagcaagct M R I K G G K Q T R V R R K K W L K Q A agtggtagctttggtacccgacatgcctcttataaggtggcaaagcaaacggtgatccaa S G S F G T R H A S Y K V A K Q T V I Q gccgctaagtatgcttaccgcgatcgtcgtaacaaaaagcgcgatttccgttcgttgtgg A A K Y A Y R D R R N K K R D F R S L W atcttgcgcttaaacgccgctttgcgcgaacagggcatgacctattcggtctttatcaat I L R L N A A L R E Q G M T Y S V F I N ctattgaagaagcacaacattgagattaaccgcaaagtgttgtcggaactagccattaaa L L K K H N I E I N R K V L S E L A I K gaaccaagtaaatttaacctaattgtgcaaaaggttaaaagtgaacaaccgaaagccgct E P S K F N L I V Q K V K S E Q P K A A aagccagctgctttaggaaactag K P A A L G N *
MRIKGGKQTRVRRKKWLKQASGSFGTRHASYKVAKQTVIQAAKYAYRDRRNKKRDFRSLWILRLNAALREQGMTYSVFIN LLKKHNIEINRKVLSELAIKEPSKFNLIVQKVKSEQPKAAKPAALGN
This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.
Number of amino acids: 127
Molecular weight: 14708.3
Theoretical pI: 11.29
Amino acid composition:
Ala (A) 13 10.2%
Arg (R) 13 10.2%
Asn (N) 7 5.5%
Asp (D) 2 1.6%
Cys (C) 0 0.0%
Gln (Q) 7 5.5%
Glu (E) 5 3.9%
Gly (G) 6 4.7%
His (H) 2 1.6%
Ile (I) 8 6.3%
Leu (L) 11 8.7%
Lys (K) 19 15.0%
Met (M) 2 1.6%
Phe (F) 4 3.1%
Pro (P) 3 2.4%
Ser (S) 8 6.3%
Thr (T) 4 3.1%
Trp (W) 2 1.6%
Tyr (Y) 4 3.1%
Val (V) 7 5.5%
Asx (B) 0 0.0%
Glx (Z) 0 0.0%
Xaa (X) 0 0.0%
Total number of negatively charged residues (Asp + Glu): 7
Total number of positively charged residues (Arg + Lys): 32
Atomic composition:
Carbon C 659
Hydrogen H 1097
Nitrogen N 205
Oxygen O 172
Sulfur S 2
Formula: C659H1097N205O172S2
Total number of atoms: 2135
Extinction coefficients:
Conditions: 6.0 M guanidium hydrochloride
0.02 M phosphate buffer
pH 6.5
Extinction coefficients are in units of M-1 cm-1 .
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 16600 16800 16700 16500 16000
Abs 0.1% (=1 g/l) 1.129 1.142 1.135 1.122 1.088
Estimated half-life:
The N-terminal of the sequence considered is M (Met).
The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
>20 hours (yeast, in vivo).
>10 hours (Escherichia coli, in vivo).
Instability index:
The instability index (II) is computed to be 55.39
This classifies the protein as unstable.
Aliphatic index: 84.57
Grand average of hydropathicity (GRAVY): -0.711| Model | Percentage of MP Gene |
|---|---|
| Coiled Coil (CCP) | 0 |
| Disordered (SEG) | 0 |
| Transmembrane (PHDhtm) | 0 |
| Transmembrane (TMHMM) | 0 |
| Homologous to known structure (PSIBLAST) | 0 |
Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary
10 20 30 40 50 60 70 80
| | | | | | | |
Sequence MRIKGGKQTRVRRKKWLKQASGSFGTRHASYKVAKQTVIQAAKYAYRDRRNKKRDFRSLWILRLNAALREQGMTYSVFIN
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ------HHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHH
90 100 110 120
| | | |
Sequence LLKKHNIEINRKVLSELAIKEPSKFNLIVQKVKSEQPKAAKPAALGN
CCP -----------------------------------------------
SEG -----------------------------------------------
PHDhtm -----------------------------------------------
TMHMM -----------------------------------------------
PSIBLAST -----------------------------------------------
Pred2ary HHHHH---HHHHHHHHHH---HHHHHHHHHHHHHHHHHH--HHH---