|
|
Available sections:
| Target | log10(PSIBLAST E) | % identical | % coverage (of MP gene) | Latest Status |
|---|---|---|---|---|
| 1982B | -56.70 | 100.00 | 100.00 | Expression tested |
ATGAAAGTGATTAAGAAGCTCAATCTGCTCAATGAACTAACGATGCTGGTGTGCATCTTCTTCTTTGTGTGCACTATCTC CCTAATTGGCATTGGCATTATGTATGACCTCATTAACAGAACTTCATTGAGTGCACCACGTCGTGATCCGATTTTTCGGA ACCTCAACACAGTCTTAATTGTTTTAGGGGTGTTAGAAATCCTTTTAATGGTGGCCCAATTAGTTATGTCCAATATGGCA GCCAACATTATTAACGAGGTTGCCGAAAACGTCGAGCAAAAGTTTGCTAAAGCCTTGAAATGATCACGTTTTCTGCCCTT TGGTTTGCTCCAACTTTACTGTTACCATAAGATTAAATTGGTAACCCAAACCGATAATATATAA
(128 codons) fields: [triplet] [frequency: per thousand] ([number]) UUU39.1( 5) UCU0.0( 0) UAU7.8( 1) UGU7.8( 1) UUC15.6( 2) UCC15.6( 2) UAC15.6( 2) UGC15.6( 2) UUA39.1( 5) UCA15.6( 2) UAA7.8( 1) UGA7.8( 1) UUG31.2( 4) UCG0.0( 0) UAG0.0( 0) UGG0.0( 0) CUU15.6( 2) CCU0.0( 0) CAU7.8( 1) CGU23.4( 3) CUC39.1( 5) CCC7.8( 1) CAC0.0( 0) CGC0.0( 0) CUA15.6( 2) CCA7.8( 1) CAA31.2( 4) CGA0.0( 0) CUG23.4( 3) CCG7.8( 1) CAG0.0( 0) CGG7.8( 1) AUU78.1( 10) ACU15.6( 2) AAU31.2( 4) AGU7.8( 1) AUC23.4( 3) ACC15.6( 2) AAC46.9( 6) AGC0.0( 0) AUA7.8( 1) ACA7.8( 1) AAA31.2( 4) AGA7.8( 1) AUG46.9( 6) ACG7.8( 1) AAG31.2( 4) AGG0.0( 0) GUU23.4( 3) GCU7.8( 1) GAU15.6( 2) GGU7.8( 1) GUC15.6( 2) GCC31.2( 4) GAC7.8( 1) GGC15.6( 2) GUA7.8( 1) GCA15.6( 2) GAA23.4( 3) GGA0.0( 0) GUG39.1( 5) GCG0.0( 0) GAG15.6( 2) GGG7.8( 1)
Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")
atgaaagtgattaagaagctcaatctgctcaatgaactaacgatgctggtgtgcatcttc M K V I K K L N L L N E L T M L V C I F ttctttgtgtgcactatctccctaattggcattggcattatgtatgacctcattaacaga F F V C T I S L I G I G I M Y D L I N R acttcattgagtgcaccacgtcgtgatccgatttttcggaacctcaacacagtcttaatt T S L S A P R R D P I F R N L N T V L I gttttaggggtgttagaaatccttttaatggtggcccaattagttatgtccaatatggca V L G V L E I L L M V A Q L V M S N M A gccaacattattaacgaggttgccgaaaacgtcgagcaaaagtttgctaaagccttgaaa A N I I N E V A E N V E Q K F A K A L K tgatcacgttttctgccctttggtttgctccaactttactgttaccataagattaaattg * S R F L P F G L L Q L Y C Y H K I K L gtaacccaaaccgataatatataa V T Q T D N I *
MKVIKKLNLLNELTMLVCIFFFVCTISLIGIGIMYDLINRTSLSAPRRDPIFRNLNTVLIVLGVLEILLMVAQLVMSNMA ANIINEVAENVEQKFAKALKWSRFLPFGLLQLYCYHKIKLVTQTDNI
This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.
Number of amino acids: 127
Molecular weight: 14518.6
Theoretical pI: 9.33
Amino acid composition:
Ala (A) 7 5.5%
Arg (R) 5 3.9%
Asn (N) 10 7.9%
Asp (D) 3 2.4%
Cys (C) 3 2.4%
Gln (Q) 4 3.1%
Glu (E) 5 3.9%
Gly (G) 4 3.1%
His (H) 1 0.8%
Ile (I) 14 11.0%
Leu (L) 21 16.5%
Lys (K) 8 6.3%
Met (M) 6 4.7%
Phe (F) 7 5.5%
Pro (P) 3 2.4%
Ser (S) 5 3.9%
Thr (T) 6 4.7%
Trp (W) 1 0.8%
Tyr (Y) 3 2.4%
Val (V) 11 8.7%
Asx (B) 0 0.0%
Glx (Z) 0 0.0%
Xaa (X) 0 0.0%
Total number of negatively charged residues (Asp + Glu): 8
Total number of positively charged residues (Arg + Lys): 13
Atomic composition:
Carbon C 669
Hydrogen H 1095
Nitrogen N 167
Oxygen O 172
Sulfur S 9
Formula: C669H1095N167O172S9
Total number of atoms: 2112
Extinction coefficients:
Conditions: 6.0 M guanidium hydrochloride
0.02 M phosphate buffer
pH 6.5
Extinction coefficients are in units of M-1 cm-1 .
The first table lists values computed assuming ALL Cys
residues appear as half cystines, whereas the second table
assumes that NONE do.
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 9895 9927 9815 9650 9320
Abs 0.1% (=1 g/l) 0.682 0.684 0.676 0.665 0.642
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 9750 9800 9695 9530 9200
Abs 0.1% (=1 g/l) 0.672 0.675 0.668 0.656 0.634
Estimated half-life:
The N-terminal of the sequence considered is M (Met).
The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
>20 hours (yeast, in vivo).
>10 hours (Escherichia coli, in vivo).
Instability index:
The instability index (II) is computed to be 39.86
This classifies the protein as stable.
Aliphatic index: 138.11
Grand average of hydropathicity (GRAVY): 0.683| Model | Percentage of MP Gene |
|---|---|
| Coiled Coil (CCP) | 0 |
| Disordered (SEG) | 11 |
| Transmembrane (PHDhtm) | 42 |
| Transmembrane (TMHMM) | 36 |
| Homologous to known structure (PSIBLAST) | 0 |
Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary
10 20 30 40 50 60 70 80
| | | | | | | |
Sequence MKVIKKLNLLNELTMLVCIFFFVCTISLIGIGIMYDLINRTSLSAPRRDPIFRNLNTVLIVLGVLEILLMVAQLVMSNMA
CCP --------------------------------------------------------------------------------
SEG ---------------------------------------------------------DDDDDDDDDDDDDD---------
PHDhtm -------------HHHHHHHHHHHHHHHHHHHHHHHH------------------HHHHHHHHHHHHHHHHHHHHHHH--
TMHMM ------------HHHHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHHHHHHH--------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary -HHHHHH-----HHHEEEEEEEEEEEEEEE-EEEH-HH-------------H----HHHHHHHHHHHHHHHHHHHHHHHH
90 100 110 120
| | | |
Sequence ANIINEVAENVEQKFAKALKWSRFLPFGLLQLYCYHKIKLVTQTDNI
CCP -----------------------------------------------
SEG -----------------------------------------------
PHDhtm ---------------------------HHHHHH--------------
TMHMM -----------------------------------------------
PSIBLAST -----------------------------------------------
Pred2ary HHHHHHHHHHHHHHHHHHH---------HHHHHHEEEEEEEEE----