|
|
Available sections:
| Target | log10(PSIBLAST E) | % identical | % coverage (of MP gene) | Latest Status |
|---|---|---|---|---|
| 1180B | -55.52 | 100.00 | 100.00 | Solubility tested (S) |
| 1262B | -9.00 | 78.57 | 16.87 | Purified |
| 1388B | -53.15 | 51.33 | 30.93 | Selected |
| 1402B | -55.00 | 57.69 | 23.04 | Expression tested |
| 1604T | -32.70 | 39.80 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1605T | -37.30 | 40.54 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
| 1606T | -9.00 | 62.96 | 18.49 | Work stopped (uncertainty in predicting domain boundaries) |
| 1618T | -40.52 | 46.95 | 99.39 | Work stopped (uncertainty in predicting domain boundaries) |
| 1619T | -9.00 | 81.48 | 18.49 | Work stopped (uncertainty in predicting domain boundaries) |
| 1662T | -9.05 | 70.37 | 16.56 | Work stopped (uncertainty in predicting domain boundaries) |
| 1665T | -9.40 | 74.07 | 16.77 | Work stopped (uncertainty in predicting domain boundaries) |
| 1669T | -8.30 | 55.56 | 18.62 | Work stopped (uncertainty in predicting domain boundaries) |
| 1672T | -9.30 | 85.19 | 16.56 | Work stopped (uncertainty in predicting domain boundaries) |
| 1677T | -9.15 | 74.07 | 17.76 | Work stopped (uncertainty in predicting domain boundaries) |
| 1680T | -9.05 | 77.78 | 16.67 | Work stopped (uncertainty in predicting domain boundaries) |
| 1684T | -9.52 | 77.78 | 16.98 | Work stopped (uncertainty in predicting domain boundaries) |
| 1685T | -22.70 | 57.14 | 93.81 | Work stopped (uncertainty in predicting domain boundaries) |
| 1689T | -9.52 | 62.96 | 16.98 | Work stopped (uncertainty in predicting domain boundaries) |
ATGATGACTCAAGGTCCTAATCTAATTGGTATCCATACTAATACCAAGGAAAATGAAGAAACGCAAAAGTTTGTTAACTG GTTCTTAAATACCAGTCTAACTTGAGATAACAACGAAAGCAAAACACCAGCGCAGTACTTTACCGAGTCTGCTTCTTACA TCCTGCCTTTAAAAGAAACTTTTACCGGCTCTAACAACAAAGGTCAAAGCGGCAAGAATGACGGTAAGAATTCCAATAAC ACCTTTAAAGCCAAAGCACTAGAGCTCTTTAAGGAACAGAGCGAAAACAAAATAGTGGGTTACAGTGATCCAAGCGATTT CAGGGGTGGTAAATTCCGCGAATCAATTGGTTCTGCCTTTAACGCCACGGTCAATTCACACGTTGACTTTAATACTTTCG TTGCTAACTTCCGTGCTAACTTAGGTTCTGGTTACGATAAATAG
(148 codons) fields: [triplet] [frequency: per thousand] ([number]) UUU47.3( 7) UCU33.8( 5) UAU0.0( 0) UGU0.0( 0) UUC33.8( 5) UCC6.8( 1) UAC27.0( 4) UGC0.0( 0) UUA20.3( 3) UCA13.5( 2) UAA0.0( 0) UGA6.8( 1) UUG0.0( 0) UCG0.0( 0) UAG6.8( 1) UGG6.8( 1) CUU0.0( 0) CCU13.5( 2) CAU6.8( 1) CGU6.8( 1) CUC6.8( 1) CCC0.0( 0) CAC6.8( 1) CGC6.8( 1) CUA20.3( 3) CCA13.5( 2) CAA20.3( 3) CGA0.0( 0) CUG6.8( 1) CCG0.0( 0) CAG13.5( 2) CGG0.0( 0) AUU13.5( 2) ACU33.8( 5) AAU60.8( 9) AGU13.5( 2) AUC13.5( 2) ACC33.8( 5) AAC67.6( 10) AGC27.0( 4) AUA6.8( 1) ACA6.8( 1) AAA54.1( 8) AGA0.0( 0) AUG13.5( 2) ACG13.5( 2) AAG33.8( 5) AGG6.8( 1) GUU20.3( 3) GCU20.3( 3) GAU27.0( 4) GGU67.6( 10) GUC6.8( 1) GCC20.3( 3) GAC13.5( 2) GGC13.5( 2) GUA0.0( 0) GCA6.8( 1) GAA54.1( 8) GGA0.0( 0) GUG6.8( 1) GCG6.8( 1) GAG13.5( 2) GGG0.0( 0)
Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")
atgatgactcaaggtcctaatctaattggtatccatactaataccaaggaaaatgaagaa M M T Q G P N L I G I H T N T K E N E E acgcaaaagtttgttaactggttcttaaataccagtctaacttgagataacaacgaaagc T Q K F V N W F L N T S L T * D N N E S aaaacaccagcgcagtactttaccgagtctgcttcttacatcctgcctttaaaagaaact K T P A Q Y F T E S A S Y I L P L K E T tttaccggctctaacaacaaaggtcaaagcggcaagaatgacggtaagaattccaataac F T G S N N K G Q S G K N D G K N S N N acctttaaagccaaagcactagagctctttaaggaacagagcgaaaacaaaatagtgggt T F K A K A L E L F K E Q S E N K I V G tacagtgatccaagcgatttcaggggtggtaaattccgcgaatcaattggttctgccttt Y S D P S D F R G G K F R E S I G S A F aacgccacggtcaattcacacgttgactttaatactttcgttgctaacttccgtgctaac N A T V N S H V D F N T F V A N F R A N ttaggttctggttacgataaatag L G S G Y D K *
MMTQGPNLIGIHTNTKENEETQKFVNWFLNTSLTWDNNESKTPAQYFTESASYILPLKETFTGSNNKGQSGKNDGKNSNN TFKAKALELFKEQSENKIVGYSDPSDFRGGKFRESIGSAFNATVNSHVDFNTFVANFRANLGSGYDK
This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.
Number of amino acids: 147
Molecular weight: 16412.9
Theoretical pI: 6.74
Amino acid composition:
Ala (A) 8 5.4%
Arg (R) 3 2.0%
Asn (N) 19 12.9%
Asp (D) 6 4.1%
Cys (C) 0 0.0%
Gln (Q) 5 3.4%
Glu (E) 10 6.8%
Gly (G) 12 8.2%
His (H) 2 1.4%
Ile (I) 5 3.4%
Leu (L) 8 5.4%
Lys (K) 13 8.8%
Met (M) 2 1.4%
Phe (F) 12 8.2%
Pro (P) 4 2.7%
Ser (S) 14 9.5%
Thr (T) 13 8.8%
Trp (W) 2 1.4%
Tyr (Y) 4 2.7%
Val (V) 5 3.4%
Asx (B) 0 0.0%
Glx (Z) 0 0.0%
Xaa (X) 0 0.0%
Total number of negatively charged residues (Asp + Glu): 16
Total number of positively charged residues (Arg + Lys): 16
Atomic composition:
Carbon C 724
Hydrogen H 1097
Nitrogen N 199
Oxygen O 235
Sulfur S 2
Formula: C724H1097N199O235S2
Total number of atoms: 2257
Extinction coefficients:
Conditions: 6.0 M guanidium hydrochloride
0.02 M phosphate buffer
pH 6.5
Extinction coefficients are in units of M-1 cm-1 .
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 16600 16800 16700 16500 16000
Abs 0.1% (=1 g/l) 1.011 1.024 1.017 1.005 0.975
Estimated half-life:
The N-terminal of the sequence considered is M (Met).
The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
>20 hours (yeast, in vivo).
>10 hours (Escherichia coli, in vivo).
Instability index:
The instability index (II) is computed to be 19.55
This classifies the protein as stable.
Aliphatic index: 49.80
Grand average of hydropathicity (GRAVY): -0.839| Model | Percentage of MP Gene |
|---|---|
| Coiled Coil (CCP) | 0 |
| Disordered (SEG) | 0 |
| Transmembrane (PHDhtm) | 0 |
| Transmembrane (TMHMM) | 0 |
| Homologous to known structure (PSIBLAST) | 0 |
Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary
10 20 30 40 50 60 70 80
| | | | | | | |
Sequence MMTQGPNLIGIHTNTKENEETQKFVNWFLNTSLTWDNNESKTPAQYFTESASYILPLKETFTGSNNKGQSGKNDGKNSNN
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary -------EEEEE------HHHHHHHHHHH--------------HHHHH-----E----H---------------------
90 100 110 120 130 140
| | | | | |
Sequence TFKAKALELFKEQSENKIVGYSDPSDFRGGKFRESIGSAFNATVNSHVDFNTFVANFRANLGSGYDK
CCP -------------------------------------------------------------------
SEG -------------------------------------------------------------------
PHDhtm -------------------------------------------------------------------
TMHMM -------------------------------------------------------------------
PSIBLAST -------------------------------------------------------------------
Pred2ary HHHHHHHHHHHHH----EEEE-----------HHHHHHHHHHH------HHHHHHHHHHHH------