|
|
Available sections:
| Target | log10(PSIBLAST E) | % identical | % coverage (of MP gene) | Latest Status |
|---|---|---|---|---|
| 1012B | -12.10 | 14.66 | 92.43 | Override suggest work stop (low sequencial homology to the known structures) |
| 1330B | -5.40 | 13.26 | 29.38 | Purified |
ATGAACAAAAATAGTGGTAAAGACTACGAATCGATTCAAGGTAAGATTAAGAGCAAAGCTGAAATAGAAGATTCAGTTAT ATTGGATCAAGACACTTTTCAAGGCAAGGGTGAAGAAAAAACTAAATTAGATGAGTTGATTAGGTGAGAGGCCGAGAAAA ATGACTTACTCAAGTTAATTGACAACGTCGGTAATGACGAAGTTTTCACTCCCGTAGAAACTTGTCAAAGAATGCTCGAT GAGCTCTTTCCTGAAGATCATGAAGTTTGATCAAATCCCGATCTCAAGTGATTAAATCCGTGTGATAAAAACGGAGTTTT CTTTCGCGAAATTGCGCTAAGACTTGACAAAGGATTAACCAAAATTATTCCCGACGAATATGCCCGCAAGAAGCATATTA TGACCAAGATGCTTTTTAGCATTGGCTTAACTAAATTTACTAGTTTGATGGTGCGTCGCACGCTTTACTATTGCATTAAA GCTAACAAGCGTAAAACCAGCGAAGATGAGGGTTGTGCAATTGCAAATGGCGCACGATTTAATAATGAATTTGGCAATGT TGTTACCCCTTATAAAGAACACTATTTTGGAAAAGAATCGAAAAGCAAAAACTGTCACTTTTGTGGGACAAATAAGAATT CTAAATATGTAAATTCTATGACTGAAGAAAAGTATGCCTATGACTTCATTCATTTAAATTCGGATGAATATGAAAACTAC TTTAAAACTAATTTTGGTGTTATGAAATTTGATGTAATTATTGGAAATCCGCCTTACCAGCTATCTAATGGTAGCGGTTC TGATGGCAATGGAGCCAAAGCAATTTTTCAAGATTTTGTTTTAAAAGCAATTGACTTAGAACCAAAATATTTAGCGATGA TTATCCCAGCAAAATGAATGATTTCTGCTGAAAATATTTTTTTAAATCTAAGAGATAAATTAAAGAAAAATAAAGATATA AAAGAAATTAATATTTTTTTCGACTCTAAAGACTGTTTTCCAAAGAGAGAAATAAAAGGTGGCATTTGTTACTTTATCTG GCAGAATAACTATCAAGGTAAGACTCTTATAAATACCAAATTTTCAAAGAAAGGAAAAAGCACAGAGATCGATAGTTCTA AAAGAAACTTATTTGTTTCCGTTTTAAACAACACTGAACAAATCATTTTTCGCAAAAGGATTTGACAAGACATTTACACA AAGATTGATTCCATCTCTGCTAGTGAAATGAAGAACACACACACAGAGAGAGAGCAATTTAGAAGTTAG
(423 codons) fields: [triplet] [frequency: per thousand] ([number]) UUU52.0( 22) UCU18.9( 8) UAU23.6( 10) UGU16.5( 7) UUC9.5( 4) UCC4.7( 2) UAC14.2( 6) UGC2.4( 1) UUA33.1( 14) UCA7.1( 3) UAA0.0( 0) UGA11.8( 5) UUG7.1( 3) UCG7.1( 3) UAG2.4( 1) UGG2.4( 1) CUU9.5( 4) CCU7.1( 3) CAU7.1( 3) CGU4.7( 2) CUC9.5( 4) CCC7.1( 3) CAC7.1( 3) CGC9.5( 4) CUA7.1( 3) CCA7.1( 3) CAA21.3( 9) CGA2.4( 1) CUG0.0( 0) CCG4.7( 2) CAG4.7( 2) CGG0.0( 0) AUU61.5( 26) ACU23.6( 10) AAU54.4( 23) AGU11.8( 5) AUC11.8( 5) ACC11.8( 5) AAC26.0( 11) AGC14.2( 6) AUA11.8( 5) ACA11.8( 5) AAA80.4( 34) AGA16.5( 7) AUG23.6( 10) ACG2.4( 1) AAG40.2( 17) AGG4.7( 2) GUU23.6( 10) GCU9.5( 4) GAU37.8( 16) GGU23.6( 10) GUC2.4( 1) GCC9.5( 4) GAC28.4( 12) GGC14.2( 6) GUA7.1( 3) GCA14.2( 6) GAA59.1( 25) GGA14.2( 6) GUG2.4( 1) GCG4.7( 2) GAG18.9( 8) GGG2.4( 1)
Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")
atgaacaaaaatagtggtaaagactacgaatcgattcaaggtaagattaagagcaaagct M N K N S G K D Y E S I Q G K I K S K A gaaatagaagattcagttatattggatcaagacacttttcaaggcaagggtgaagaaaaa E I E D S V I L D Q D T F Q G K G E E K actaaattagatgagttgattaggtgagaggccgagaaaaatgacttactcaagttaatt T K L D E L I R * E A E K N D L L K L I gacaacgtcggtaatgacgaagttttcactcccgtagaaacttgtcaaagaatgctcgat D N V G N D E V F T P V E T C Q R M L D gagctctttcctgaagatcatgaagtttgatcaaatcccgatctcaagtgattaaatccg E L F P E D H E V * S N P D L K * L N P tgtgataaaaacggagttttctttcgcgaaattgcgctaagacttgacaaaggattaacc C D K N G V F F R E I A L R L D K G L T aaaattattcccgacgaatatgcccgcaagaagcatattatgaccaagatgctttttagc K I I P D E Y A R K K H I M T K M L F S attggcttaactaaatttactagtttgatggtgcgtcgcacgctttactattgcattaaa I G L T K F T S L M V R R T L Y Y C I K gctaacaagcgtaaaaccagcgaagatgagggttgtgcaattgcaaatggcgcacgattt A N K R K T S E D E G C A I A N G A R F aataatgaatttggcaatgttgttaccccttataaagaacactattttggaaaagaatcg N N E F G N V V T P Y K E H Y F G K E S aaaagcaaaaactgtcacttttgtgggacaaataagaattctaaatatgtaaattctatg K S K N C H F C G T N K N S K Y V N S M actgaagaaaagtatgcctatgacttcattcatttaaattcggatgaatatgaaaactac T E E K Y A Y D F I H L N S D E Y E N Y tttaaaactaattttggtgttatgaaatttgatgtaattattggaaatccgccttaccag F K T N F G V M K F D V I I G N P P Y Q ctatctaatggtagcggttctgatggcaatggagccaaagcaatttttcaagattttgtt L S N G S G S D G N G A K A I F Q D F V ttaaaagcaattgacttagaaccaaaatatttagcgatgattatcccagcaaaatgaatg L K A I D L E P K Y L A M I I P A K * M atttctgctgaaaatatttttttaaatctaagagataaattaaagaaaaataaagatata I S A E N I F L N L R D K L K K N K D I aaagaaattaatatttttttcgactctaaagactgttttccaaagagagaaataaaaggt K E I N I F F D S K D C F P K R E I K G ggcatttgttactttatctggcagaataactatcaaggtaagactcttataaataccaaa G I C Y F I W Q N N Y Q G K T L I N T K ttttcaaagaaaggaaaaagcacagagatcgatagttctaaaagaaacttatttgtttcc F S K K G K S T E I D S S K R N L F V S gttttaaacaacactgaacaaatcatttttcgcaaaaggatttgacaagacatttacaca V L N N T E Q I I F R K R I * Q D I Y T aagattgattccatctctgctagtgaaatgaagaacacacacacagagagagagcaattt K I D S I S A S E M K N T H T E R E Q F agaagttag R S *
MNKNSGKDYESIQGKIKSKAEIEDSVILDQDTFQGKGEEKTKLDELIRWEAEKNDLLKLIDNVGNDEVFTPVETCQRMLD ELFPEDHEVWSNPDLKWLNPCDKNGVFFREIALRLDKGLTKIIPDEYARKKHIMTKMLFSIGLTKFTSLMVRRTLYYCIK ANKRKTSEDEGCAIANGARFNNEFGNVVTPYKEHYFGKESKSKNCHFCGTNKNSKYVNSMTEEKYAYDFIHLNSDEYENY FKTNFGVMKFDVIIGNPPYQLSNGSGSDGNGAKAIFQDFVLKAIDLEPKYLAMIIPAKWMISAENIFLNLRDKLKKNKDI KEINIFFDSKDCFPKREIKGGICYFIWQNNYQGKTLINTKFSKKGKSTEIDSSKRNLFVSVLNNTEQIIFRKRIWQDIYT KIDSISASEMKNTHTEREQFRS
This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.
Number of amino acids: 422
Molecular weight: 49061.9
Theoretical pI: 8.62
Amino acid composition:
Ala (A) 16 3.8%
Arg (R) 16 3.8%
Asn (N) 34 8.1%
Asp (D) 28 6.6%
Cys (C) 8 1.9%
Gln (Q) 11 2.6%
Glu (E) 33 7.8%
Gly (G) 23 5.5%
His (H) 6 1.4%
Ile (I) 36 8.5%
Leu (L) 28 6.6%
Lys (K) 51 12.1%
Met (M) 10 2.4%
Phe (F) 26 6.2%
Pro (P) 11 2.6%
Ser (S) 27 6.4%
Thr (T) 21 5.0%
Trp (W) 6 1.4%
Tyr (Y) 16 3.8%
Val (V) 15 3.6%
Asx (B) 0 0.0%
Glx (Z) 0 0.0%
Xaa (X) 0 0.0%
Total number of negatively charged residues (Asp + Glu): 61
Total number of positively charged residues (Arg + Lys): 67
Atomic composition:
Carbon C 2197
Hydrogen H 3426
Nitrogen N 584
Oxygen O 654
Sulfur S 18
Formula: C2197H3426N584O654S18
Total number of atoms: 6879
Extinction coefficients:
Conditions: 6.0 M guanidium hydrochloride
0.02 M phosphate buffer
pH 6.5
Extinction coefficients are in units of M-1 cm-1 .
The first table lists values computed assuming ALL Cys
residues appear as half cystines, whereas the second table
assumes that NONE do.
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 56180 56508 55960 55100 53280
Abs 0.1% (=1 g/l) 1.145 1.152 1.141 1.123 1.086
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 55600 56000 55480 54620 52800
Abs 0.1% (=1 g/l) 1.133 1.141 1.131 1.113 1.076
Estimated half-life:
The N-terminal of the sequence considered is M (Met).
The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
>20 hours (yeast, in vivo).
>10 hours (Escherichia coli, in vivo).
Instability index:
The instability index (II) is computed to be 34.40
This classifies the protein as stable.
Aliphatic index: 73.25
Grand average of hydropathicity (GRAVY): -0.660| Model | Percentage of MP Gene |
|---|---|
| Coiled Coil (CCP) | 0 |
| Disordered (SEG) | 5 |
| Transmembrane (PHDhtm) | 0 |
| Transmembrane (TMHMM) | 0 |
| Homologous to known structure (PSIBLAST) | 0 |
Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary
10 20 30 40 50 60 70 80
| | | | | | | |
Sequence MNKNSGKDYESIQGKIKSKAEIEDSVILDQDTFQGKGEEKTKLDELIRWEAEKNDLLKLIDNVGNDEVFTPVETCQRMLD
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ---------------EE---E----EEE-------------HHHHHHHHHH---HHHHHHH----------HHHHHHHHH
90 100 110 120 130 140 150 160
| | | | | | | |
Sequence ELFPEDHEVWSNPDLKWLNPCDKNGVFFREIALRLDKGLTKIIPDEYARKKHIMTKMLFSIGLTKFTSLMVRRTLYYCIK
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary H------------------------HHHHHHHHHHHH-------HHHHHHHHHHHHHHEEE----HHHHHHHHHHHHHHH
170 180 190 200 210 220 230 240
| | | | | | | |
Sequence ANKRKTSEDEGCAIANGARFNNEFGNVVTPYKEHYFGKESKSKNCHFCGTNKNSKYVNSMTEEKYAYDFIHLNSDEYENY
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary H------------E--------------------EE---------EE-------EEEE-----------------HHHHH
250 260 270 280 290 300 310 320
| | | | | | | |
Sequence FKTNFGVMKFDVIIGNPPYQLSNGSGSDGNGAKAIFQDFVLKAIDLEPKYLAMIIPAKWMISAENIFLNLRDKLKKNKDI
CCP --------------------------------------------------------------------------------
SEG ------------------------------------------------------------------DDDDDDDDDDDDDD
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary HH----EEEEEEEE-------------------HHHHHHHHHHHH-----EEEEE--------HHHHHHHHHHHHHH---
330 340 350 360 370 380 390 400
| | | | | | | |
Sequence KEINIFFDSKDCFPKREIKGGICYFIWQNNYQGKTLINTKFSKKGKSTEIDSSKRNLFVSVLNNTEQIIFRKRIWQDIYT
CCP --------------------------------------------------------------------------------
SEG DDDDDDD-------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary EEEEEE---------------EEEEEEE------EEE------------H---H----HHHHHHHHHHHHHHHHHHHHHH
410 420
| |
Sequence KIDSISASEMKNTHTEREQFRS
CCP ----------------------
SEG ----------------------
PHDhtm ----------------------
TMHMM ----------------------
PSIBLAST ----------------------
Pred2ary ----------------------