|
|
Available sections:
| Target | log10(PSIBLAST E) | % identical | % coverage (of MP gene) | Latest Status |
|---|---|---|---|---|
| 1001B | -45.70 | 32.04 | 69.36 | Suggested work stop (1192 1192 4957 4981 4981 5931 5931 6021 6021 6116 6116 7498 7960 7960 8131 8131 9084 9084 13286 13286 13830 13830) |
| 1741T | -111.00 | 100.00 | 100.00 | Work stopped (uncertainty in predicting domain boundaries) |
ATGCGATACTTTGCCCTCACTGATTTAACCAAGCCCACTGCAAAATTTACCACTGATGCCATGGAACTGCGTCAGATATC CAATGCTTACCTCAACCAAGCCCAAGCATATTTGCAAAAGGGTTTAAAGCAACTCAAAAAGGATTACAAGAATGCCATTT TGTATACCCCCAAAACGGAGTACAAGCGCTTTTTGAAATGGAAGCAAACTTTTCTTCAAGATTTAAACCAAACCCAAAAG CGCTTCTTTATTGTTCGTGCCCAACATTTTTCTTACGTGTTGTTGTTTAACTTATTGGATGAACAGGTGGAAGCGGTGAT TGCGACCTTTAACAACTTCTTGGATGAACACCGCTTGGAGGCACAATCCAAGCAGTTTGATTTAGACACAGCTGTCAATG AACTGCATGATTACTTTGACCAACTGCAAAAGAACACCGCTTACAGCGAGGATTTACCAACTCACCTCACGCAAAAAACC GAGCAATTAATTAACGCTCGTAATACCCAGTTAACGAATTTGTTAAACAAGATTGCTACTACCAAACCACCCCTAAACAA ACAGCAACGTTTGTTAGCCAGTTTTCGCAATTACCACGAACACCTCTTTTTAAAGAACGAGGTCAAAAAGGTGACTTGGT TAAACGAACCACGCGCCAAAAAAGAGTCTGTTACCCCTGATGAAGAACACATTATTGAACTTAAAAACGTTTACAAGTAC ATTACTAATGGGGTGACTACCAATGCCGTTTTAAAGGGCATAGATCTCAAACTCAAAGCCCATGATTTCATTGTGATTTT AGGTCCCTCTGGCTCGGGGAAAACGACCCTGCTCAACATTATTTCCGGGATGGACCGGCCATCAAGCGGATCTGTTGTCG TTAATGGTCAGGAAATGATTTGCATGAACGACCGTCAGTTAACTAACTTTCGCCGCAACTATGTCGGTTACATCTTCCAG CAGTACGGGCTTTTACCCAACCTAACTGTAAGGGAAAACGTTGAAGTGGGTGCTAACTTGCAACGTAACCCTGATAAGCG GATTAACATTGATGAACTGTTAGAAGCAGTAGGGATGAAACACCTCCAAAAGAAACTCCCCAACGAGTTGAGTGGTGGAC AACAACAACGGGTCTCGATTGCACGGGCGTTTGCTAAAAATCCCTTACTCATTTTTGGTGATGAACCAACTGGGGCTTTG GATTTGGAAATGACCCAAATTGTGCTCAAACAGTTTTTAGCCATTAAACAACGCTATAAAACCACGATGGTCATAGTGAC GCACAATAACCTAATTGCCCAACTAGCCGATTTAGTCATTTATGTGGCAGATGGCAAAATTCAAGCACTGCAAGCAAACC CTAATCCCAAACAGGTTGAGGACATTAACTGAATTTAG
(466 codons) fields: [triplet] [frequency: per thousand] ([number]) UUU34.3( 16) UCU8.6( 4) UAU10.7( 5) UGU0.0( 0) UUC8.6( 4) UCC6.4( 3) UAC25.8( 12) UGC2.1( 1) UUA42.9( 20) UCA2.1( 1) UAA0.0( 0) UGA2.1( 1) UUG30.0( 14) UCG4.3( 2) UAG2.1( 1) UGG4.3( 2) CUU6.4( 3) CCU6.4( 3) CAU6.4( 3) CGU12.9( 6) CUC25.8( 12) CCC17.2( 8) CAC15.0( 7) CGC17.2( 8) CUA8.6( 4) CCA10.7( 5) CAA53.6( 25) CGA2.1( 1) CUG12.9( 6) CCG0.0( 0) CAG23.6( 11) CGG8.6( 4) AUU51.5( 24) ACU25.8( 12) AAU25.8( 12) AGU4.3( 2) AUC2.1( 1) ACC30.0( 14) AAC53.6( 25) AGC4.3( 2) AUA6.4( 3) ACA2.1( 1) AAA47.2( 22) AGA0.0( 0) AUG17.2( 8) ACG12.9( 6) AAG36.5( 17) AGG2.1( 1) GUU17.2( 8) GCU17.2( 8) GAU38.6( 18) GGU15.0( 7) GUC15.0( 7) GCC25.8( 12) GAC10.7( 5) GGC6.4( 3) GUA4.3( 2) GCA17.2( 8) GAA36.5( 17) GGA4.3( 2) GUG21.5( 10) GCG6.4( 3) GAG17.2( 8) GGG12.9( 6)
Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")
atgcgatactttgccctcactgatttaaccaagcccactgcaaaatttaccactgatgcc M R Y F A L T D L T K P T A K F T T D A atggaactgcgtcagatatccaatgcttacctcaaccaagcccaagcatatttgcaaaag M E L R Q I S N A Y L N Q A Q A Y L Q K ggtttaaagcaactcaaaaaggattacaagaatgccattttgtatacccccaaaacggag G L K Q L K K D Y K N A I L Y T P K T E tacaagcgctttttgaaatggaagcaaacttttcttcaagatttaaaccaaacccaaaag Y K R F L K W K Q T F L Q D L N Q T Q K cgcttctttattgttcgtgcccaacatttttcttacgtgttgttgtttaacttattggat R F F I V R A Q H F S Y V L L F N L L D gaacaggtggaagcggtgattgcgacctttaacaacttcttggatgaacaccgcttggag E Q V E A V I A T F N N F L D E H R L E gcacaatccaagcagtttgatttagacacagctgtcaatgaactgcatgattactttgac A Q S K Q F D L D T A V N E L H D Y F D caactgcaaaagaacaccgcttacagcgaggatttaccaactcacctcacgcaaaaaacc Q L Q K N T A Y S E D L P T H L T Q K T gagcaattaattaacgctcgtaatacccagttaacgaatttgttaaacaagattgctact E Q L I N A R N T Q L T N L L N K I A T accaaaccacccctaaacaaacagcaacgtttgttagccagttttcgcaattaccacgaa T K P P L N K Q Q R L L A S F R N Y H E cacctctttttaaagaacgaggtcaaaaaggtgacttggttaaacgaaccacgcgccaaa H L F L K N E V K K V T W L N E P R A K aaagagtctgttacccctgatgaagaacacattattgaacttaaaaacgtttacaagtac K E S V T P D E E H I I E L K N V Y K Y attactaatggggtgactaccaatgccgttttaaagggcatagatctcaaactcaaagcc I T N G V T T N A V L K G I D L K L K A catgatttcattgtgattttaggtccctctggctcggggaaaacgaccctgctcaacatt H D F I V I L G P S G S G K T T L L N I atttccgggatggaccggccatcaagcggatctgttgtcgttaatggtcaggaaatgatt I S G M D R P S S G S V V V N G Q E M I tgcatgaacgaccgtcagttaactaactttcgccgcaactatgtcggttacatcttccag C M N D R Q L T N F R R N Y V G Y I F Q cagtacgggcttttacccaacctaactgtaagggaaaacgttgaagtgggtgctaacttg Q Y G L L P N L T V R E N V E V G A N L caacgtaaccctgataagcggattaacattgatgaactgttagaagcagtagggatgaaa Q R N P D K R I N I D E L L E A V G M K cacctccaaaagaaactccccaacgagttgagtggtggacaacaacaacgggtctcgatt H L Q K K L P N E L S G G Q Q Q R V S I gcacgggcgtttgctaaaaatcccttactcatttttggtgatgaaccaactggggctttg A R A F A K N P L L I F G D E P T G A L gatttggaaatgacccaaattgtgctcaaacagtttttagccattaaacaacgctataaa D L E M T Q I V L K Q F L A I K Q R Y K accacgatggtcatagtgacgcacaataacctaattgcccaactagccgatttagtcatt T T M V I V T H N N L I A Q L A D L V I tatgtggcagatggcaaaattcaagcactgcaagcaaaccctaatcccaaacaggttgag Y V A D G K I Q A L Q A N P N P K Q V E gacattaactgaatttag D I N * I *
MRYFALTDLTKPTAKFTTDAMELRQISNAYLNQAQAYLQKGLKQLKKDYKNAILYTPKTEYKRFLKWKQTFLQDLNQTQK RFFIVRAQHFSYVLLFNLLDEQVEAVIATFNNFLDEHRLEAQSKQFDLDTAVNELHDYFDQLQKNTAYSEDLPTHLTQKT EQLINARNTQLTNLLNKIATTKPPLNKQQRLLASFRNYHEHLFLKNEVKKVTWLNEPRAKKESVTPDEEHIIELKNVYKY ITNGVTTNAVLKGIDLKLKAHDFIVILGPSGSGKTTLLNIISGMDRPSSGSVVVNGQEMICMNDRQLTNFRRNYVGYIFQ QYGLLPNLTVRENVEVGANLQRNPDKRINIDELLEAVGMKHLQKKLPNELSGGQQQRVSIARAFAKNPLLIFGDEPTGAL DLEMTQIVLKQFLAIKQRYKTTMVIVTHNNLIAQLADLVIYVADGKIQALQANPNPKQVEDINWI
This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.
Number of amino acids: 465
Molecular weight: 53511.5
Theoretical pI: 9.30
Amino acid composition:
Ala (A) 31 6.7%
Arg (R) 20 4.3%
Asn (N) 37 8.0%
Asp (D) 23 4.9%
Cys (C) 1 0.2%
Gln (Q) 36 7.7%
Glu (E) 25 5.4%
Gly (G) 18 3.9%
His (H) 10 2.2%
Ile (I) 28 6.0%
Leu (L) 59 12.7%
Lys (K) 39 8.4%
Met (M) 8 1.7%
Phe (F) 20 4.3%
Pro (P) 16 3.4%
Ser (S) 14 3.0%
Thr (T) 33 7.1%
Trp (W) 3 0.6%
Tyr (Y) 17 3.7%
Val (V) 27 5.8%
Asx (B) 0 0.0%
Glx (Z) 0 0.0%
Xaa (X) 0 0.0%
Total number of negatively charged residues (Asp + Glu): 48
Total number of positively charged residues (Arg + Lys): 59
Atomic composition:
Carbon C 2408
Hydrogen H 3842
Nitrogen N 660
Oxygen O 699
Sulfur S 9
Formula: C2408H3842N660O699S9
Total number of atoms: 7618
Extinction coefficients:
Conditions: 6.0 M guanidium hydrochloride
0.02 M phosphate buffer
pH 6.5
Extinction coefficients are in units of M-1 cm-1 .
The first table lists values computed assuming ALL Cys
residues appear as half cystines, whereas the second table
assumes that NONE do.
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 40850 40600 39845 38830 37200
Abs 0.1% (=1 g/l) 0.763 0.759 0.745 0.726 0.695
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 40850 40600 39845 38830 37200
Abs 0.1% (=1 g/l) 0.763 0.759 0.745 0.726 0.695
Estimated half-life:
The N-terminal of the sequence considered is M (Met).
The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
>20 hours (yeast, in vivo).
>10 hours (Escherichia coli, in vivo).
Instability index:
The instability index (II) is computed to be 27.55
This classifies the protein as stable.
Aliphatic index: 96.47
Grand average of hydropathicity (GRAVY): -0.422| Model | Percentage of MP Gene |
|---|---|
| Coiled Coil (CCP) | 0 |
| Disordered (SEG) | 0 |
| Transmembrane (PHDhtm) | 0 |
| Transmembrane (TMHMM) | 0 |
| Homologous to known structure (PSIBLAST) | 98 |
Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary
10 20 30 40 50 60 70 80
| | | | | | | |
Sequence MRYFALTDLTKPTAKFTTDAMELRQISNAYLNQAQAYLQKGLKQLKKDYKNAILYTPKTEYKRFLKWKQTFLQDLNQTQK
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST ----3333333333333333333333333333333333333333333333333333333333333333333333333333
Pred2ary --HHHHHHHHH----------EEEE-------------HHHHHHH----EEEEE-----------HHHHHHHHHHHH---
90 100 110 120 130 140 150 160
| | | | | | | |
Sequence RFFIVRAQHFSYVLLFNLLDEQVEAVIATFNNFLDEHRLEAQSKQFDLDTAVNELHDYFDQLQKNTAYSEDLPTHLTQKT
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST 33333333333333333333333333333333333333333333333333333333333333333333333333333333
Pred2ary ---EEE-----HHHHHHHHHHHHHHH------------------HHHHHHHHHHHH------------HHHHHHHHHHHH
170 180 190 200 210 220 230 240
| | | | | | | |
Sequence EQLINARNTQLTNLLNKIATTKPPLNKQQRLLASFRNYHEHLFLKNEVKKVTWLNEPRAKKESVTPDEEHIIELKNVYKY
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST 33333333333333333333333333333333333333333333333333333333333333333333333333333333
Pred2ary HHHHHH---EEEEEE--------EEEEE---EEEE---------------EE------------------EEEEEEEEEE
250 260 270 280 290 300 310 320
| | | | | | | |
Sequence ITNGVTTNAVLKGIDLKLKAHDFIVILGPSGSGKTTLLNIISGMDRPSSGSVVVNGQEMICMNDRQLTNFRRNYVGYIFQ
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST 33333333333333333333333333333333333333333333333333333333333333333333333333333333
Pred2ary E------EEEEE-EEEEE----EEEEE-------HHHHHHHHH-------EEEE--------------H-----EEEEEE
330 340 350 360 370 380 390 400
| | | | | | | |
Sequence QYGLLPNLTVRENVEVGANLQRNPDKRINIDELLEAVGMKHLQKKLPNELSGGQQQRVSIARAFAKNPLLIFGDEPTGAL
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST 33333333333333333333333333333333333333333333333333333333333333333333333333333333
Pred2ary --------HHHHHHHH-------HHHHHHHHHHHHHH---HHH----------HHHHHHHHHHHH----EEEE-------
410 420 430 440 450 460
| | | | | |
Sequence DLEMTQIVLKQFLAIKQRYKTTMVIVTHNNLIAQLADLVIYVADGKIQALQANPNPKQVEDINWI
CCP -----------------------------------------------------------------
SEG -----------------------------------------------------------------
PHDhtm -----------------------------------------------------------------
TMHMM -----------------------------------------------------------------
PSIBLAST 33333333333333333333333333333333-----------3333333333333333333333
Pred2ary HHHHHHHHHHHHHHHHHH---EEEEE--HHHHHHH---EEEE---EEEE----------------