|
|
Available sections:
| Target | log10(PSIBLAST E) | % identical | % coverage (of MP gene) | Latest Status |
|---|---|---|---|---|
| 1104B | -58.52 | 34.32 | 90.45 | Override suggest work stop (low sequence homology to the known structures) |
| 1161B | -59.00 | 32.67 | 92.66 | Override suggest work stop (low sequence homology to the known structures) |
| 1227B | -55.52 | 30.17 | 91.33 | Override suggest work stop (false positive) |
| 1265B | -86.05 | 28.33 | 54.87 | Solubility tested (S) |
| 1319B | -80.70 | 27.24 | 54.21 | Purified |
| 1348B | -90.00 | 27.89 | 55.58 | Purified |
| 1359B | -48.22 | 12.30 | 95.41 | Override suggest work stop (false positive) |
| 1369B | -66.40 | 35.11 | 90.37 | Solubility tested (I) |
ATGGAACAACCTTTGTGTATCTTGGGAATTGAAACCACTTGTGATGACACTAGCATAGGGGTAATCACTGAGAGTAAGGT ACAAGCACACATTGTTTTATCATCCGCCAAATTACACGCCCAAACTGGTGGCGTAGTGCCCGAAGTTGCAGCTAGGAGTC ACGAACAAAACTTGCTTAAAGCACTGCAACAATCAGGGGTTGTTTTAGAACAAATTACTCACATTGCTTATGCCGCTAAT CCGGGATTACCCGGTTGTCTGCATGTGGGAGCTACCTTTGCGCGCAGTTTAAGCTTCTTATTGGACAAACCCTTATTGCC CATCAACCACCTTTATGCCCACATCTTCTCGGCCTTAATTGACCAAGATATTAATCAGTTAAAACTCCCCGCTTTAGGCT TAGTGGTGTCCGGGGGGCACACTGCCATTTACCTAATTAAGTCCTTGTTTGACTTGGAGTTAATTGCTGAGACTAGCGAT GATGCGATTGGCGAAGTTTATGATAAGGTGGGCCGCGCTATGGGTTTTCCTTATCCAGCTGGACCCCAGTTAGATAGTTT GTTTCAACCGGAGTTAGTTAAGTCACATTATTTCTTTCGTCCTTCCACGAAGTGAACGAAATTTTCCTACTCGGGGTTGA AGTCACAGTGTTTTACCAAGATTAAACAGTTAAGGGAAAGGAAAGGCTTTAATCCCCAAACCCACGATTGGAATGAGTTT GCTTCTAACTTCCAGGCCACCATTATTGATCACTACATCAATCATGTCAAGGATGCCATTCAGCAACACCAACCGCAAAT GTTGCTGTTAGGTGGTGGGGTAAGTGCTAATAAATACTTAAGAGAACAGGTAACACAGTTACAGCTGCCTTACTTAATTG CGCCGCTCAAGTACACGAGCGATAACGGGGCGATGATTGGTTTTTATGCTAATCTTTTAATTAATGGCAAAAACAATTAA
(320 codons) fields: [triplet] [frequency: per thousand] ([number]) UUU31.2( 10) UCU3.1( 1) UAU18.8( 6) UGU12.5( 4) UUC12.5( 4) UCC15.6( 5) UAC18.8( 6) UGC0.0( 0) UUA62.5( 20) UCA12.5( 4) UAA3.1( 1) UGA3.1( 1) UUG31.2( 10) UCG6.2( 2) UAG0.0( 0) UGG3.1( 1) CUU9.4( 3) CCU12.5( 4) CAU9.4( 3) CGU3.1( 1) CUC6.2( 2) CCC21.9( 7) CAC31.2( 10) CGC6.2( 2) CUA3.1( 1) CCA3.1( 1) CAA40.6( 13) CGA0.0( 0) CUG12.5( 4) CCG12.5( 4) CAG28.1( 9) CGG0.0( 0) AUU53.1( 17) ACU21.9( 7) AAU28.1( 9) AGU15.6( 5) AUC15.6( 5) ACC15.6( 5) AAC15.6( 5) AGC12.5( 4) AUA3.1( 1) ACA3.1( 1) AAA28.1( 9) AGA3.1( 1) AUG12.5( 4) ACG9.4( 3) AAG28.1( 9) AGG9.4( 3) GUU18.8( 6) GCU34.4( 11) GAU31.2( 10) GGU18.8( 6) GUC3.1( 1) GCC25.0( 8) GAC12.5( 4) GGC18.8( 6) GUA15.6( 5) GCA9.4( 3) GAA25.0( 8) GGA12.5( 4) GUG15.6( 5) GCG12.5( 4) GAG15.6( 5) GGG21.9( 7)
Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")
atggaacaacctttgtgtatcttgggaattgaaaccacttgtgatgacactagcataggg M E Q P L C I L G I E T T C D D T S I G gtaatcactgagagtaaggtacaagcacacattgttttatcatccgccaaattacacgcc V I T E S K V Q A H I V L S S A K L H A caaactggtggcgtagtgcccgaagttgcagctaggagtcacgaacaaaacttgcttaaa Q T G G V V P E V A A R S H E Q N L L K gcactgcaacaatcaggggttgttttagaacaaattactcacattgcttatgccgctaat A L Q Q S G V V L E Q I T H I A Y A A N ccgggattacccggttgtctgcatgtgggagctacctttgcgcgcagtttaagcttctta P G L P G C L H V G A T F A R S L S F L ttggacaaacccttattgcccatcaaccacctttatgcccacatcttctcggccttaatt L D K P L L P I N H L Y A H I F S A L I gaccaagatattaatcagttaaaactccccgctttaggcttagtggtgtccggggggcac D Q D I N Q L K L P A L G L V V S G G H actgccatttacctaattaagtccttgtttgacttggagttaattgctgagactagcgat T A I Y L I K S L F D L E L I A E T S D gatgcgattggcgaagtttatgataaggtgggccgcgctatgggttttccttatccagct D A I G E V Y D K V G R A M G F P Y P A ggaccccagttagatagtttgtttcaaccggagttagttaagtcacattatttctttcgt G P Q L D S L F Q P E L V K S H Y F F R ccttccacgaagtgaacgaaattttcctactcggggttgaagtcacagtgttttaccaag P S T K * T K F S Y S G L K S Q C F T K attaaacagttaagggaaaggaaaggctttaatccccaaacccacgattggaatgagttt I K Q L R E R K G F N P Q T H D W N E F gcttctaacttccaggccaccattattgatcactacatcaatcatgtcaaggatgccatt A S N F Q A T I I D H Y I N H V K D A I cagcaacaccaaccgcaaatgttgctgttaggtggtggggtaagtgctaataaatactta Q Q H Q P Q M L L L G G G V S A N K Y L agagaacaggtaacacagttacagctgccttacttaattgcgccgctcaagtacacgagc R E Q V T Q L Q L P Y L I A P L K Y T S gataacggggcgatgattggtttttatgctaatcttttaattaatggcaaaaacaattaa D N G A M I G F Y A N L L I N G K N N *
MEQPLCILGIETTCDDTSIGVITESKVQAHIVLSSAKLHAQTGGVVPEVAARSHEQNLLKALQQSGVVLEQITHIAYAAN PGLPGCLHVGATFARSLSFLLDKPLLPINHLYAHIFSALIDQDINQLKLPALGLVVSGGHTAIYLIKSLFDLELIAETSD DAIGEVYDKVGRAMGFPYPAGPQLDSLFQPELVKSHYFFRPSTKWTKFSYSGLKSQCFTKIKQLRERKGFNPQTHDWNEF ASNFQATIIDHYINHVKDAIQQHQPQMLLLGGGVSANKYLREQVTQLQLPYLIAPLKYTSDNGAMIGFYANLLINGKNN
This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.
Number of amino acids: 319
Molecular weight: 35210.4
Theoretical pI: 6.66
Amino acid composition:
Ala (A) 26 8.2%
Arg (R) 7 2.2%
Asn (N) 14 4.4%
Asp (D) 14 4.4%
Cys (C) 4 1.3%
Gln (Q) 22 6.9%
Glu (E) 13 4.1%
Gly (G) 23 7.2%
His (H) 13 4.1%
Ile (I) 23 7.2%
Leu (L) 40 12.5%
Lys (K) 18 5.6%
Met (M) 4 1.3%
Phe (F) 14 4.4%
Pro (P) 16 5.0%
Ser (S) 21 6.6%
Thr (T) 16 5.0%
Trp (W) 2 0.6%
Tyr (Y) 12 3.8%
Val (V) 17 5.3%
Asx (B) 0 0.0%
Glx (Z) 0 0.0%
Xaa (X) 0 0.0%
Total number of negatively charged residues (Asp + Glu): 27
Total number of positively charged residues (Arg + Lys): 25
Atomic composition:
Carbon C 1597
Hydrogen H 2498
Nitrogen N 422
Oxygen O 459
Sulfur S 8
Formula: C1597H2498N422O459S8
Total number of atoms: 4984
Extinction coefficients:
Conditions: 6.0 M guanidium hydrochloride
0.02 M phosphate buffer
pH 6.5
Extinction coefficients are in units of M-1 cm-1 .
The first table lists values computed assuming ALL Cys
residues appear as half cystines, whereas the second table
assumes that NONE do.
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 28490 28254 27700 26980 25840
Abs 0.1% (=1 g/l) 0.809 0.802 0.787 0.766 0.734
276 278 279 280 282
nm nm nm nm nm
Ext. coefficient 28200 28000 27460 26740 25600
Abs 0.1% (=1 g/l) 0.801 0.795 0.780 0.759 0.727
Estimated half-life:
The N-terminal of the sequence considered is M (Met).
The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
>20 hours (yeast, in vivo).
>10 hours (Escherichia coli, in vivo).
Instability index:
The instability index (II) is computed to be 43.07
This classifies the protein as unstable.
Aliphatic index: 100.63
Grand average of hydropathicity (GRAVY): -0.042| Model | Percentage of MP Gene |
|---|---|
| Coiled Coil (CCP) | 0 |
| Disordered (SEG) | 0 |
| Transmembrane (PHDhtm) | 0 |
| Transmembrane (TMHMM) | 0 |
| Homologous to known structure (PSIBLAST) | 0 |
Sequence: Amino acid sequence.
CCP (C or -): coiled coil prediction from the ccp program (NCBI toolkit)
SEG (D or -): low complexity regions (possibly disordered) from SEG
PHDhtm (H or -): Transmembrane prediction from PHDhtm
TMHMM (H or -): Transmembrane prediction from TMHMM
PSIBLAST (3 or -): Regions potentially homologous to a protein of known 3D structure, according to PSIBLAST
Pred2ary (H, E, or -): Secondary structure prediction from Pred2ary
10 20 30 40 50 60 70 80
| | | | | | | |
Sequence MEQPLCILGIETTCDDTSIGVITESKVQAHIVLSSAKLHAQTGGVVPEVAARSHEQNLLKALQQSGVVLEQITHIAYAAN
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary ----EEEEEEE------EEEEE----HHHHHHHHHH-----------HHHHHHHHHHHHHHHHH--------EEEEEE--
90 100 110 120 130 140 150 160
| | | | | | | |
Sequence PGLPGCLHVGATFARSLSFLLDKPLLPINHLYAHIFSALIDQDINQLKLPALGLVVSGGHTAIYLIKSLFDLELIAETSD
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary -----HHHHHHHHHHHHHHH----------HHHHHHHHH-----------EEEEEE-----EEEEE-----EEE------
170 180 190 200 210 220 230 240
| | | | | | | |
Sequence DAIGEVYDKVGRAMGFPYPAGPQLDSLFQPELVKSHYFFRPSTKWTKFSYSGLKSQCFTKIKQLRERKGFNPQTHDWNEF
CCP --------------------------------------------------------------------------------
SEG --------------------------------------------------------------------------------
PHDhtm --------------------------------------------------------------------------------
TMHMM --------------------------------------------------------------------------------
PSIBLAST --------------------------------------------------------------------------------
Pred2ary HHHHHHHHHHHHHH------HHHHHHHHH----------------------HHHHHHHHHHHHHH-----------HHHH
250 260 270 280 290 300 310
| | | | | | |
Sequence ASNFQATIIDHYINHVKDAIQQHQPQMLLLGGGVSANKYLREQVTQLQLPYLIAPLKYTSDNGAMIGFYANLLINGKNN
CCP -------------------------------------------------------------------------------
SEG -------------------------------------------------------------------------------
PHDhtm -------------------------------------------------------------------------------
TMHMM -------------------------------------------------------------------------------
PSIBLAST -------------------------------------------------------------------------------
Pred2ary HHHHHHHHHHHHHHHHHHHHHH----EEEEE----HHHHHHHHHHH---------------HHHHHHHHHHHHHHH---