Berkeley Structural Genomics Center

Target info for 1031B

This target is gi number 4982034 from Thermotoga maritima.

This gene has the annotation "conserved hypothetical protein".

The target was selected as part of Igor's sets.

Available sections:


Experimental status of 1031B

Note: Dates prior to 10/24/01 may be inaccurate, due to conversion from a record-keeping system which tracked only the last experimental date, to a new system which keeps track of all dates. The user is the person who entered the data, not the person who did the experiment. This will also change in the future.

ExperimentResultUserExperiment Date
SelectedDoneshkim2001-08-24
ClonedDoneabwaight2001-08-24
Expression testedDoneabwaight2001-08-24
Solubility testedSabwaight2001-08-24
PurifiedDoneabwaight2001-08-24
CrystallizedDoneabwaight2001-08-24
Diffraction quality crystalsDoneabwaight2001-08-24
Phasing diffraction dataDoneabwaight2001-08-24
Traceable mapDoneabwaight2001-08-24
Crystal structureDoneabwaight2001-08-24
In PDB1MGPdhshin2002-11-01

Back to Top


Potential MP Homologues of 1031B

MP genelog10(PSIBLAST E)% identical% coverage (of MP gene)
MP178 / MPN664-37.4023.8978.47
MP369 / MPN472-54.7019.8699.65

Back to Top


1031B DNA sequence

ATGAAGGTAAAGATTCTCGTTGACAGCACGGCCGATGTTCCTTTTTCCTGGATGGAAAAATACGATATAGATAGCATCCC
ACTTTACGTGGTGTGGGAAGACGGAAGATCCGAGCCGGACGAAAGAGAACCGGAAGAGATAATGAACTTTTACAAGAGAA
TAAGAGAAGCAGGCAGTGTACCAAAGACTTCCCAGCCGAGTGTGGAGGATTTCAAGAAGAGATATCTGAAGTACAAAGAA
GAGGATTACGATGTTGTCCTGGTTCTCACTTTATCATCGAAACTCTCCGGAACTTACAATTCCGCTGTTCTCGCATCCAA
AGAGGTGGACATACCCGTTTACGTTGTGGATACTCTTCTCGCTTCGGGTGCGATACCCCTTCCGGCCCGTGTGGCTCGTG
AAATGCTCGAAAATGGTGCCACCATCGAGGAAGTACTGAAAAAACTCGATGAGAGAATGAAGAACAAGGACTTCAAAGCG
ATTTTTTATGTTTCGAATTTCGATTATCTTGTTAAGGGGGGAAGGGTGTCGAAGTTCCAGGGATTTGTGGGAAATCTTCT
CAAGATAAGAGTGTGTCTTCACATAGAGAACGGTGAGCTCATCCCTTACAGAAAGGTCAGAGGAGACAAGAAGGCGATAG
AAGCCCTCATAGAAAAACTGCGTGAGGATACTCCTGAAGGCTCGAAGCTGAGAGTGATAGGTGTTCACGCGGACAACGAG
GCTGGAGTCGTGGAGCTTTTGAACACACTCAGAAAAAGTTACGAGGTTGTCGACGAGATCATATCGCCGATGGGAAAGGT
GATCACGACTCATGTGGGACCCGGGACGGTTGGGTTTGGAATCGAGGTTTTAGAACGAAAAAGATGA

Back to Top


1031B Codon Usage

(289 codons)
fields: [triplet] [frequency: per thousand] ([number])

UUU17.3(   5)  UCU0.0(   0)  UAU10.4(   3)  UGU3.5(   1)  
UUC13.8(   4)  UCC20.8(   6)  UAC31.1(   9)  UGC0.0(   0)  
UUA6.9(   2)  UCA3.5(   1)  UAA0.0(   0)  UGA3.5(   1)  
UUG3.5(   1)  UCG20.8(   6)  UAG0.0(   0)  UGG6.9(   2)  

CUU24.2(   7)  CCU10.4(   3)  CAU3.5(   1)  CGU10.4(   3)  
CUC38.1(  11)  CCC10.4(   3)  CAC6.9(   2)  CGC0.0(   0)  
CUA0.0(   0)  CCA6.9(   2)  CAA0.0(   0)  CGA3.5(   1)  
CUG17.3(   5)  CCG17.3(   5)  CAG6.9(   2)  CGG0.0(   0)  

AUU6.9(   2)  ACU20.8(   6)  AAU13.8(   4)  AGU10.4(   3)  
AUC20.8(   6)  ACC3.5(   1)  AAC17.3(   5)  AGC6.9(   2)  
AUA38.1(  11)  ACA3.5(   1)  AAA34.6(  10)  AGA41.5(  12)  
AUG20.8(   6)  ACG10.4(   3)  AAG58.8(  17)  AGG3.5(   1)  

GUU45.0(  13)  GCU13.8(   4)  GAU34.6(  10)  GGU13.8(   4)  
GUC13.8(   4)  GCC13.8(   4)  GAC27.7(   8)  GGC6.9(   2)  
GUA10.4(   3)  GCA6.9(   2)  GAA48.4(  14)  GGA34.6(  10)  
GUG45.0(  13)  GCG13.8(   4)  GAG51.9(  15)  GGG10.4(   3)  

Back to Top


1031B Potential Cloning Problems

No UGA codons.
1 restriction site(s) found.

Details (problem DNA region is in upper case, with the key on the line below):

Key:

atgaaggtaaagattctcgttgacagcacggccgatgttcctttttcctggatggaaaaa
                                                            
tacgatatagatagcatcccactttacgtggtgtgggaagacggaagatccgagccggac
                                                            
gaaagagaaccggaagagataatgaacttttacaagagaataagagaagcaggcagtgta
                                                            
ccaaagacttcccagccgagtgtggaggatttcaagaagagatatctgaagtacaaagaa
                                                            
gaggattacgatgttgtcctggttctcactttatcatcgaaactctccggaacttacaat
                                                            
tccgctgttctcgcatccaaagaggtggacatacccgtttacgttgtggatactcttctc
                                                            
gcttcgggtgcgataccccttccggcccgtgtggctcgtgaaatgctcgaaaatggtgcc
                                                            
accatcgaggaagtactgaaaaaactcgatgagagaatgaagaacaaggacttcaaagcg
                                                            
attttttatgtttcgaatttcgattatcttgttaaggggggaagggtgtcgaagttccag
                                                            
ggatttgtgggaaatcttctcaagataagagtgtgtcttcacatagagaacggtgagctc
                                                            
atcccttacagaaaggtcagaggagacaagaaggcgatagaagccctcatagaaaaactg
                                                            
cgtgaggatactcctgaaggctcgaagctgagagtgataggtgttcacgcggacaacgag
                                                            
gctggagtcgtggagcttttgaacacactcagaaaaagttacgaggttgtcgacgagatc
                                                            
atatcgccgatgggaaaggTGATCAcgactcatgtgggacccgggacggttgggtttgga
                   3                                        
atcgaggttttagaacgaaaaagatga
                           

Back to Top


1031B Automatically Selected Primers

Note: These were generated using Hisao's algorithm, and do not take into account possible mispriming, self-complementarity, or requirements for mutations anywhere in the sequences. The -R1 primers are for the N-terminal end, and begin with an NdeI restriction site (CATATG). The -R2 primers are for the C-terminal end, and begin with a BamHI site (GGATCC) if the target has no other BamHI sites. If the target does have a BamHI site, the -R2 primer wil begin with a BglII site (AGATCT), unless the target has no BglII sites. If the target has both sites, the design program gives up and doesn't prepend a restriction site to the -R2 primer!

Predicted Tm was obtained using the Tm determination tool at the Virtual Genome Center.

Primer IDPredicted Tmsequence
1031B-R160.30CATATGAAGGTAAAGATTCTCGTTGACA
1031B-R261.20GGATCCTTATCATCTTTGTTCTAAAACCTCGATTC
1031B-R360.90GGATCCTTATCTTTTTCGTTCTAAAACCTCGATT
1031B-R1064.00GGCGGTGGTGGCGGCATGAAGGTAAAGATTCTCGTTGACAGC
1031B-R1164.80GTTCTTCTCCTTTGCGCCCCTATCTTTTTCGTTCTAAAACCTCGATTCC

Back to Top


1031B Translation

Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")

atgaaggtaaagattctcgttgacagcacggccgatgttcctttttcctggatggaaaaa
 M  K  V  K  I  L  V  D  S  T  A  D  V  P  F  S  W  M  E  K 
tacgatatagatagcatcccactttacgtggtgtgggaagacggaagatccgagccggac
 Y  D  I  D  S  I  P  L  Y  V  V  W  E  D  G  R  S  E  P  D 
gaaagagaaccggaagagataatgaacttttacaagagaataagagaagcaggcagtgta
 E  R  E  P  E  E  I  M  N  F  Y  K  R  I  R  E  A  G  S  V 
ccaaagacttcccagccgagtgtggaggatttcaagaagagatatctgaagtacaaagaa
 P  K  T  S  Q  P  S  V  E  D  F  K  K  R  Y  L  K  Y  K  E 
gaggattacgatgttgtcctggttctcactttatcatcgaaactctccggaacttacaat
 E  D  Y  D  V  V  L  V  L  T  L  S  S  K  L  S  G  T  Y  N 
tccgctgttctcgcatccaaagaggtggacatacccgtttacgttgtggatactcttctc
 S  A  V  L  A  S  K  E  V  D  I  P  V  Y  V  V  D  T  L  L 
gcttcgggtgcgataccccttccggcccgtgtggctcgtgaaatgctcgaaaatggtgcc
 A  S  G  A  I  P  L  P  A  R  V  A  R  E  M  L  E  N  G  A 
accatcgaggaagtactgaaaaaactcgatgagagaatgaagaacaaggacttcaaagcg
 T  I  E  E  V  L  K  K  L  D  E  R  M  K  N  K  D  F  K  A 
attttttatgtttcgaatttcgattatcttgttaaggggggaagggtgtcgaagttccag
 I  F  Y  V  S  N  F  D  Y  L  V  K  G  G  R  V  S  K  F  Q 
ggatttgtgggaaatcttctcaagataagagtgtgtcttcacatagagaacggtgagctc
 G  F  V  G  N  L  L  K  I  R  V  C  L  H  I  E  N  G  E  L 
atcccttacagaaaggtcagaggagacaagaaggcgatagaagccctcatagaaaaactg
 I  P  Y  R  K  V  R  G  D  K  K  A  I  E  A  L  I  E  K  L 
cgtgaggatactcctgaaggctcgaagctgagagtgataggtgttcacgcggacaacgag
 R  E  D  T  P  E  G  S  K  L  R  V  I  G  V  H  A  D  N  E 
gctggagtcgtggagcttttgaacacactcagaaaaagttacgaggttgtcgacgagatc
 A  G  V  V  E  L  L  N  T  L  R  K  S  Y  E  V  V  D  E  I 
atatcgccgatgggaaaggtgatcacgactcatgtgggacccgggacggttgggtttgga
 I  S  P  M  G  K  V  I  T  T  H  V  G  P  G  T  V  G  F  G 
atcgaggttttagaacgaaaaagatga
 I  E  V  L  E  R  K  R  * 

Back to Top


1031B AA Sequence

MKVKILVDSTADVPFSWMEKYDIDSIPLYVVWEDGRSEPDEREPEEIMNFYKRIREAGSVPKTSQPSVEDFKKRYLKYKE
EDYDVVLVLTLSSKLSGTYNSAVLASKEVDIPVYVVDTLLASGAIPLPARVAREMLENGATIEEVLKKLDERMKNKDFKA
IFYVSNFDYLVKGGRVSKFQGFVGNLLKIRVCLHIENGELIPYRKVRGDKKAIEALIEKLREDTPEGSKLRVIGVHADNE
AGVVELLNTLRKSYEVVDEIISPMGKVITTHVGPGTVGFGIEVLERKR

Back to Top


1031B Protein Parameters

This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.

Number of amino acids: 288

Molecular weight: 32574.6

Theoretical pI: 5.86

Amino acid composition:

Ala (A)  14	  4.9%
Arg (R)  17	  5.9%
Asn (N)   9	  3.1%
Asp (D)  18	  6.2%
Cys (C)   1	  0.3%
Gln (Q)   2	  0.7%
Glu (E)  29	 10.1%
Gly (G)  19	  6.6%
His (H)   3	  1.0%
Ile (I)  19	  6.6%
Leu (L)  26	  9.0%
Lys (K)  27	  9.4%
Met (M)   6	  2.1%
Phe (F)   9	  3.1%
Pro (P)  13	  4.5%
Ser (S)  18	  6.2%
Thr (T)  11	  3.8%
Trp (W)   2	  0.7%
Tyr (Y)  12	  4.2%
Val (V)  33	 11.5%

Asx (B)   0	  0.0%
Glx (Z)   0	  0.0%
Xaa (X)   0	  0.0%

Total number of negatively charged residues (Asp + Glu): 47
Total number of positively charged residues (Arg + Lys): 44

Atomic composition:

Carbon      C	      1467
Hydrogen    H	      2359
Nitrogen    N	       385
Oxygen      O	       435
Sulfur      S	         7

Formula: C1467H2359N385O435S7
Total number of atoms: 4653

Extinction coefficients:

Conditions: 6.0 M guanidium hydrochloride
            0.02 M phosphate buffer
            pH 6.5

Extinction coefficients are in units of  M-1 cm-1 .

The first table lists values computed assuming ALL Cys 
residues appear as half cystines, whereas the second table 
assumes that NONE do. 

                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    28200   28000   27460   26740   25600
Abs 0.1% (=1 g/l)   0.866   0.860   0.843   0.821   0.786



                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    28200   28000   27460   26740   25600
Abs 0.1% (=1 g/l)   0.866   0.860   0.843   0.821   0.786


Estimated half-life:

The N-terminal of the sequence considered is M (Met).

The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
                            >20 hours (yeast, in vivo).
                            >10 hours (Escherichia coli, in vivo).


Instability index:

The instability index (II) is computed to be 36.17
This classifies the protein as stable.



Aliphatic index: 99.03

Grand average of hydropathicity (GRAVY): -0.261

Back to Top