Berkeley Structural Genomics Center

Target info for 1071B

This target is gi number 4981173 from Thermotoga maritima.

This gene has the annotation "conserved hypothetical protein".

The target was selected as part of Igor's sets.

Available sections:


Experimental status of 1071B

Note: Dates prior to 10/24/01 may be inaccurate, due to conversion from a record-keeping system which tracked only the last experimental date, to a new system which keeps track of all dates. The user is the person who entered the data, not the person who did the experiment. This will also change in the future.

ExperimentResultUserExperiment Date
SelectedDonejmc2001-08-24
ClonedDoneabwaight2001-08-24
Expression testedDoneabwaight2001-08-24
Solubility testedSabwaight2001-08-24
PurifiedDoneabwaight2001-08-24
CrystallizedDoneabwaight2001-08-24
Diffraction quality crystalsDoneabwaight2001-08-24
Native diffraction dataDoneabwaight2001-08-24
Phasing diffraction dataDoneabwaight2001-08-24
Traceable mapDoneabwaight2001-08-24
Crystal structureDoneabwaight2001-08-24
In PDB1NF2dhshin2002-12-12
Biochemical functionDonedhshin2002-12-11

Back to Top


Potential MP Homologues of 1071B

MP genelog10(PSIBLAST E)% identical% coverage (of MP gene)
MP290 / MPN552-6.2211.4378.36
MP414 / MPN427-37.0022.7896.64
MP454 / MPN383-44.5221.4697.39
MP456 / MPN381-40.1021.3797.76
MP569 / MPN264-35.0023.2992.91
MP622 / MPN209-4.4017.3294.78

Back to Top


1071B DNA sequence

TTGTACAGGGTGTTCGTTTTTGACCTGGACGGAACACTTCTCAACGACAACCTGGAGATATCAGAAAAGGACAGAAGAAA
CATAGAGAAACTTTCCAGAAAGTGCTATGTTGTCTTTGCAAGCGGAAGAATGCTCGTTTCCACACTGAACGTGGAAAAGA
AGTACTTCAAAAGAACCTTTCCAACGATCGCTTACAACGGTGCGATAGTGTACCTTCCAGAAGAAGGCGTGATCCTGAAC
GAGAAGATCCCGCCTGAGGTAGCAAAAGACATCATAGAATACATAAAACCGCTCAACGTTCACTGGCAGGCTTACATTGA
CGATGTGCTCTACTCCGAAAAGGACAACGAAGAGATAAAAAGCTACGCAAGACACTCGAACGTGGACTACCGCGTTGAAC
CGAACCTTTCTGAACTCGTCTCAAAGATGGGAACGACGAAACTTCTTCTCATCGATACCCCGGAAAGACTCGACGAGTTG
AAAGAGATTCTCTCTGAAAGGTTCAAAGATGTGGTGAAGGTCTTCAAGTCCTTCCCTACTTACCTTGAAATCGTTCCGAA
GAACGTGGACAAGGGAAAGGCCCTGAGGTTTTTGAGAGAGAGGATGAACTGGAAAAAGGAAGAGATCGTCGTCTTCGGTG
ACAACGAGAACGACCTGTTCATGTTCGAAGAGGCAGGCCTTCGTGTTGCAATGGAAAACGCCATAGAGAAGGTGAAGGAG
GCATCGGATATTGTCACGCTCACTAACAACGATTCTGGTGTGTCTTATGTTCTTGAGCGCATTTCCACAGATTGTCTTGA
TGAGTGA

Back to Top


1071B Codon Usage

(269 codons)
fields: [triplet] [frequency: per thousand] ([number])

UUU14.9(   4)  UCU14.9(   4)  UAU7.4(   2)  UGU3.7(   1)  
UUC29.7(   8)  UCC18.6(   5)  UAC37.2(  10)  UGC3.7(   1)  
UUA0.0(   0)  UCA7.4(   2)  UAA0.0(   0)  UGA3.7(   1)  
UUG11.2(   3)  UCG7.4(   2)  UAG0.0(   0)  UGG7.4(   2)  

CUU37.2(  10)  CCU7.4(   2)  CAU0.0(   0)  CGU3.7(   1)  
CUC33.5(   9)  CCC0.0(   0)  CAC7.4(   2)  CGC7.4(   2)  
CUA0.0(   0)  CCA7.4(   2)  CAA0.0(   0)  CGA0.0(   0)  
CUG22.3(   6)  CCG18.6(   5)  CAG3.7(   1)  CGG0.0(   0)  

AUU14.9(   4)  ACU7.4(   2)  AAU0.0(   0)  AGU0.0(   0)  
AUC26.0(   7)  ACC7.4(   2)  AAC63.2(  17)  AGC7.4(   2)  
AUA26.0(   7)  ACA11.2(   3)  AAA33.5(   9)  AGA29.7(   8)  
AUG18.6(   5)  ACG14.9(   4)  AAG55.8(  15)  AGG14.9(   4)  

GUU29.7(   8)  GCU7.4(   2)  GAU26.0(   7)  GGU11.2(   3)  
GUC22.3(   6)  GCC7.4(   2)  GAC44.6(  12)  GGC7.4(   2)  
GUA3.7(   1)  GCA22.3(   6)  GAA55.8(  15)  GGA14.9(   4)  
GUG40.9(  11)  GCG3.7(   1)  GAG55.8(  15)  GGG0.0(   0)  

Back to Top


1071B Potential Cloning Problems

No UGA codons or (bad) restriction sites!

Back to Top


1071B Automatically Selected Primers

Note: These were generated using Hisao's algorithm, and do not take into account possible mispriming, self-complementarity, or requirements for mutations anywhere in the sequences. The -R1 primers are for the N-terminal end, and begin with an NdeI restriction site (CATATG). The -R2 primers are for the C-terminal end, and begin with a BamHI site (GGATCC) if the target has no other BamHI sites. If the target does have a BamHI site, the -R2 primer wil begin with a BglII site (AGATCT), unless the target has no BglII sites. If the target has both sites, the design program gives up and doesn't prepend a restriction site to the -R2 primer!

Predicted Tm was obtained using the Tm determination tool at the Virtual Genome Center.

Primer IDPredicted Tmsequence
1071B-R160.30CATATGTACAGGGTGTTCGTTTTTGA
1071B-R260.60GGATCCTTATCACTCATGACAATCTGTGGAAA
1071B-R360.10GGATCCTTACTCATCAAGACAATCTGTGGAAAT
1071B-R1064.20GGCGGTGGTGGCGGCATGTACAGGGTGTTCGTTTTTGACC
1071B-R1165.70GTTCTTCTCCTTTGCGCCCCTACTCATCAAGACAATCTGTGGAAATGC

Back to Top


1071B Translation

Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")

ttgtacagggtgttcgtttttgacctggacggaacacttctcaacgacaacctggagata
 L  Y  R  V  F  V  F  D  L  D  G  T  L  L  N  D  N  L  E  I 
tcagaaaaggacagaagaaacatagagaaactttccagaaagtgctatgttgtctttgca
 S  E  K  D  R  R  N  I  E  K  L  S  R  K  C  Y  V  V  F  A 
agcggaagaatgctcgtttccacactgaacgtggaaaagaagtacttcaaaagaaccttt
 S  G  R  M  L  V  S  T  L  N  V  E  K  K  Y  F  K  R  T  F 
ccaacgatcgcttacaacggtgcgatagtgtaccttccagaagaaggcgtgatcctgaac
 P  T  I  A  Y  N  G  A  I  V  Y  L  P  E  E  G  V  I  L  N 
gagaagatcccgcctgaggtagcaaaagacatcatagaatacataaaaccgctcaacgtt
 E  K  I  P  P  E  V  A  K  D  I  I  E  Y  I  K  P  L  N  V 
cactggcaggcttacattgacgatgtgctctactccgaaaaggacaacgaagagataaaa
 H  W  Q  A  Y  I  D  D  V  L  Y  S  E  K  D  N  E  E  I  K 
agctacgcaagacactcgaacgtggactaccgcgttgaaccgaacctttctgaactcgtc
 S  Y  A  R  H  S  N  V  D  Y  R  V  E  P  N  L  S  E  L  V 
tcaaagatgggaacgacgaaacttcttctcatcgataccccggaaagactcgacgagttg
 S  K  M  G  T  T  K  L  L  L  I  D  T  P  E  R  L  D  E  L 
aaagagattctctctgaaaggttcaaagatgtggtgaaggtcttcaagtccttccctact
 K  E  I  L  S  E  R  F  K  D  V  V  K  V  F  K  S  F  P  T 
taccttgaaatcgttccgaagaacgtggacaagggaaaggccctgaggtttttgagagag
 Y  L  E  I  V  P  K  N  V  D  K  G  K  A  L  R  F  L  R  E 
aggatgaactggaaaaaggaagagatcgtcgtcttcggtgacaacgagaacgacctgttc
 R  M  N  W  K  K  E  E  I  V  V  F  G  D  N  E  N  D  L  F 
atgttcgaagaggcaggccttcgtgttgcaatggaaaacgccatagagaaggtgaaggag
 M  F  E  E  A  G  L  R  V  A  M  E  N  A  I  E  K  V  K  E 
gcatcggatattgtcacgctcactaacaacgattctggtgtgtcttatgttcttgagcgc
 A  S  D  I  V  T  L  T  N  N  D  S  G  V  S  Y  V  L  E  R 
atttccacagattgtcttgatgagtga
 I  S  T  D  C  L  D  E  * 

Back to Top


1071B AA Sequence

MYRVFVFDLDGTLLNDNLEISEKDRRNIEKLSRKCYVVFASGRMLVSTLNVEKKYFKRTFPTIAYNGAIVYLPEEGVILN
EKIPPEVAKDIIEYIKPLNVHWQAYIDDVLYSEKDNEEIKSYARHSNVDYRVEPNLSELVSKMGTTKLLLIDTPERLDEL
KEILSERFKDVVKVFKSFPTYLEIVPKNVDKGKALRFLRERMNWKKEEIVVFGDNENDLFMFEEAGLRVAMENAIEKVKE
ASDIVTLTNNDSGVSYVLERISTDCLDE

Back to Top


1071B Protein Parameters

This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.

Number of amino acids: 268

Molecular weight: 31186.7

Theoretical pI: 5.04

Amino acid composition:

Ala (A)  11	  4.1%
Arg (R)  15	  5.6%
Asn (N)  17	  6.3%
Asp (D)  19	  7.1%
Cys (C)   2	  0.7%
Gln (Q)   1	  0.4%
Glu (E)  30	 11.2%
Gly (G)   9	  3.4%
His (H)   2	  0.7%
Ile (I)  18	  6.7%
Leu (L)  27	 10.1%
Lys (K)  24	  9.0%
Met (M)   6	  2.2%
Phe (F)  12	  4.5%
Pro (P)   9	  3.4%
Ser (S)  15	  5.6%
Thr (T)  11	  4.1%
Trp (W)   2	  0.7%
Tyr (Y)  12	  4.5%
Val (V)  26	  9.7%

Asx (B)   0	  0.0%
Glx (Z)   0	  0.0%
Xaa (X)   0	  0.0%

Total number of negatively charged residues (Asp + Glu): 49
Total number of positively charged residues (Arg + Lys): 39

Atomic composition:

Carbon      C	      1404
Hydrogen    H	      2225
Nitrogen    N	       361
Oxygen      O	       423
Sulfur      S	         8

Formula: C1404H2225N361O423S8
Total number of atoms: 4421

Extinction coefficients:

Conditions: 6.0 M guanidium hydrochloride
            0.02 M phosphate buffer
            pH 6.5

Extinction coefficients are in units of  M-1 cm-1 .

The first table lists values computed assuming ALL Cys 
residues appear as half cystines, whereas the second table 
assumes that NONE do. 

                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    28345   28127   27580   26860   25720
Abs 0.1% (=1 g/l)   0.909   0.902   0.884   0.861   0.825



                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    28200   28000   27460   26740   25600
Abs 0.1% (=1 g/l)   0.904   0.898   0.881   0.857   0.821


Estimated half-life:

The N-terminal of the sequence considered is M (Met).

The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
                            >20 hours (yeast, in vivo).
                            >10 hours (Escherichia coli, in vivo).


Instability index:

The instability index (II) is computed to be 34.87
This classifies the protein as stable.



Aliphatic index: 97.72

Grand average of hydropathicity (GRAVY): -0.353

Back to Top