Berkeley Structural Genomics Center

Target info for 1234B

This target is gi number 4982294 from Thermotoga maritima.

This gene has the annotation "conserved hypothetical protein".

The target was selected as part of Igor's other set.

Available sections:


Experimental status of 1234B

Note: Dates prior to 10/24/01 may be inaccurate, due to conversion from a record-keeping system which tracked only the last experimental date, to a new system which keeps track of all dates. The user is the person who entered the data, not the person who did the experiment. This will also change in the future.

ExperimentResultUserExperiment Date
SelectedDonejmc2001-12-05
ClonedDonedhshin2003-11-13
Expression testedDonedhshin2003-11-13
Solubility testedSdhshin2004-01-29
PurifiedDonedhshin2004-01-29
CrystallizedDonedhshin2004-01-29
Diffraction quality crystalsDonedhshin2004-03-03
Native diffraction dataDonedhshin2004-04-07
Phasing diffraction dataDonedhshin2004-05-25
Traceable mapDonedhshin2004-05-25
Crystal structureDonedhshin2004-06-09
In PDB1U0Ldhshin2004-07-13
Suggested work stop0 670 670 670 670 670 670 670 670 670 670 670 1219 1219 1219 1219 1219 1219 1219 1219 1219 3146 3146 3146 3146 3146 3146 3146 3146 3146 3146 3146 3146 3289 3289 3289 3289 3289 3289 3289 3289 3289 3289 3289 3328 3328 3328 3328 3328 3328 3328 3328 3328 3328 3328 3328 3328 3328 3328 3477 3477 3477 3477 3477 3477 3477 3477 3477 4359 4359 4359 4359 4359 4359 4359 4359 4359 4362 4362 4362 4362 4362 4362 4362 4362 4362 4362 4362 4362 4362 4362 4362 4367 4367 4367 4367 4367 4367 4367 4367 4367 4374 4374 4374 4374 4374 4374 4374 4374 4375 4375 4375 4375 4375 4375 4375 4375 4375 4402 4402 4402 4402 4402 4402 4402 4402 4402 4402 4402 4730 4730 4730 4730 4730 4730 4730 4730 4730 4730 4730 4825 4825 4825 4825 4825 4825 4825 4825 4825 4825 5046 5046 5046 5046 5046 5058 5058 5058 5545 5545 5545 5545 5545 5545 5545 5545 5545 5545 5545 5545 5545 5545 5545 6055 6055 6055 6055 6055 6055 6055 6055 6055 6056 6056 6056 6056 6056 6056 6056 6056 6056 6056 8052 8052 8052 8052 8052 8052 8052 8052 8052 8052 8052 12665 12665 12665 12665 13102 13102 13102 13102 13102 13102 13102 13102 13102 13102 13103 13103 13103 13103 13103 13103 13103 13103 13103 13103 13104 13104 13104 13104 13104 13104 13104 13104 13894 13894 13894 13894 13894 13894 13894 13894 13894 13894 13894 13894 13894 15042 15042 15684 15684 15684 15684 15684 15689 15689 15689 15689 15689 15689 16177 16177 16177 16177 16177 16177 16177 16177 16177 16177 16453 16453 16453 16453 16453 16453 16453 16453 16453 17109 17109 17109 17109 17109 17109 17109 17109 17109 17109 17109 17109 17309 17469 17469 18382 18382 18382 18382 18382 18382 18382 18382 18382 20092 20092 20092 20092 20092 20092 20092 20092 20093 20093 20093 20093 20093 20093 20622 20622 20622 20622 20622 20622 20622 20622 20623 20623 20623 20623 20623 20623 22550 22886 22886 22886 22886 22886 22886 22886 22886 22886 22886 23567 23567 23567 23567 24747 24747 24747 24747 24749 24749 24749 24749 24750 24750 24750 24750 27803 27803 27803 27804 27804 27804 27805 27805 27805 27989 27989 27989 27989 28250 28250 28250 29603 29603 29603 29603 29628 29628 29628 29628 29638 29638 29638 29638 29638 29638 29638 29638 29821 29821 29821 29821 29821 29821 29821 32444 32444 32444 34232 34232 34232 34233 34233 34233 34389 34389 34389 34389 34390 34390 34390 34390 38398 38398 38398 38398 38525 38525 38525 38525 38525 38525 38525 40110 40110 40110 40110 40111 40111 40111 40111 40577 40577 40577 40577 40577 40577 40577 40578 40578 40578 40578 40578 40578 40578 41792 41792 41792 41792 41983 41983 41983 41983 42467 42467 42467 44132 44132 44132 44132 44132 44133 44133 44133 44133 44133 49095 49095 49095 49095 49095 49095 49095 49401 49401 49401 51150 51150 51150 51150 52810 52810 52810 52810 52811 52811 52811 52811 52812 52812 52812 52812 52813 52813 52813 52813 52814 52814 52814 52814 52815 52815 52815 52815 54758 54758 54758 54758 54758 54758 54758 58607 58607 59985dallen2004-04-26
Override suggest work stopStructure solveddhshin2004-05-25

Back to Top


Potential MP Homologues of 1234B

MP genelog10(PSIBLAST E)% identical% coverage (of MP gene)
MP146 / MPN008-9.7026.1928.47
MP177 / MPN665-11.7016.9871.86
MP366-12.5222.7329.83
MP556 / MPN279-55.1017.0295.59
MP583 / MPN249-29.1028.5797.29
MP604 / MPN227-14.0515.5450.17
MP676 / MPN155-61.1515.9095.93

Back to Top


1234B DNA sequence

ATGAACTTGAGAAGAAGAGGAATAGTGGTGAGTTTTCACTCGAACATGGTCACGGTCGAAGACGAGGAAACTGGTGAAAG
AATCCTCTGCAAACTGAGGGGGAAATTTCGCCTGCAGAATCTCAAGATATACGTTGGAGACAGGGTGGAGTACACACCGG
ATGAAACAGGATCCGGAGTCATAGAGAACGTGCTACACAGGAAAAATCTACTCACCAAACCTCACGTCGCCAATGTGGAT
CAAGTGATTCTCGTGGTAACGGTGAAAATGCCGGAAACTTCGACCTACATAATCGATAAATTTCTTGTTCTCGCTGAAAA
GAACGAACTGGAAACGGTGATGGTCATCAACAAGATGGACCTCTACGATGAAGACGATCTGAGGAAGGTAAGAGAGCTGG
AAGAGATCTATTCTGGACTCTATCCAATCGTGAAAACCAGTGCGAAAACCGGAATGGGAATCGAAGAGCTCAAGGAATAT
CTGAAGGGAAAGATCAGCACGATGGCCGGACTCTCCGGAGTTGGAAAAAGTAGCTTGCTCAACGCCATAAATCCCGGTTT
GAAATTACGGGTGAGCGAAGTTTCAGAGAAACTCCAAAGGGGAAGGCATACTACAACAACTGCCCAGCTCTTGAAATTCG
ATTTCGGAGGTTACGTAGTCGATACTCCGGGTTTCGCAAATCTGGAGATCAACGACATAGAACCCGAGGAACTGAAACAC
TACTTCAAGGAATTTGGAGACAAACAGTGTTTCTTCTCGGACTGCAACCACGTGGATGAACCGGAGTGTGGAGTGAAAGA
AGCCGTGGAGAACGGAGAAATCGCAGAAAGTCGATACGAAAACTACGTGAAGATGTTTTACGAGCTTCTGGGAAGGAGGA
AAAAGTAA

Back to Top


1234B Codon Usage

(296 codons)
fields: [triplet] [frequency: per thousand] ([number])

UUU16.9(   5)  UCU3.4(   1)  UAU10.1(   3)  UGU6.8(   2)  
UUC20.3(   6)  UCC6.8(   2)  UAC30.4(   9)  UGC6.8(   2)  
UUA3.4(   1)  UCA3.4(   1)  UAA3.4(   1)  UGA0.0(   0)  
UUG13.5(   4)  UCG10.1(   3)  UAG0.0(   0)  UGG0.0(   0)  

CUU6.8(   2)  CCU3.4(   1)  CAU3.4(   1)  CGU0.0(   0)  
CUC40.5(  12)  CCC6.8(   2)  CAC16.9(   5)  CGC3.4(   1)  
CUA6.8(   2)  CCA3.4(   1)  CAA6.8(   2)  CGA3.4(   1)  
CUG30.4(   9)  CCG13.5(   4)  CAG10.1(   3)  CGG3.4(   1)  

AUU3.4(   1)  ACU16.9(   5)  AAU16.9(   5)  AGU13.5(   4)  
AUC30.4(   9)  ACC13.5(   4)  AAC33.8(  10)  AGC10.1(   3)  
AUA20.3(   6)  ACA13.5(   4)  AAA54.1(  16)  AGA16.9(   5)  
AUG27.0(   8)  ACG13.5(   4)  AAG33.8(  10)  AGG27.0(   8)  

GUU13.5(   4)  GCU3.4(   1)  GAU27.0(   8)  GGU13.5(   4)  
GUC20.3(   6)  GCC16.9(   5)  GAC23.6(   7)  GGC0.0(   0)  
GUA10.1(   3)  GCA6.8(   2)  GAA70.9(  21)  GGA57.4(  17)  
GUG50.7(  15)  GCG3.4(   1)  GAG40.5(  12)  GGG3.4(   1)  

Back to Top


1234B Potential Cloning Problems

No UGA codons.
2 restriction site(s) found.

Details (problem DNA region is in upper case, with the key on the line below):

Key:

atgaacttgagaagaagaggaatagtggtgagttttcactcgaacatggtcacggtcgaa
                                                            
gacgaggaaactggtgaaagaatcctctgcaaactgagggggaaatttcgcctgcagaat
                                                            
ctcaagatatacgttggagacagggtggagtacacaccggatgaaacaGGATCCggagtc
                                                2           
atagagaacgtgctacacaggaaaaatctactcaccaaacctcacgtcgccaatgtggat
                                                            
caagtgattctcgtggtaacggtgaaaatgccggaaacttcgacctacataatcgataaa
                                                            
tttcttgttctcgctgaaaagaacgaactggaaacggtgatggtcatcaacaagatggac
                                                            
ctctacgatgaagacgatctgaggaaggtaagagagctggaagAGATCTattctggactc
                                           4                
tatccaatcgtgaaaaccagtgcgaaaaccggaatgggaatcgaagagctcaaggaatat
                                                            
ctgaagggaaagatcagcacgatggccggactctccggagttggaaaaagtagcttgctc
                                                            
aacgccataaatcccggtttgaaattacgggtgagcgaagtttcagagaaactccaaagg
                                                            
ggaaggcatactacaacaactgcccagctcttgaaattcgatttcggaggttacgtagtc
                                                            
gatactccgggtttcgcaaatctggagatcaacgacatagaacccgaggaactgaaacac
                                                            
tacttcaaggaatttggagacaaacagtgtttcttctcggactgcaaccacgtggatgaa
                                                            
ccggagtgtggagtgaaagaagccgtggagaacggagaaatcgcagaaagtcgatacgaa
                                                            
aactacgtgaagatgttttacgagcttctgggaaggaggaaaaagtaa
                                                

Back to Top


1234B Automatically Selected Primers

Note: These were generated using Hisao's algorithm, and do not take into account possible mispriming, self-complementarity, or requirements for mutations anywhere in the sequences. The -R1 primers are for the N-terminal end, and begin with an NdeI restriction site (CATATG). The -R2 primers are for the C-terminal end, and begin with a BamHI site (GGATCC) if the target has no other BamHI sites. If the target does have a BamHI site, the -R2 primer wil begin with a BglII site (AGATCT), unless the target has no BglII sites. If the target has both sites, the design program gives up and doesn't prepend a restriction site to the -R2 primer!

Predicted Tm was obtained using the Tm determination tool at the Virtual Genome Center.

Primer IDPredicted Tmsequence
1234B-R162.50CATATGAACTTGAGAAGAAGAGGAATAGTGG
1234B-R260.30TTACTTTTTCCTCCTTCCCAGAA
1234B-R1063.10GGCGGTGGTGGCGGCATGAACTTGAGAAGAAGAGGAATAGTGGT
1234B-R1163.10GTTCTTCTCCTTTGCGCCCCTACTTTTTCCTCCTTCCCAGAAGC

Back to Top


1234B Translation

Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")

atgaacttgagaagaagaggaatagtggtgagttttcactcgaacatggtcacggtcgaa
 M  N  L  R  R  R  G  I  V  V  S  F  H  S  N  M  V  T  V  E 
gacgaggaaactggtgaaagaatcctctgcaaactgagggggaaatttcgcctgcagaat
 D  E  E  T  G  E  R  I  L  C  K  L  R  G  K  F  R  L  Q  N 
ctcaagatatacgttggagacagggtggagtacacaccggatgaaacaggatccggagtc
 L  K  I  Y  V  G  D  R  V  E  Y  T  P  D  E  T  G  S  G  V 
atagagaacgtgctacacaggaaaaatctactcaccaaacctcacgtcgccaatgtggat
 I  E  N  V  L  H  R  K  N  L  L  T  K  P  H  V  A  N  V  D 
caagtgattctcgtggtaacggtgaaaatgccggaaacttcgacctacataatcgataaa
 Q  V  I  L  V  V  T  V  K  M  P  E  T  S  T  Y  I  I  D  K 
tttcttgttctcgctgaaaagaacgaactggaaacggtgatggtcatcaacaagatggac
 F  L  V  L  A  E  K  N  E  L  E  T  V  M  V  I  N  K  M  D 
ctctacgatgaagacgatctgaggaaggtaagagagctggaagagatctattctggactc
 L  Y  D  E  D  D  L  R  K  V  R  E  L  E  E  I  Y  S  G  L 
tatccaatcgtgaaaaccagtgcgaaaaccggaatgggaatcgaagagctcaaggaatat
 Y  P  I  V  K  T  S  A  K  T  G  M  G  I  E  E  L  K  E  Y 
ctgaagggaaagatcagcacgatggccggactctccggagttggaaaaagtagcttgctc
 L  K  G  K  I  S  T  M  A  G  L  S  G  V  G  K  S  S  L  L 
aacgccataaatcccggtttgaaattacgggtgagcgaagtttcagagaaactccaaagg
 N  A  I  N  P  G  L  K  L  R  V  S  E  V  S  E  K  L  Q  R 
ggaaggcatactacaacaactgcccagctcttgaaattcgatttcggaggttacgtagtc
 G  R  H  T  T  T  T  A  Q  L  L  K  F  D  F  G  G  Y  V  V 
gatactccgggtttcgcaaatctggagatcaacgacatagaacccgaggaactgaaacac
 D  T  P  G  F  A  N  L  E  I  N  D  I  E  P  E  E  L  K  H 
tacttcaaggaatttggagacaaacagtgtttcttctcggactgcaaccacgtggatgaa
 Y  F  K  E  F  G  D  K  Q  C  F  F  S  D  C  N  H  V  D  E 
ccggagtgtggagtgaaagaagccgtggagaacggagaaatcgcagaaagtcgatacgaa
 P  E  C  G  V  K  E  A  V  E  N  G  E  I  A  E  S  R  Y  E 
aactacgtgaagatgttttacgagcttctgggaaggaggaaaaagtaa
 N  Y  V  K  M  F  Y  E  L  L  G  R  R  K  K  * 

Back to Top


1234B AA Sequence

MNLRRRGIVVSFHSNMVTVEDEETGERILCKLRGKFRLQNLKIYVGDRVEYTPDETGSGVIENVLHRKNLLTKPHVANVD
QVILVVTVKMPETSTYIIDKFLVLAEKNELETVMVINKMDLYDEDDLRKVRELEEIYSGLYPIVKTSAKTGMGIEELKEY
LKGKISTMAGLSGVGKSSLLNAINPGLKLRVSEVSEKLQRGRHTTTTAQLLKFDFGGYVVDTPGFANLEINDIEPEELKH
YFKEFGDKQCFFSDCNHVDEPECGVKEAVENGEIAESRYENYVKMFYELLGRRKK

Back to Top


1234B Protein Parameters

This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.

Number of amino acids: 295

Molecular weight: 33641.6

Theoretical pI: 5.70

Amino acid composition:

Ala (A)   9	  3.1%
Arg (R)  16	  5.4%
Asn (N)  15	  5.1%
Asp (D)  15	  5.1%
Cys (C)   4	  1.4%
Gln (Q)   5	  1.7%
Glu (E)  33	 11.2%
Gly (G)  22	  7.5%
His (H)   6	  2.0%
Ile (I)  16	  5.4%
Leu (L)  30	 10.2%
Lys (K)  26	  8.8%
Met (M)   8	  2.7%
Phe (F)  11	  3.7%
Pro (P)   8	  2.7%
Ser (S)  14	  4.7%
Thr (T)  17	  5.8%
Trp (W)   0	  0.0%
Tyr (Y)  12	  4.1%
Val (V)  28	  9.5%

Asx (B)   0	  0.0%
Glx (Z)   0	  0.0%
Xaa (X)   0	  0.0%

Total number of negatively charged residues (Asp + Glu): 48
Total number of positively charged residues (Arg + Lys): 42

Atomic composition:

Carbon      C	      1494
Hydrogen    H	      2397
Nitrogen    N	       401
Oxygen      O	       455
Sulfur      S	        12

Formula: C1494H2397N401O455S12
Total number of atoms: 4759

Extinction coefficients:

Conditions: 6.0 M guanidium hydrochloride
            0.02 M phosphate buffer
            pH 6.5

Extinction coefficients are in units of  M-1 cm-1 .

The first table lists values computed assuming ALL Cys 
residues appear as half cystines, whereas the second table 
assumes that NONE do. 

                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    17690   17054   16380   15600   14640
Abs 0.1% (=1 g/l)   0.526   0.507   0.487   0.464   0.435



                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    17400   16800   16140   15360   14400
Abs 0.1% (=1 g/l)   0.517   0.499   0.480   0.457   0.428


Estimated half-life:

The N-terminal of the sequence considered is M (Met).

The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
                            >20 hours (yeast, in vivo).
                            >10 hours (Escherichia coli, in vivo).


Instability index:

The instability index (II) is computed to be 37.64
This classifies the protein as stable.



Aliphatic index: 91.39

Grand average of hydropathicity (GRAVY): -0.390

Back to Top