Berkeley Structural Genomics Center

Target info for 1114B

This target is gi number 2650718 from Archaeoglobus fulgidus.

This gene has the annotation "conserved hypothetical protein".

The target was selected as part of JMC's first set.

Available sections:


Experimental status of 1114B

Note: Dates prior to 10/24/01 may be inaccurate, due to conversion from a record-keeping system which tracked only the last experimental date, to a new system which keeps track of all dates. The user is the person who entered the data, not the person who did the experiment. This will also change in the future.

ExperimentResultUserExperiment Date
SelectedDonejmc2001-08-28
ClonedDoner_kim2002-03-02
Expression testedDoner_kim2002-03-02
Solubility testedSr_kim2002-03-02
PurifiedDoner_kim2003-12-01
CrystallizedDonedhshin2003-12-02
Diffraction quality crystalsDonedhshin2004-01-27
Native diffraction dataDonedhshin2004-01-27
Phasing diffraction dataDonedhshin2004-01-27
Traceable mapDonedhshin2004-02-06
Crystal structureDonedhshin2005-04-13
In PDB1SUW 1Z0S 1Z0U 1Z0Zdhshin2005-04-13

Back to Top


Potential MP Homologues of 1114B

MP genelog10(PSIBLAST E)% identical% coverage (of MP gene)
MP566 / MPN267-46.3021.0599.20

Back to Top


1114B DNA sequence

ATGAGGGCCGCAGTGGTTTACAAAACCGATGGTCACGTGAAGAGGATTGAGGAAGCTTTGAAAAGGCTGGAGGTTGAGGT
TGAGCTCTTCAACCAGCCTTCAGAAGAGCTTGAGAATTTTGACTTCATTGTGAGTGTTGGGGGAGACGGGACAATTCTGA
GAATACTGCAGAAGCTGAAGCGCTGCCCGCCCATTTTCGGGATAAACACCGGAAGAGTCGGTCTGCTTACCCACGCCAGC
CCTGAAAACTTTGAGGTTGAGCTTAAAAAGGCGGTGGAGAAGTTTGAGGTTGAGAGGTTTCCGAGAGTAAGCTGCTCCGC
AATGCCCGATGTTCTTGCCCTCAACGAGATTGCCGTTTTGAGCAGAAAACCGGCGAAGATGATAGACGTCGCTTTGAGGG
TCGATGGCGTGGAGGTGGACAGAATAAGGTGCGACGGCTTTATTGTTGCAACTCAGATTGGCTCCACAGGCTACGCCTTC
TCTGCTGGAGGGCCGGTAGTTGAGCCCTACCTCGAATGCTTCATCCTCATCCCCATTGCACCCTTCCGCTTCGGCTGGAA
GCCCTACGTGGTCAGCATGGAGAGGAAAATTGAGGTTATTGCTGAGAAAGCCATTGTGGTGGCTGATGGGCAGAAGAGCG
TGGATTTTGATGGTGAGATAACCATAGAAAAGTCGGAATTTCCGGCTGTGTTCTTCAAAAACGAGAAAAGGTTCAGGAAC
CTTTTCGGGAAAGTTAGGAGCATAGGTTAA

Back to Top


1114B Codon Usage

(250 codons)
fields: [triplet] [frequency: per thousand] ([number])

UUU28.0(   7)  UCU4.0(   1)  UAU0.0(   0)  UGU0.0(   0)  
UUC44.0(  11)  UCC8.0(   2)  UAC16.0(   4)  UGC16.0(   4)  
UUA0.0(   0)  UCA4.0(   1)  UAA4.0(   1)  UGA0.0(   0)  
UUG12.0(   3)  UCG4.0(   1)  UAG0.0(   0)  UGG4.0(   1)  

CUU20.0(   5)  CCU8.0(   2)  CAU0.0(   0)  CGU0.0(   0)  
CUC16.0(   4)  CCC24.0(   6)  CAC8.0(   2)  CGC8.0(   2)  
CUA0.0(   0)  CCA0.0(   0)  CAA0.0(   0)  CGA0.0(   0)  
CUG20.0(   5)  CCG20.0(   5)  CAG16.0(   4)  CGG0.0(   0)  

AUU44.0(  11)  ACU4.0(   1)  AAU4.0(   1)  AGU4.0(   1)  
AUC8.0(   2)  ACC16.0(   4)  AAC24.0(   6)  AGC24.0(   6)  
AUA28.0(   7)  ACA8.0(   2)  AAA36.0(   9)  AGA20.0(   5)  
AUG16.0(   4)  ACG0.0(   0)  AAG36.0(   9)  AGG40.0(  10)  

GUU48.0(  12)  GCU24.0(   6)  GAU24.0(   6)  GGU16.0(   4)  
GUC16.0(   4)  GCC24.0(   6)  GAC20.0(   5)  GGC20.0(   5)  
GUA8.0(   2)  GCA16.0(   4)  GAA24.0(   6)  GGA12.0(   3)  
GUG44.0(  11)  GCG8.0(   2)  GAG76.0(  19)  GGG24.0(   6)  

Back to Top


1114B Potential Cloning Problems

No UGA codons or (bad) restriction sites!

Back to Top


1114B Automatically Selected Primers

Note: These were generated using Hisao's algorithm, and do not take into account possible mispriming, self-complementarity, or requirements for mutations anywhere in the sequences. The -R1 primers are for the N-terminal end, and begin with an NdeI restriction site (CATATG). The -R2 primers are for the C-terminal end, and begin with a BamHI site (GGATCC) if the target has no other BamHI sites. If the target does have a BamHI site, the -R2 primer wil begin with a BglII site (AGATCT), unless the target has no BglII sites. If the target has both sites, the design program gives up and doesn't prepend a restriction site to the -R2 primer!

Predicted Tm was obtained using the Tm determination tool at the Virtual Genome Center.

Primer IDPredicted Tmsequence
1114B-R161.70CATATGAGGGCCGCAGTGG
1114B-R262.80GGATCCTTAACCTATGCTCCTAACTTTCCCG
1114B-R1063.90GGCGGTGGTGGCGGCATGAGGGCCGCAGTGGTT
1114B-R1163.20GTTCTTCTCCTTTGCGCCCCTAACCTATGCTCCTAACTTTCCCGA

Back to Top


1114B Translation

Note: This was generated using the standard codon usage table; UGA codons in MP/MP genes will show up as terminators ("*")

atgagggccgcagtggtttacaaaaccgatggtcacgtgaagaggattgaggaagctttg
 M  R  A  A  V  V  Y  K  T  D  G  H  V  K  R  I  E  E  A  L 
aaaaggctggaggttgaggttgagctcttcaaccagccttcagaagagcttgagaatttt
 K  R  L  E  V  E  V  E  L  F  N  Q  P  S  E  E  L  E  N  F 
gacttcattgtgagtgttgggggagacgggacaattctgagaatactgcagaagctgaag
 D  F  I  V  S  V  G  G  D  G  T  I  L  R  I  L  Q  K  L  K 
cgctgcccgcccattttcgggataaacaccggaagagtcggtctgcttacccacgccagc
 R  C  P  P  I  F  G  I  N  T  G  R  V  G  L  L  T  H  A  S 
cctgaaaactttgaggttgagcttaaaaaggcggtggagaagtttgaggttgagaggttt
 P  E  N  F  E  V  E  L  K  K  A  V  E  K  F  E  V  E  R  F 
ccgagagtaagctgctccgcaatgcccgatgttcttgccctcaacgagattgccgttttg
 P  R  V  S  C  S  A  M  P  D  V  L  A  L  N  E  I  A  V  L 
agcagaaaaccggcgaagatgatagacgtcgctttgagggtcgatggcgtggaggtggac
 S  R  K  P  A  K  M  I  D  V  A  L  R  V  D  G  V  E  V  D 
agaataaggtgcgacggctttattgttgcaactcagattggctccacaggctacgccttc
 R  I  R  C  D  G  F  I  V  A  T  Q  I  G  S  T  G  Y  A  F 
tctgctggagggccggtagttgagccctacctcgaatgcttcatcctcatccccattgca
 S  A  G  G  P  V  V  E  P  Y  L  E  C  F  I  L  I  P  I  A 
cccttccgcttcggctggaagccctacgtggtcagcatggagaggaaaattgaggttatt
 P  F  R  F  G  W  K  P  Y  V  V  S  M  E  R  K  I  E  V  I 
gctgagaaagccattgtggtggctgatgggcagaagagcgtggattttgatggtgagata
 A  E  K  A  I  V  V  A  D  G  Q  K  S  V  D  F  D  G  E  I 
accatagaaaagtcggaatttccggctgtgttcttcaaaaacgagaaaaggttcaggaac
 T  I  E  K  S  E  F  P  A  V  F  F  K  N  E  K  R  F  R  N 
cttttcgggaaagttaggagcataggttaa
 L  F  G  K  V  R  S  I  G  * 

Back to Top


1114B AA Sequence

MRAAVVYKTDGHVKRIEEALKRLEVEVELFNQPSEELENFDFIVSVGGDGTILRILQKLKRCPPIFGINTGRVGLLTHAS
PENFEVELKKAVEKFEVERFPRVSCSAMPDVLALNEIAVLSRKPAKMIDVALRVDGVEVDRIRCDGFIVATQIGSTGYAF
SAGGPVVEPYLECFILIPIAPFRFGWKPYVVSMERKIEVIAEKAIVVADGQKSVDFDGEITIEKSEFPAVFFKNEKRFRN
LFGKVRSIG

Back to Top


1114B Protein Parameters

This information was obtained using the Protein Parameters tool on the ExPASy Molecular Biology Server.

Number of amino acids: 249

Molecular weight: 27868.4

Theoretical pI: 6.25

Amino acid composition:

Ala (A)  18	  7.2%
Arg (R)  17	  6.8%
Asn (N)   7	  2.8%
Asp (D)  11	  4.4%
Cys (C)   4	  1.6%
Gln (Q)   4	  1.6%
Glu (E)  25	 10.0%
Gly (G)  18	  7.2%
His (H)   2	  0.8%
Ile (I)  20	  8.0%
Leu (L)  17	  6.8%
Lys (K)  18	  7.2%
Met (M)   4	  1.6%
Phe (F)  18	  7.2%
Pro (P)  13	  5.2%
Ser (S)  12	  4.8%
Thr (T)   7	  2.8%
Trp (W)   1	  0.4%
Tyr (Y)   4	  1.6%
Val (V)  29	 11.6%

Asx (B)   0	  0.0%
Glx (Z)   0	  0.0%
Xaa (X)   0	  0.0%

Total number of negatively charged residues (Asp + Glu): 36
Total number of positively charged residues (Arg + Lys): 35

Atomic composition:

Carbon      C	      1266
Hydrogen    H	      2016
Nitrogen    N	       334
Oxygen      O	       356
Sulfur      S	         8

Formula: C1266H2016N334O356S8
Total number of atoms: 3980

Extinction coefficients:

Conditions: 6.0 M guanidium hydrochloride
            0.02 M phosphate buffer
            pH 6.5

Extinction coefficients are in units of  M-1 cm-1 .

The first table lists values computed assuming ALL Cys 
residues appear as half cystines, whereas the second table 
assumes that NONE do. 

                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    11490   11454   11280   11050   10640
Abs 0.1% (=1 g/l)   0.412   0.411   0.405   0.397   0.382



                      276     278     279     280     282
                       nm      nm      nm      nm      nm
Ext. coefficient    11200   11200   11040   10810   10400
Abs 0.1% (=1 g/l)   0.402   0.402   0.396   0.388   0.373


Estimated half-life:

The N-terminal of the sequence considered is M (Met).

The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
                            >20 hours (yeast, in vivo).
                            >10 hours (Escherichia coli, in vivo).


Instability index:

The instability index (II) is computed to be 42.54
This classifies the protein as unstable.



Aliphatic index: 98.96

Grand average of hydropathicity (GRAVY): 0.043

Back to Top