Rules for Protein Info Tables


Last Update: 2002-07-16 Kara

These tables store the data for the protein info and composition pages in AspGD.


TABLE Protein_Info
========
protein_info_no                 Required.
				Primary key.  Oracle sequence number.
				Cannot be updated.
				NUMBER(10)

molecular_weight		Optional.
				NUMBER(6)

pi	                        Optional.
				NUMBER(4,2)

cai				Optional.
				NUMBER(4)

length			        Optional.
				NUMBER(5)

n_term_seq			Optional.
				VARCHAR2(7)

c_term_seq			Optional.
				VARCHAR2(7)

codon_bias			Optional.
				NUMBER(4,3)

ala				Optional.
				NUMBER(3)

arg				Optional.
				NUMBER(3)

asn				Optional.
				NUMBER(3)

asp				Optional.
				NUMBER(3)

cys				Optional.
				NUMBER(3)

gln				Optional.
                                NUMBER(3)

glu				Optional.
                                NUMBER(3)

gly				Optional.
                                NUMBER(3)

his				Optional.
                                NUMBER(3)

ile				Optional.
                                NUMBER(3)

leu				Optional.
                                NUMBER(3)

lys				Optional.
                                NUMBER(3)

met				Optional.
                                NUMBER(3)

phe				Optional.
                                NUMBER(3)

pro				Optional.
                                NUMBER(3)

ser				Optional.
                                NUMBER(3)

thr				Optional.
                                NUMBER(3)

trp				Optional.
                                NUMBER(3)

tyr				Optional.
                                NUMBER(3)

val				Optional.
                                NUMBER(3)

feature_no                      Required.
				Foreign key to feature table.
				NUMBER(10)
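
The NUMBER(p) and NUMBER(p,s) declarations above limit the values each
column can hold: p is the total number of significant digits and s is the
number of digits to the right of the decimal point, so NUMBER(4,2) tops out
at 99.99 and NUMBER(4,3) at 9.999.  The following is a minimal sketch of
that range check in Python.  The helper name fits_number is hypothetical
and not part of any AspGD code; it only mirrors Oracle's behavior of
rounding to the declared scale before checking the precision.

```python
from decimal import Decimal, ROUND_HALF_UP


def fits_number(value, precision, scale=0):
    """Return True if value fits NUMBER(precision, scale) after rounding.

    Hypothetical helper for illustration only.
    """
    # Smallest representable step, e.g. scale=2 -> 0.01, scale=0 -> 1.
    quantum = Decimal(10) ** -scale
    # Round to the declared scale, with ties going away from zero.
    rounded = Decimal(str(value)).quantize(quantum, rounding=ROUND_HALF_UP)
    # precision - scale digits are left for the integer part.
    return abs(rounded) < Decimal(10) ** (precision - scale)


# pi is NUMBER(4,2): two digits before and two after the decimal point.
print(fits_number("99.99", 4, 2))
# 99.999 rounds to 100.00, which no longer fits NUMBER(4,2).
print(fits_number("99.999", 4, 2))
# molecular_weight is NUMBER(6): integers of up to six digits.
print(fits_number(999999, 6))
```

A check like this could be run over a load file before insert, so that
values that would overflow a column are found before Oracle rejects them.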

Other notes:
============


TABLE Protein_Detail
========
protein_detail_no               Required.
				Primary key.  Oracle sequence number.
				Cannot be updated.
				NUMBER(10)

protein_info_no                 Required.
				Foreign key to protein_info table.
				NUMBER(10)

protein_detail_type		Required.
                                VARCHAR2(50)

protein_detail_value		Required.
                                VARCHAR2(240)

start_coord                     Optional.
				NUMBER(10)

stop_coord                      Optional.
				NUMBER(10)

date_created                    Required.
				On insert, set to SYSDATE.
				On update, keep original date.

created_by                      Required.
				On insert, set to USER.
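
The date_created and created_by rules above can be sketched as follows.
This is an illustration only: it uses Python's sqlite3 module in place of
Oracle, so CURRENT_TIMESTAMP stands in for SYSDATE, the db_user variable
is a hypothetical stand-in for Oracle's USER, and the trigger name and
column types are assumptions rather than the AspGD definitions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE protein_detail (
    protein_detail_no    INTEGER PRIMARY KEY,
    protein_info_no      INTEGER NOT NULL,
    protein_detail_type  TEXT NOT NULL,
    protein_detail_value TEXT NOT NULL,
    date_created         TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP,
    created_by           TEXT NOT NULL
);

-- If an update touches date_created, put the original value back.
CREATE TRIGGER protein_detail_keep_date
AFTER UPDATE OF date_created ON protein_detail
BEGIN
    UPDATE protein_detail
       SET date_created = OLD.date_created
     WHERE protein_detail_no = NEW.protein_detail_no;
END;
""")

db_user = "curator"  # hypothetical stand-in for Oracle's USER

# date_created is omitted on insert, so the default timestamp is used.
conn.execute(
    "INSERT INTO protein_detail "
    "(protein_info_no, protein_detail_type, protein_detail_value, created_by) "
    "VALUES (?, ?, ?, ?)",
    (1, "transmembrane domain", "Y", db_user),
)
original_date = conn.execute(
    "SELECT date_created FROM protein_detail"
).fetchone()[0]

# Try to overwrite date_created; the trigger restores the original date.
conn.execute(
    "UPDATE protein_detail "
    "SET protein_detail_value = 'N', date_created = '1999-01-01 00:00:00'"
)
row = conn.execute(
    "SELECT protein_detail_value, date_created, created_by FROM protein_detail"
).fetchone()
print(row)
```

The value change goes through while date_created keeps its insert-time
value.  In Oracle the same rule would more likely be a BEFORE UPDATE row
trigger that resets the new date to the old one.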


Other notes:
============

When a row in protein_info is deleted, cascade the delete to all rows in
the protein_detail table that reference it.
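
A minimal sketch of the cascade-delete rule, again using Python's sqlite3
module as a stand-in for Oracle.  The tables are trimmed to the key
columns, and the sample values are made up for illustration.  Oracle
accepts the same ON DELETE CASCADE clause on a foreign key constraint;
the PRAGMA line is specific to SQLite.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite only enforces foreign keys when this is switched on.
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript("""
CREATE TABLE protein_info (
    protein_info_no INTEGER PRIMARY KEY,
    feature_no      INTEGER NOT NULL
);
CREATE TABLE protein_detail (
    protein_detail_no    INTEGER PRIMARY KEY,
    protein_info_no      INTEGER NOT NULL
        REFERENCES protein_info (protein_info_no) ON DELETE CASCADE,
    protein_detail_type  TEXT NOT NULL,
    protein_detail_value TEXT NOT NULL
);
""")

# Two protein_info rows; the first has two detail rows, the second has one.
conn.executemany(
    "INSERT INTO protein_info (protein_info_no, feature_no) VALUES (?, ?)",
    [(1, 100), (2, 200)],
)
conn.executemany(
    "INSERT INTO protein_detail "
    "(protein_info_no, protein_detail_type, protein_detail_value) "
    "VALUES (?, ?, ?)",
    [
        (1, "transmembrane domain", "Y"),
        (1, "transmembrane domain", "Y"),
        (2, "transmembrane domain", "Y"),
    ],
)

# Deleting the parent row removes its detail rows as well.
conn.execute("DELETE FROM protein_info WHERE protein_info_no = 1")
remaining = [
    r[0] for r in conn.execute("SELECT protein_info_no FROM protein_detail")
]
print(remaining)
```

Only the detail row that belongs to protein_info_no 2 is left after the
delete.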

Some types of information include coordinate data while others do
not.  The following sample data illustrates this:

protein_info_no    protein_detail_type     protein_detail_value    start_coord    stop_coord
============================================================================================
1                  transmembrane domain    Y                       10             20
1                  transmembrane domain    Y                       35             45
1                  molecular weight        140                     null           null

Note that this is just an example to show different types of data; we
already store molecular weight in the main protein_info table and
will not repeat it in the protein_detail table.
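
The sample rows above can be loaded and queried as in the sketch below,
which separates detail types that carry coordinates from those that do
not.  It uses Python's sqlite3 module rather than Oracle, the column
types are simplified, and the audit columns are left out, so treat it as
an illustration of the data shape only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE protein_detail (
        protein_detail_no    INTEGER PRIMARY KEY,
        protein_info_no      INTEGER NOT NULL,
        protein_detail_type  TEXT NOT NULL,
        protein_detail_value TEXT NOT NULL,
        start_coord          INTEGER,
        stop_coord           INTEGER
    )
""")

# The three sample rows from the table above; None is stored as NULL.
sample_rows = [
    (1, "transmembrane domain", "Y", 10, 20),
    (1, "transmembrane domain", "Y", 35, 45),
    (1, "molecular weight", "140", None, None),
]
conn.executemany(
    "INSERT INTO protein_detail "
    "(protein_info_no, protein_detail_type, protein_detail_value, "
    "start_coord, stop_coord) VALUES (?, ?, ?, ?, ?)",
    sample_rows,
)

# Detail rows that describe a region of the protein.
with_coords = conn.execute(
    "SELECT protein_detail_type, start_coord, stop_coord "
    "FROM protein_detail WHERE start_coord IS NOT NULL "
    "ORDER BY start_coord"
).fetchall()

# Detail rows that describe the protein as a whole.
without_coords = conn.execute(
    "SELECT protein_detail_type, protein_detail_value "
    "FROM protein_detail WHERE start_coord IS NULL"
).fetchall()

print(with_coords)
print(without_coords)
```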

Also note that the initial data type for protein_detail_value is
VARCHAR2(240).  This is probably longer than we need, but we will keep
this length for now and see whether any data we add requires a larger
column.
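
One way to watch for values that outgrow the column is to check candidate
data against the 240 limit before loading it.  The sketch below is an
assumption-laden illustration: the function name oversized_values is
hypothetical, and it assumes the column length is counted in bytes and
that the data is UTF-8 encoded, neither of which is stated in this
document.

```python
MAX_VALUE_BYTES = 240  # from the VARCHAR2(240) declaration


def oversized_values(values, limit=MAX_VALUE_BYTES, encoding="utf-8"):
    """Return the values whose encoded length exceeds the column limit.

    Hypothetical helper; byte semantics and UTF-8 are assumptions.
    """
    return [v for v in values if len(v.encode(encoding)) > limit]


# Made-up candidates: two short values, one at the limit, one over it.
candidates = ["Y", "140", "x" * 240, "x" * 241]
too_long = oversized_values(candidates)
print(len(too_long))
```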