Difference between revisions of "Populating the OA and New Dumping Script"

Revision as of 17:59, 19 February 2015

1. Need to populate postgres (GO OA tables with data from Phenotype2GO file)

2. File to parse into postgres is currently in this directory on mangolassi:

/home/postgres/work/pgpopulation/go/go_curation/20141106_kevin_godata/newGpaEntries

3. Table mapping columns in the newGpaEntries file to GO OA tables is here:

http://wiki.wormbase.org/index.php/Mapping_the_GAF_to_GO_OA_tables

Will need to add three values to the Qualifier drop-down menu, ‘enables’, ‘involved_in’, ‘part_of’

4. Once file is read into OA tables, then we’ll need a new dumping script.

5. Specs for new OA dumping script

Model for WS248:

http://cvs.sanger.ac.uk/cgi-bin/viewvc.cgi/wormbase/wspec/models.wrm?revision=1.416&root=ensembl&view=markup

Annotation ID

Will need to determine what the starting ID should be Could look at the main .ace file (from gpad conversion - gp_annotation.ace) and start with +1 from the end Currently, that file is on tazendra here: /home/acedb/kimberly/citace_upload/go/gp_annotation.ace

GO_annotation : “00055495”

Gene

From gop_wbgene

Gene “WBGene00004362”

GO_code

From gop_goinference

GO_code “IMP”

Annotation_relation

From gop_qualifier Will need to add three values to the drop-down menu, ‘enables’, ‘involved_in’, ‘part_of’ Then populate all ‘P’ with ‘involved_in’; all ‘F’ with ‘enables’, all ‘C’ with ‘part_of’ Can be a multi-value field, i.e., can have something like ‘NOT’ ‘enables’

Annotation_relation “enables”

Annotation_relation “NOT” Annotation_relation “enables”

Annotation_made_with

gop_with_wbgene

Interacting_gene “WBGene00000936”

Multiple entries

Interacting_gene “WBGene00000936” Interacting_gene “WBGene00000937”

gop_with

Inferred_from_GO_term “GO:0035195”

	gop_with_variation

Variation “WBVar00278468”

gop_with_RNAi

RNAi_result “WBRNAi00035226”

gop_with_phenotype

Phenotype “WBPhenotype:0000059”

Annotation_extension

Same syntax, relation(entity) e.g., has_regulation_target(WB:WBGene00002015) Tag used depends on what type of entity is in parenthesis

go_xrefto

If entity = WBls:nnnnnn Life_stage_relation “relation” “WBlsnnnnn”

If entity = WB:WBGenennnnnnnn Gene_relation “relation” “WBGenennnnnnnn”

If entity = ChEBI:nnnnn Map ChEBI:nnnnn to WBMol Molecule_relation “relation” “WBMolnnnnnnn”

If entity = WB:WBbtnnnnnnn Anatomy_relation “relation” “WBbtnnnnnnn”

If entity = GO:GOnnnnnnn GO_term_relation “relation” “GO:nnnnnnn”

gop_protein (don’t have any at the moment)

Annotation_isoform “UniProtKB:nnnnnn”

gop_paper

Reference “WBPaper00028482”

gop_accession

GO_reference “Gene Ontology Consortium” “GO_REF” “nnnnnnn”

Contributed_by “WormBase”

This will be a default value.

gop_lastupdate

Date_last_updated “YYYY-MM-DD”

Difference between revisions of "Populating the OA and New Dumping Script"

Revision as of 17:59, 19 February 2015

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools

@@ Line 1: / Line 1: @@
 . Need to populate postgres (GO OA tables with data from Phenotype2GO file)
-. File to parse into postgres is in this directory on mangolassi:
+. File to parse into postgres is currently in this directory on mangolassi:
 /home/postgres/work/pgpopulation/go/go_curation/20141106_kevin_godata/newGpaEntries
-. Mapping of columns in the newGpaEntries file to GO OA tables is here:
+. Table mapping columns in the newGpaEntries file to GO OA tables is here:
 http://wiki.wormbase.org/index.php/Mapping_the_GAF_to_GO_OA_tables
-.  Once file is read into OA tables, then we’ll need a new dumping script.
+Will need to add three values to the Qualifier drop-down menu, ‘enables’, ‘involved_in’, ‘part_of’
-Specs for new OA dumping script
+. Once file is read into OA tables, then we’ll need a new dumping script.
+. Specs for new OA dumping script
 Model for WS248:
@@ Line 17: / Line 19: @@
 http://cvs.sanger.ac.uk/cgi-bin/viewvc.cgi/wormbase/wspec/models.wrm?revision=1.416&root=ensembl&view=markup
-. Annotation ID
+*Annotation ID
 	Will need to determine what the starting ID should be
 	Could look at the main .ace file (from gpad conversion - gp_annotation.ace) and start with +1 from the end
@@ Line 25: / Line 27: @@
-. Gene
+*Gene
 	From gop_wbgene
@@ Line 31: / Line 33: @@
-. GO_code
+*GO_code
 	From gop_goinference
@@ Line 37: / Line 39: @@
-. Annotation_relation
+*Annotation_relation
 	From gop_qualifier
 	Will need to add three values to the drop-down menu, ‘enables’, ‘involved_in’, ‘part_of’
@@ Line 49: / Line 51: @@
-. Annotation_made_with
+*Annotation_made_with
 	gop_with_wbgene
@@ Line 66: / Line 68: @@
-	gop_with_variation
+ 	gop_with_variation
 	Variation	“WBVar00278468”
@@ Line 81: / Line 83: @@
-. Annotation_extension
+*Annotation_extension
 	Same syntax, relation(entity)
@@ Line 106: / Line 108: @@
-. gop_protein (don’t have any at the moment)
+*gop_protein (don’t have any at the moment)
 	Annotation_isoform	“UniProtKB:nnnnnn”
-. gop_paper
+*gop_paper
 	Reference “WBPaper00028482”
-. gop_accession
+*gop_accession
 	GO_reference “Gene Ontology Consortium” “GO_REF” “nnnnnnn”
-. Contributed_by “WormBase”
+*Contributed_by “WormBase”
 	This will be a default value.
-. gop_lastupdate
+*gop_lastupdate
 	Date_last_updated	“YYYY-MM-DD”