Difference between revisions of "Updating ontology (.obo) files for the OA"

From WormBaseWiki
Jump to navigationJump to search
Line 1: Line 1:
What fields link to an .obo file
+
=Updating .obo files=
  
Where are they stored
+
===Populating Postgres with object connections from the latest build files===
  
Which ones need to updated
+
Variation-gene and variation-paper connection information retrieved in phenote depends on information from the latest WS. 
 +
After the upload, these .obo files in Postgres need to be repopulated with the most current information. 
  
Updating Allele-Gene connections
+
Run query for the most current information (it probably matters that the queries maintain the same column output?):
 +
#Allele_gene connections for the variations in the allele group, and not SNPs etc. AQL
 +
(AQL) select g, g->gene, g->gene->public_name from g in class variation where exists_tag g-> allele
 +
Sort (optional),
 +
Export (choose Separator character set to blank (TAB))  as WS#_vargene into folder Phenote
  
Updating Transgene-Paper connections
+
#Transgene_summary_paper connections
 +
->List transgenes associated with paper
 +
(AQL) select t, t->reference, t->summary from t in class transgene where exists t->reference
 +
Sort (optional),
 +
Export (choose Separator character set to blank (TAB))  as WS#_transpapsum into folder Phenote on Desktop
 +
 
 +
#Rearrangement_inside_gene connections
 +
->List rearrangements with LG and 'genes inside' and ‘gene outside’ (public names only)
 +
(AQL) select r, r->map, r->gene_inside->public_name, r->gene_outside->public_name from r in class rearrangement
 +
Sort (optional)
 +
Export as WS#_rearragene
 +
 
 +
Repopulating Postgres
 +
Transfer files to acedb@tazendra.caltech.edu:/home/acedb/karen/populate_gin_variation
 +
 
 +
//renaming WS188_vargene.txt to Variation_gene.txt//
 +
$scp WS188_vargene.txt acedb@tazendra.caltech.edu:/home/acedb/karen/populate_gin_variation/Variation_gene.txt
 +
 
 +
//renaming WS188_rearragene.txt to rear_simple.txt
 +
$scp WS188_rearragene.txt acedb@tazendra.caltech.edu:/home/acedb/karen/populate_gin_variation/rear_simple.txt
 +
 
 +
//renaming WS188_transpapsum to transgene_summary_reference.txt
 +
$scp WS188_transpapsum.txt acedb@tazendra.caltech.edu:/home/acedb/karen/populate_gin_variation/transgene_summary_reference.txt
 +
 
 +
Running scripts to populate Phenote .obo files
 +
There are two scripts,  populate_ali_alleleinfo.pl and make_obo.pl that need to be run to populate Postgres with the updated variation information. It is important to run these scripts every time the variation information is updated.
 +
Both apps are on tazendra in the same directory as the updated variation info.
 +
Run scripts:
 +
cd to /home/acedb/karen/populate_gin_variation/
 +
$ ./populate_ali_alleleinfo.pl
 +
$ ./ make_obo.pl
 +
 
 +
populate_ali_alleleinfo.pl updates information based on Variation_gene.txt and
 +
transgene_summary_reference.txt. Make sure files are named accordingly or the program won’t see them. 
 +
 
 +
make_obo.pl creates a text .obo based on rearr_simple.txt and
 +
Variation_gene.txt. This script populates the WS current info.
 +
 
 +
NOTE: populate_gin_variation updates data based on variation_tab_wbgene
 +
file (in postgres / cgi) , which is no longer current.
 +
 
 +
To check if the repopulation scripts worked, check out the ws current info field
 +
http://tazendra.caltech.edu/~azurebrd/var/work/phenote/ws_current.obo
 +
The date will tell you when it was last updated; it should reflect the date the script was run.
  
Updating Rearrangement-Gene connections
 
  
  

Revision as of 21:45, 13 March 2008

Updating .obo files

Populating Postgres with object connections from the latest build files

Variation-gene and variation-paper connection information retrieved in phenote depends on information from the latest WS. After the upload, these .obo files in Postgres need to be repopulated with the most current information.

Run query for the most current information (it probably matters that the queries maintain the same column output?):

  1. Allele_gene connections for the variations in the allele group, and not SNPs etc. AQL

(AQL) select g, g->gene, g->gene->public_name from g in class variation where exists_tag g-> allele Sort (optional), Export (choose Separator character set to blank (TAB)) as WS#_vargene into folder Phenote

  1. Transgene_summary_paper connections

->List transgenes associated with paper

(AQL) select t, t->reference, t->summary from t in class transgene where exists t->reference

Sort (optional), Export (choose Separator character set to blank (TAB)) as WS#_transpapsum into folder Phenote on Desktop

  1. Rearrangement_inside_gene connections

->List rearrangements with LG and 'genes inside' and ‘gene outside’ (public names only) (AQL) select r, r->map, r->gene_inside->public_name, r->gene_outside->public_name from r in class rearrangement Sort (optional) Export as WS#_rearragene

Repopulating Postgres Transfer files to acedb@tazendra.caltech.edu:/home/acedb/karen/populate_gin_variation

//renaming WS188_vargene.txt to Variation_gene.txt// $scp WS188_vargene.txt acedb@tazendra.caltech.edu:/home/acedb/karen/populate_gin_variation/Variation_gene.txt

//renaming WS188_rearragene.txt to rear_simple.txt $scp WS188_rearragene.txt acedb@tazendra.caltech.edu:/home/acedb/karen/populate_gin_variation/rear_simple.txt

//renaming WS188_transpapsum to transgene_summary_reference.txt $scp WS188_transpapsum.txt acedb@tazendra.caltech.edu:/home/acedb/karen/populate_gin_variation/transgene_summary_reference.txt

Running scripts to populate Phenote .obo files There are two scripts, populate_ali_alleleinfo.pl and make_obo.pl that need to be run to populate Postgres with the updated variation information. It is important to run these scripts every time the variation information is updated. Both apps are on tazendra in the same directory as the updated variation info. Run scripts: cd to /home/acedb/karen/populate_gin_variation/ $ ./populate_ali_alleleinfo.pl $ ./ make_obo.pl

populate_ali_alleleinfo.pl updates information based on Variation_gene.txt and transgene_summary_reference.txt. Make sure files are named accordingly or the program won’t see them.

make_obo.pl creates a text .obo based on rearr_simple.txt and Variation_gene.txt. This script populates the WS current info.

NOTE: populate_gin_variation updates data based on variation_tab_wbgene file (in postgres / cgi) , which is no longer current.

To check if the repopulation scripts worked, check out the ws current info field http://tazendra.caltech.edu/~azurebrd/var/work/phenote/ws_current.obo The date will tell you when it was last updated; it should reflect the date the script was run.


back

--kjy