Difference between revisions of "Dumping Script"

From WormBaseWiki
Jump to navigationJump to search
Line 5: Line 5:
  
 
The papers.ace file is dumped automatically at 2am on the Thursday of the upload and copied to spica at 4am on that same Thursday.
 
The papers.ace file is dumped automatically at 2am on the Thursday of the upload and copied to spica at 4am on that same Thursday.
 +
  
 
The dumping script will check for any dead gene IDs attached to papers and comment them out of the .ace file until they are fixed/deleted from postgres by a curator.
 
The dumping script will check for any dead gene IDs attached to papers and comment them out of the .ace file until they are fixed/deleted from postgres by a curator.
Line 13: Line 14:
  
  
Back to [[2010_-_Paper_Pipeline:_Documentation_and_Instructions]]
+
The dumping script will also check that all associated genes are in the format: WBGenennnnnnnn where the 8 n's correspond to numbers.
 +
 
  
  
The dumping script will also check that all associated genes are in the format: WBGenennnnnnnn where the 8 n's correspond to numbers.
+
Back to [[2010_-_Paper_Pipeline:_Documentation_and_Instructions]]
  
  
 
[[Category:Curation]]
 
[[Category:Curation]]

Revision as of 19:14, 30 August 2010

The dumping script lives here:

/home/postgres/work/citace_upload/papers/dumpPapAce.pl


The papers.ace file is dumped automatically at 2am on the Thursday of the upload and copied to spica at 4am on that same Thursday.


The dumping script will check for any dead gene IDs attached to papers and comment them out of the .ace file until they are fixed/deleted from postgres by a curator.

The AQL query that finds all dead genes in WB is:

select all class gene where ->Species like "*elegans" and ->Status like "Dead"


The dumping script will also check that all associated genes are in the format: WBGenennnnnnnn where the 8 n's correspond to numbers.


Back to 2010_-_Paper_Pipeline:_Documentation_and_Instructions