Difference between revisions of "Attaching Genes to Papers"
Line 24: | Line 24: | ||
1) With the exception of a dash (-), split text (words or letters) from punctuation. | 1) With the exception of a dash (-), split text (words or letters) from punctuation. | ||
− | Papers where gene associations have been missed for re-training the script: | + | |
+ | '''Papers where gene associations have been missed for re-training the script:''' | ||
WBPaper00035164 - missed BLI-4 | WBPaper00035164 - missed BLI-4 |
Revision as of 20:20, 9 November 2010
Gene Associations Based Upon Abstracts
When papers are added to postgres using the Enter New Papers function of the Paper Editor, the corresponding abstracts are scanned, via a script, for matches to loci, sequence names, and synonyms.
Postgres tables used for this are:
gin_locus
gin_seqname
gin_synonyms
To view the contents of these tables, perform the following type of query using the referenceform.cgi:
SELECT * FROM gin_locus;
Updating the script:
The script that associates genes based upon abstracts does miss some genes because of the way they're expressed in the abstract.
Some ideas on what to change:
1) With the exception of a dash (-), split text (words or letters) from punctuation.
Papers where gene associations have been missed for re-training the script:
WBPaper00035164 - missed BLI-4
Back to 2010_-_Paper_Pipeline:_Documentation_and_Instructions