Difference between revisions of "Paper Pipeline - To Do List"
From WormBaseWiki
Jump to navigationJump to searchLine 11: | Line 11: | ||
'''Finish documentation of current paper type mappings - PubMed vs postgres vs Journal - for informing SVMs and Textpresso searches - Caltech''' | '''Finish documentation of current paper type mappings - PubMed vs postgres vs Journal - for informing SVMs and Textpresso searches - Caltech''' | ||
− | '''Decide how to handle upload of Genetics papers | + | '''Decide how to handle upload of Genetics papers - Karen, Juancarlos, Kimberly''' |
==Long-Term== | ==Long-Term== |
Revision as of 11:23, 21 July 2009
Short-Term
Correct invalid PMIDs and their associated paper types - Kimberly
- How often do we want to check for invalid PMIDs?
- PubMed maintains a file of obsolete PMIDs on their ftp site: ftp://ftp.ncbi.nlm.nih.gov/pubmed/deleted_pmids.txt
Write up a summary of the weekly checking script - Kimberly and Juancarlos
Add a limited number of new paper types to allow for single type classification - Kimberly and Juancarlos
Finish documentation of current paper type mappings - PubMed vs postgres vs Journal - for informing SVMs and Textpresso searches - Caltech
Decide how to handle upload of Genetics papers - Karen, Juancarlos, Kimberly
Long-Term
Discuss timeline and implications for changing the Paper models to allow for multiple types - WormBase
Decide if, and how, we want to run a script that cross-checks PubMed and postgres data - Caltech