Difference between revisions of "Specifications for CCC Curation from Textpresso Search Page"
Line 10: | Line 10: | ||
3) Removed information within the <field_references> tags - this was a scrambled sentence, is this how they are typically identified? | 3) Removed information within the <field_references> tags - this was a scrambled sentence, is this how they are typically identified? | ||
+ | |||
+ | 4) Potentially curatable sentences are found within the <field_results> tags. | ||
+ | |||
+ | 5) Going from XML to curation form: | ||
+ | |||
+ | |||
Revision as of 22:58, 5 January 2011
Requirements for Using Textpresso Search Results in General CCC Curation
These specifications are for allowing a curator to search any Textpresso implementation using the CCC categories, submit the resulting sentences to a curation form, make annotations, and download the annotations in a gene_association file format.
This pipeline would make use of the XML format of a returned sentence. An XML version of sample search results from WBPaper00037859 was edited:
1) Removed all category names between the <annotation> tags.
2) Kept all information in within the <bibliography> tags.
3) Removed information within the <field_references> tags - this was a scrambled sentence, is this how they are typically identified?
4) Potentially curatable sentences are found within the <field_results> tags.
5) Going from XML to curation form:
Back to Gene Ontology