Difference between revisions of "Specifications for CCC Curation from Textpresso Search Page"

Revision as of 23:31, 5 January 2011

Requirements for Using Textpresso Search Results in General CCC Curation

These specifications are for allowing a curator to search any Textpresso implementation using the CCC categories, submit the resulting sentences to a curation form, make annotations, and download the annotations in a gene_association file format.

This pipeline would make use of the XML format of a returned sentence. An XML version of sample search results from WBPaper00037859 was edited:

1) Removed all category names between the <annotation> tags.

2) Kept all information in within the <bibliography> tags.

3) Removed information within the <field_references> tags - this was a scrambled sentence, is this how they are typically identified?

4) Potentially curatable sentences are found within the <field_results> tags.

5) Going from XML to curation form:

Display information within <bibliography> at the top of the page:

Title:

Authors:

Journal:

Year:

DocID:

Type:

Literature:

Accession (PMID):

Abstract:

6) Working from left to right on the curation form:

First box: all entities within the protein or gene tag. The exact tag name for this will be different for each implementation, for example:

protein_celegans

genes_arabidopsis

dicty_genes

Back to Gene Ontology

Difference between revisions of "Specifications for CCC Curation from Textpresso Search Page"

Revision as of 23:31, 5 January 2011

Requirements for Using Textpresso Search Results in General CCC Curation

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools

@@ Line 34: / Line 34: @@
 Abstract:
+) Working from left to right on the curation form:
+First box: all entities within the protein or gene tag.  The exact tag name for this will be different for each implementation, for example:
+protein_celegans
+genes_arabidopsis
+dicty_genes