TAIR CCC

From WormBaseWiki
Revision as of 16:38, 9 December 2010 by Vanaukenk

Specifications for Curation Pipeline

Summary

This document outlines the Arabidopsis Textpresso for CCC pipeline for the initial trial run.

The trial run will search all papers published in 2008 in the Textpresso for Arabidopsis corpus.

Search results will be stored in three files:

1) all sentences returned by the search

2) sentences from papers already curated by TAIR for GO Cellular Component

3) sentences from papers not curated by TAIR for GO Cellular Component
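The three-way split described above could be sketched as follows. This is a minimal illustration only, not the pipeline's actual code: the function name `partition_sentences`, the `(paper_id, sentence)` tuple layout, and the tab-separated output written by `write_tsv` are all assumptions made for the example.

```python
from typing import Iterable, List, Set, Tuple

Hit = Tuple[str, str]  # (paper_id, sentence); this layout is an assumption


def partition_sentences(
    hits: Iterable[Hit], curated_paper_ids: Set[str]
) -> Tuple[List[Hit], List[Hit], List[Hit]]:
    """Return (all hits, hits from curated papers, hits from uncurated papers)."""
    all_hits: List[Hit] = []
    curated: List[Hit] = []
    not_curated: List[Hit] = []
    for paper_id, sentence in hits:
        hit = (paper_id, sentence)
        all_hits.append(hit)
        # A paper counts as curated if its ID is in the supplied set.
        if paper_id in curated_paper_ids:
            curated.append(hit)
        else:
            not_curated.append(hit)
    return all_hits, curated, not_curated


def write_tsv(path: str, hits: Iterable[Hit]) -> None:
    """Write one hit per line as paper_id<TAB>sentence (hypothetical format)."""
    with open(path, "w", encoding="utf-8") as handle:
        for paper_id, sentence in hits:
            handle.write(f"{paper_id}\t{sentence}\n")
```

Calling `partition_sentences` once and passing each of the three returned lists to `write_tsv` with a different path would produce the three files. The second and third lists are disjoint and together contain every hit in the first.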

Annotations can be made using an on-line curation form:

http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/tair/tair_ccc.cgi

The form produces two different outputs:

1) a three-column 'user submission' output

2) output in standard GO Gene Association File (GAF) format
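To illustrate the second output, the sketch below formats one tab-delimited annotation line. It assumes the 17-column GAF 2.0 layout; this page does not state which GAF version the form emits, so the real output may differ. The function name `gaf_line` and every example value (object ID, symbol, reference) are placeholders, not real TAIR data. The layout of the three-column 'user submission' output is not specified here, so it is not sketched.

```python
def gaf_line(db, object_id, symbol, go_id, reference, evidence,
             object_type, taxon, date, assigned_by,
             qualifier="", with_from="", name="", synonym="",
             extension="", product_form=""):
    """Build one GAF 2.0 annotation line (17 tab-separated columns).

    The aspect column is fixed to "C" because this pipeline targets
    GO Cellular Component annotations.
    """
    fields = [
        db,            # 1  DB
        object_id,     # 2  DB Object ID
        symbol,        # 3  DB Object Symbol
        qualifier,     # 4  Qualifier (optional)
        go_id,         # 5  GO ID
        reference,     # 6  DB:Reference
        evidence,      # 7  Evidence Code
        with_from,     # 8  With/From (optional)
        "C",           # 9  Aspect: C = Cellular Component
        name,          # 10 DB Object Name (optional)
        synonym,       # 11 DB Object Synonym (optional)
        object_type,   # 12 DB Object Type
        taxon,         # 13 Taxon
        date,          # 14 Date (YYYYMMDD)
        assigned_by,   # 15 Assigned By
        extension,     # 16 Annotation Extension (optional)
        product_form,  # 17 Gene Product Form ID (optional)
    ]
    return "\t".join(fields)


# Placeholder values for illustration only.
example = gaf_line(
    db="TAIR", object_id="locus:0000000", symbol="GENE1",
    go_id="GO:0005634", reference="PMID:00000000", evidence="IDA",
    object_type="protein", taxon="taxon:3702", date="20101209",
    assigned_by="TAIR",
)
```

A GAF 2.0 file would normally begin with a `!gaf-version: 2.0` header line, followed by one such line per annotation.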

Pipeline Details

Paper Acquisition