Difference between revisions of "November 25, 2009 - Sequence Curation Flags"

From WormBaseWiki
Jump to navigationJump to search
Line 53: Line 53:
 
! Curator(s)
 
! Curator(s)
 
! Comments
 
! Comments
 +
! Current pipeline sufficient?
 
|-
 
|-
 
! gene symbol
 
! gene symbol
Line 60: Line 61:
 
! Kimberly, Mary Ann?
 
! Kimberly, Mary Ann?
 
! Currently being combined with seqchange.  Could possibly employ secondary screen with categories.
 
! Currently being combined with seqchange.  Could possibly employ secondary screen with categories.
 +
!
 
|-
 
|-
 
! mapping data
 
! mapping data
Line 66: Line 68:
 
!  
 
!  
 
!  
 
!  
 +
!
 
!
 
!
 
|-
 
|-
Line 71: Line 74:
 
! 248
 
! 248
 
! worm-bug, stlouis, xiaodong (xdwang)
 
! worm-bug, stlouis, xiaodong (xdwang)
 +
!
 
!
 
!
 
!
 
!
Line 81: Line 85:
 
! Textpresso categories
 
! Textpresso categories
 
! Ruihua, Gary?
 
! Ruihua, Gary?
 +
!
 
!
 
!
 
!
 
!
Line 90: Line 95:
 
!
 
!
 
! Ideally divided into four categories: a change in a gene's structure, the addition of an isoform, a change to one of the SL1/SL2 or polyA site features, a sequence correction in the N2 reference genome
 
! Ideally divided into four categories: a change in a gene's structure, the addition of an isoform, a change to one of the SL1/SL2 or polyA site features, a sequence correction in the N2 reference genome
 +
!
 
|-
 
|-
 
! sequence change
 
! sequence change
 
! 981
 
! 981
 
! genenames
 
! genenames
 +
! SVM
 
!
 
!
 
!
 
!
Line 101: Line 108:
 
! 50
 
! 50
 
! tbieri
 
! tbieri
 +
!
 
!
 
!
 
!
 
!
Line 108: Line 116:
 
! 1372
 
! 1372
 
! Erich, Gary, Jolene
 
! Erich, Gary, Jolene
 +
!
 
!
 
!
 
!
 
!

Revision as of 16:04, 24 November 2009

Back to Caltech documentation

Call Information

4:30pm GMT | 11:30am EST | 10:30am CST | 8:30am PST

US: 1-877-384-2311, +1-480-629-1629

UK: 0800-358-3475, +44-207-154-0025

Canada: 1-866-243-1291

participant access code: 822114

Location: wherever you are

Participants

Review sequence-related first pass flags

http://tazendra.caltech.edu/~postgres/cgi-bin/curator_first_pass.cgi

Pipelines and options for flagging

SVMs

  • Curated papers are the best training set. Flagged papers can be used, if flagging was generally consistent.
  • Curation status form has lists, but not completely up-to-date.

Textpresso

  • pattern matching
  • category searches

Author flags

  • Curators need to tell Juancarlos they'd like to receive emails when authors flag a data type.
  • Caltech needs to supply list of papers flagged since September 2009.
  • Stats on return rates as of November 12, 2009 (supplied by Juancarlos):

Since Sept 1st, we have sent out 195 requests, and gotten back 72 results (36.9%).

Since Oct 1st, we have sent out 147 requests, and gotten back 52 results (35.3%).

Since Nov 1st, we have sent out 18 requests, and gotten back 7 results (38.9%).

Current status of each sequence-related flag

}
Flag name Number of papers flagged manually (from curation status form) Flag email (from first pass form) Current approach Curator(s) Comments Current pipeline sufficient?
gene symbol 342 genenames, vanauken SVM (see comments) Kimberly, Mary Ann? Currently being combined with seqchange. Could possibly employ secondary screen with categories.
mapping data 194 genenames
sequence features 248 worm-bug, stlouis, xiaodong (xdwang)
mass spectrometry 65 gw3, worm-bug Textpresso categories Ruihua, Gary?
structure correction 333 worm-ticket, worm-bug Ideally divided into four categories: a change in a gene's structure, the addition of an isoform, a change to one of the SL1/SL2 or polyA site features, a sequence correction in the N2 reference genome
sequence change 981 genenames SVM
new SNPs 50 tbieri
new mutant {alleles) 1372 Erich, Gary, Jolene

Minutes, Action Items