Difference between revisions of "WormBase-Caltech Weekly Calls"

From WormBaseWiki
Jump to navigationJump to search
m
Line 12: Line 12:
 
[[WormBase-Caltech_Weekly_Calls_February_2012|February]]
 
[[WormBase-Caltech_Weekly_Calls_February_2012|February]]
  
 +
[[WormBase-Caltech_Weekly_Calls_March_2012|March]]
  
  
== March 1, 2012 ==
 
  
 
GO Meeting
 
*Focused on annotation pipelines; improving efficiency/effectiveness
 
*How to make GO annotations more 'expressive'
 
*GO would like to move towards more expressive statements
 
*Example: If a gene is involved in a function or process, where in the cell does this take place
 
*Common Annotation Framework
 
*Current/future members of the GO network can annotate using the same version of GO, same tools and standards
 
*Quality controls checks: e.g. do you have all the fields necessary to make an annotation
 
*GO hopes to centralize all of the data handling, formatting
 
*LEGO - Logical Extensions of GO
 
*We should pilot how we want to handle this; similar to how concise descriptions are constructed
 
*WormBase curates phenotypes, pathways, etc.
 
*Defining useful relationships to curate/annotate: Cross-products with defined relations
 
*Pilot: Take subdomains, pathways, try extended version of curation on these
 
*How do we capture that fly eye development is relevant to human biology?
 
**Humans don't have compound eyes - not the point
 
**The pathways are the same or similar; EGF signaling
 
*WormBase Process curation could really benefit from GO's adoption of this strategy
 
*Need to consider what the "right" way to approach this issue; need good pilot
 
*Where is the value? How do we focus on this?
 
*Another annotation pipeline: Phylogenetic Annotation and INference Tool (PAINT)
 
**How best to make these inferences?
 
**What kind of inferences can you make about organismal- or organ-specific processes?
 
***Uberon has framework for interspecies anatomical comparisons
 
**PAINT tool for nematodes?
 
 
 
Upload for WS231
 
*Interaction file upload took several hours
 
*Check if virtual memory is being used
 
*Likely culprit is the extra data and XREFs in the Interactor_info hash
 
*Can objectify the Interactor_info to be a tag in the main ?Interaction model
 
*We should warn EBI/Hinxton about this
 
 
 
WormBase Curator Interview next Thursday
 
 
 
Migration of Reporter_gene object annotations from Expr_pattern OA to Transgene OA
 
*Everything seems OK
 
 
 
SPELL
 
*Papers with less than three experiments, statistics calculations cause slow-down, memory limitations
 
*Now can bypass this problem
 
*We are now operating SPELL on our local machines
 
*Do Amazon instances function/behave differently than local server?
 
**Need to compare; find benefits & drawbacks
 
*Use Amazon server as a dynamic name server
 
*Users shouldn't notice a difference
 
*We won't need to ask Todd for anything; we can fix it ourselves
 
 
 
GO Meeting breakout session
 
*Software architecture for upcoming GO expansion (CAT - Common Annotation Tool)
 
*How does Textpresso integrate?
 
*What kind of annotation would GO expect Textpresso to do?
 
*User will be able to do guided text mining operations
 
**Example: regular expressions, then HMM, then export to CAT
 
*No forseeable roadblocks
 
*Maybe standardize all of the text mining types and methods behind them
 
*Develop paper-viewer? Apart from CAT, text mining flow? Separate module
 
 
 
 
== March 15, 2012 ==
 
 
 
RNAi clone mappings
 
*Can we submit PCR primers to Hinxton for genome mapping?
 
**They say so, we'll have to test
 
**Usually we deal with PCR_product objects like sjj_* or mv_*
 
 
 
Transgene sequences
 
*Authors explain PCR construction of transgenes; provide explicit sequence?
 
 
 
Should we make a standard submission format for authors?
 
*Maybe, authors should have to submit explicit sequences for RNAi probes, transgenes, etc.
 
*Enforce standards at what level? Editors, reviewers, journal, user community, etc...
 
*Submit PCR primers?
 
*We will draft a letter to community/editors to request explicit sequences for RNAi experiments
 
 
 
Should we perform sequencing of Ahringer (and other) clones ourselves?
 
*Write grant, perform sequencing?
 
 
 
New website release
 
*We will draft an e-mail/blog post about official release of new website at end of March
 
 
 
SVM Precision/Recall
 
*Standardize methods of calculating precision and recall on SVM results?
 
*True Positive = a positive identified by SVM that is actually positive
 
*False Positive = a positive identified by SVM that is actually negative
 
*True Negative = a negative identified by SVM that is actually negative
 
*False Negative = a negative identified by SVM that is actually positive
 
*Precision Rate (Positive Predictive Value) - how many SVM-identified positives are true positives? = True Positives/(True Positives + False Positives)
 
*Recall Rate (Sensitivity) = True Positives/(True Positives + False Negatives)
 
*Specificity = True Negatives/(True Negatives + False Positives)
 
*NPV (Negative Predictive Value) = True Negatives/(True Negatives + False Negatives)
 
 
 
Curator Candidates
 
 
 
== March 22, 2012 ==
 
 
 
New website release
 
*Sending around mass e-mail to WormBase community (9960 e-mail addresses) announcing release
 
*Will notify that old site will still be available (at legacy.wormbase.org?)
 
*Provide link to old website on new website homepage?
 
 
 
Browsable worm model
 
*Implemented by Open Worm project team
 
 
 
SPELL
 
*Monitoring to see if it is up and running
 
*Had a few shutdowns that we weren't aware of
 
*Would like to be ahead of the users so we can restart it
 
*Trying to figure out how to be made aware of any shutdowns
 
*Yeast SPELL has set it up such that a user could paste in a URL to get results
 
**Worm SPELL cannot do this; not setup in the same way
 
**Wen will contact yeast SPELL team; already contacted Michael Cherry
 
*http://imperio.princeton.edu - has nice functionality
 
*Kimberly will ask around
 
 
 
RNAi OA
 
*Working on the new RNAi OA
 
*Will take advantage of Igor's RNAi processing script for dumping from the OA
 
*Would be good if we had a working tool (locally) to map primer sets and clones to DNA text
 
 
 
Interaction and Gene Regulation OAs
 
*Need to be updated (OA and dumping script) to accommodate the new ?Interaction model
 
 
 
== March 29, 2012 ==
 
 
 
BioCurator Meeting
 
*QC Fast poster looks good
 
*Wen presenting poster on SPELL
 
*Kimberly has multiple presentations
 
**WormBase workflow
 
**CCC for Dictybase
 
**Another presentation
 
*Michael and Yuling going; will meet with collaborators
 
*Text-linking groups (Reflect)
 
*Keep an eye out for PDF(paper) viewers/editors/annotators
 
 
 
SPELL upgrade
 
*Upgraded to newer version of software
 
*Automatically keep track of whether or not it is functioning
 
**Regular test query every 30 min to 1 hr
 
*Still using Amazon server as a forwarding service
 
 
 
OA edits
 
*Interaction and Gene_regulation OAs under modification for new ?Interaction model
 
*RNAi OA, generating OA and dumping
 
*Configuring OAs so that single objects exist on a single row in the OA
 
**Is it necessary to map interaction objects (Transgene, Antibody, etc.) to a core interaction entity (Gene, Protein, etc.)
 
  
  

Revision as of 20:58, 12 April 2012

2009 Meetings

2011 Meetings


2012 Meetings

January

February

March



April 12, 2012

RNAi OA

  • OA almost ready to go live
  • Testing now with test curation
  • Should go live next week for official curation


New Website

  • Most problems are being fixed in a timely manner
  • Curators can now edit links and add custom widgets
  • Issues (tracked on GitHub) being dealt with quickly


BioCurator Meeting

  • Good meeting, bigger than before
  • Common themes: data standards, how to educate users of database materials and how to use it (and think critically)
  • How can MODs work better with journals and PubMed to solve the 'triage' problem?
    • Streamlining the paper acquisition/curation process
    • MODs should ask NLM to take the burden of retrieving PDFs
    • Get lawyers involved to make available?
    • Publishers tend to be lax on text mining rules, maybe will evolve into an easier process
  • Maybe write a grant for research project as a proof-of-principle that triage can be done in an effective/efficient manner
  • May ask ISB (Int Society Biocurators) for help with this
  • Sequence and protein curation: tools, databases (topic-specific; pathways, cancer, etc.)
  • GeneWiki for human gene annotation
    • One page for each gene; already have ~10,000 articles
    • ~Dozen editors, credibility of authors checked (?)
    • Reasonably satisfied with coverage of human disease genes
  • Whole-genome sequencing of individuals
    • Newly identified genetic disorder
    • VAST instead of BLAST
  • Tool to identify primers from papers and map them to the genome automatically
  • Intermine discussed
    • Comparable to WormMart
    • Object-oriented database
    • Performs similar to WormBase
    • Many pre-canned queries
    • Advanced search Query-builder available
    • MODs switched over to Intermine from BioMart
    • WormMart - Will Spooner tried to provide queries that are more natural
    • We can work to build an interface on top of Intermine, etc.
    • Todd has made progress with getting Intermine for WormBase
  • Lot's of specialized talks, reduced the productivity (compared to BioCreative meeting)
  • Curators explaining their curation pipeline
  • Textpresso still popular ;)
    • Six out of seven MODs using Textpresso
    • Discussed text mining in particular applications (eg. CCC)
    • Textpresso only tool using full-text for mining
    • Pete from FlyBase: SVM results are deteriorating (similar to WormBase)
      • Start training from scratch; hopefully get better recall/precision numbers
  • Natural language processing on figure legends/captions
    • Tries to find text in the body that relates to figure
    • Possible collaboration with Texpresso
  • NLP research group in Germany
    • 'Actor', 'agent' etc. and relationships (RDF triplets)
  • Doug Howe (ZFIN), zebrafish corpus small enough, doesn't need Textpresso
  • Julio Collado-Vides, Textpresso for E. coli fell apart, but trying to get back together


Paul will meet someone from Elsevier

  • Image curation/ rights issues


Genetic Interaction ontology

  • SGD on board with ontology so far; performing trial curation
  • FlyBase interested in using as well; will meet with Chris and Rose in May to discuss