Difference between revisions of "WormBase-Caltech Weekly Calls"

From WormBaseWiki
Jump to navigationJump to search
m
Line 21: Line 21:
 
[[WormBase-Caltech_Weekly_Calls_February_2016|February]]
 
[[WormBase-Caltech_Weekly_Calls_February_2016|February]]
  
 
+
[[WormBase-Caltech_Weekly_Calls_March_2016|March]]
 
 
== March 3, 2016 ==
 
 
 
=== Sentence-level indexing ===
 
* Chris playing around with flagging sentences as positive/negative for phenotype and flagging keywords as true or false positives
 
* It may be helpful to objectify sentences and generate unambiguous, unique IDs for sentences
 
* There will likely be difficulty in unambiguously pulling out sentences from re-processed papers
 
* Best identifier would be the sentence itself
 
* Sentence 'objects' could be connected to annotations in WB
 
 
 
=== Proteomic datasets ===
 
* Wen wants to systematically collect proteomic datasets
 
* Many are scattered across different resources
 
* Would also help ParaSite to index similar datasets for parasites
 
* MESH terms may help identify proteomic papers
 
 
 
=== MOD-human disease collaboration ===
 
* FlyBase working on a shared template for curating human disease
 
* SGD, ZFIN, MGI, FlyBase, and WormBase meeting at Caltech to discuss
 
* One priority is minimizing redundancy of effort
 
* There is a common portal for human diseases in progress
 
* Should consider Monarch initiative approach
 
 
 
=== AddGene & WB constructs ===
 
* Karen working with AddGene to pull in worm reagents (plasmids/constructs)
 
* AddGene does want to have better linking to WormBase
 
 
 
=== Metabolomic data ===
 
* Can we show chemical structures for metabolites?
 
* User (M. Witting) says metabolism people will want monoisotopic masses - these masses aren't available through ChEBI at the moment and it will be a while for them to provide them.  Will find a formula and sources files for Juancarlos to implement a calculator in the OA.
 
* We already use, sync and collaborate with ChEBI
 
 
 
== March 10, 2016 ==
 
 
 
=== Adding WormBase-Parasite Paper to the Curation Pipeline (Kimberly) ===
 
*Developing new paper editor for Jane and Michael P. to approve parasite papers
 
*Adding species information to ALL papers; can be one or more species per paper
 
*Would like to then incorporate parasite papers into downstream SVM flagging pipeline and curation status form
 
*Species information could be used to keep curation stats clear
 
* We should discuss the curation and tracking of non-elegans species on the next site-wide call
 
* Might be best to have a separate ParaSite-specific curation status form; depends on use-case
 
* We may be able to just add new columns to the curation status form, Caltech vs EBI
 
* Michael and Jane want to curate gene function, e.g. GO; not sure what else
 
 
 
=== Dynamo-DB backend for Datomic ===
 
* REST-API now switched to dynamodb backend.
 
* Juancarlos could set up a form to try Datomic queries
 
 
 
=== WB FAQ page ===
 
* Need to request write access from Todd
 
* We should find out whether this page is used much. It needs updating considerably.
 
 
 
 
 
== March 17, 2016 ==
 
 
 
=== Xiaodong taking year leave ===
 
* Will try working with company in China for a year
 
* Xiaodong will start April 15
 
* Daniela taking over antibody curation, Chris & Daniela taking over gene regulation and sequence feature
 
* Xiaodong will discuss antibodies for disease with Ranjana
 
 
 
=== Tissue enrichment tool (Raymond) ===
 
* Should development continue with integration into WormBase? Yes, no one else is doing it
 
* GO enrichment tool in Intermine?
 
 
 
=== Text mining: brainstorming ===
 
* We can likely explore many more options in terms of text mining
 
* All possibly recognizable entities could have data filled and displayed to curators and/or end users
 
* What about showing PDF markup from papers in PubMed Central? There is increasing interest in participation
 
* We're trying sentence-level SVM for phenotype; will see how that works
 
* We can quote papers verbatim in annotations, but should probably use double quotes from now on
 
* We'll try to generate putative annotations automatically that can be confirmed or rejected by a curator or author
 
 
 
 
 
== March 24, 2016 ==
 
 
 
=== Outsourcing compute time ===
 
* SPELL requires 30 hours with 16GB memory
 
* Amazon cloud could offer 244GB memory at $3 per hour
 
* We already have an Amazon account
 
 
 
 
 
== March 31, 2016 ==
 
 
 
=== Reagent data submission form (from Nobert Perrimon) ===
 
* Data table provided as an example submission of reagent information at point of publication
 
* Several reagent types included
 
* We may want more specific information, e.g. sequence(s), relevant gene(s)
 
* Would we need/want other follow-up tables to fill in more specific information?
 
* Genetic variations should be added, including sequence
 
* WB curators should draw up a table of minimal information for their data type
 
* Chris will clean up and send around table for RNAi data submission
 
 
 
=== Proteomic papers (Wen) ===
 
* Searching title for "proteome"
 
* ~230 papers total
 
* ~187 proteomic papers might be curatable
 
* Many may not have expression clusters, but Wen did find new datasets
 

Revision as of 15:35, 7 April 2016