WormBase-Caltech Weekly Calls March 2016

March 3, 2016

Sentence-level indexing

Chris playing around with flagging sentences as positive/negative for phenotype and flagging keywords as true or false positives
It may be helpful to objectify sentences and generate unambiguous, unique IDs for sentences
There will likely be difficulty in unambiguously pulling out sentences from re-processed papers
Best identifier would be the sentence itself
Sentence 'objects' could be connected to annotations in WB

Proteomic datasets

Wen wants to systematically collect proteomic datasets
Many are scattered across different resources
Would also help ParaSite to index similar datasets for parasites
MESH terms may help identify proteomic papers

MOD-human disease collaboration

FlyBase working on a shared template for curating human disease
SGD, ZFIN, MGI, FlyBase, and WormBase meeting at Caltech to discuss
One priority is minimizing redundancy of effort
There is a common portal for human diseases in progress
Should consider Monarch initiative approach

AddGene & WB constructs

Karen working with AddGene to pull in worm reagents (plasmids/constructs)
AddGene does want to have better linking to WormBase

Metabolomic data

Can we show chemical structures for metabolites?
User (M. Witting) says metabolism people will want monoisotopic masses - these masses aren't available through ChEBI at the moment and it will be a while for them to provide them. Will find a formula and sources files for Juancarlos to implement a calculator in the OA.
We already use, sync and collaborate with ChEBI

March 10, 2016

Adding WormBase-Parasite Paper to the Curation Pipeline (Kimberly)

Developing new paper editor for Jane and Michael P. to approve parasite papers
Adding species information to ALL papers; can be one or more species per paper
Would like to then incorporate parasite papers into downstream SVM flagging pipeline and curation status form
Species information could be used to keep curation stats clear
We should discuss the curation and tracking of non-elegans species on the next site-wide call
Might be best to have a separate ParaSite-specific curation status form; depends on use-case
We may be able to just add new columns to the curation status form, Caltech vs EBI
Michael and Jane want to curate gene function, e.g. GO; not sure what else

Dynamo-DB backend for Datomic

REST-API now switched to dynamodb backend.
Juancarlos could set up a form to try Datomic queries

WB FAQ page

Need to request write access from Todd
We should find out whether this page is used much. It needs updating considerably.

March 17, 2016

Xiaodong taking year leave

Will try working with company in China for a year
Xiaodong will start April 15
Daniela taking over antibody curation, Chris & Daniela taking over gene regulation and sequence feature
Xiaodong will discuss antibodies for disease with Ranjana

Tissue enrichment tool (Raymond)

Should development continue with integration into WormBase? Yes, no one else is doing it
GO enrichment tool in Intermine?

Text mining: brainstorming

We can likely explore many more options in terms of text mining
All possibly recognizable entities could have data filled and displayed to curators and/or end users
What about showing PDF markup from papers in PubMed Central? There is increasing interest in participation
We're trying sentence-level SVM for phenotype; will see how that works
We can quote papers verbatim in annotations, but should probably use double quotes from now on
We'll try to generate putative annotations automatically that can be confirmed or rejected by a curator or author

March 24, 2016

Outsourcing compute time

SPELL requires 30 hours with 16GB memory
Amazon cloud could offer 244GB memory at $3 per hour
We already have an Amazon account

March 31, 2016

Reagent data submission form (from Nobert Perrimon)

Data table provided as an example submission of reagent information at point of publication
Several reagent types included
We may want more specific information, e.g. sequence(s), relevant gene(s)
Would we need/want other follow-up tables to fill in more specific information?
Genetic variations should be added, including sequence
WB curators should draw up a table of minimal information for their data type
Chris will clean up and send around table for RNAi data submission

Proteomic papers (Wen)

Searching title for "proteome"
~230 papers total
~187 proteomic papers might be curatable
Many may not have expression clusters, but Wen did find new datasets

WormBase-Caltech Weekly Calls March 2016

Contents

March 3, 2016

Sentence-level indexing

Proteomic datasets

MOD-human disease collaboration

AddGene & WB constructs

Metabolomic data

March 10, 2016

Adding WormBase-Parasite Paper to the Curation Pipeline (Kimberly)

Dynamo-DB backend for Datomic

WB FAQ page

March 17, 2016

Xiaodong taking year leave

Tissue enrichment tool (Raymond)

Text mining: brainstorming

March 24, 2016

Outsourcing compute time

March 31, 2016

Reagent data submission form (from Nobert Perrimon)

Proteomic papers (Wen)

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools