Difference between revisions of "WormBase-Caltech Weekly Calls"

From WormBaseWiki
Jump to navigationJump to search
Line 39: Line 39:
 
[[WormBase-Caltech_Weekly_Calls_November_2017|November]]
 
[[WormBase-Caltech_Weekly_Calls_November_2017|November]]
  
 
+
[[WormBase-Caltech_Weekly_Calls_December_2017|December]]
== December 7, 2017 ==
 
 
 
=== Next Citace upload ===
 
* Submit files to Wen by Jan 16, 2018
 
* There shouldn't be model changes (that affect Caltech) for this next upload
 
* Juancarlos goes on vacation Dec 19th
 
 
 
=== WB Curator candidate ===
 
* April, could join WB for 1-2 years
 
* Could possibly interview next Thursday
 
* Could curate sequence features, enhancers, alleles/mutations/variations?
 
* Could start in January
 
 
 
=== Expression clusters ===
 
* Wen and Raymond discussed; will add an "uncertain" tag or something similar
 
* Tag can be used to filter out datasets that are complex (like mixed cell/tissue types)
 
 
 
=== Automated descriptions & expression clusters ===
 
* Many expression clusters are associated with infection
 
* Such data not currently incorporated into automated gene descriptions
 
* Wen can talk with Ranjana to incorporate (particularly for info-sparse genes)
 
* Can use GO terms, but need to be careful about how they are applied (what relationships they have) to genes
 
 
 
=== Author First Pass Form ===
 
*[https://docs.google.com/spreadsheets/d/1sS_uAjBJ2r5H90Lam62Ai0HunjwvfjnklkFNrDoNXeU/edit Analysis of current flags and numbers of entries]
 
 
 
==== Overall approach ====
 
*AFP form would move from being just a flagging pipeline towards a validation and data entry pipeline
 
*Also move from free text boxes towards autocompletes, controlled vocabularies
 
*If this is a validation and a portal for data entry forms, should we display everything we can mine?
 
*Idea is to eventually create data entry forms for as many data types as possible
 
*Long term goal would include linking to TPC and evidence sentences for validation
 
*Data types not curated by WB would need to be re-evaluated: continue to mine but share with other curation groups? no longer mine? new WB objects or pages?
 
*Periodically review the form and add/remove data types and curation forms as needed
 
* Would be good to ask authors if they had to omit data from the recently published paper and if they want to micropublish
 
 
 
==== Questions ====
 
*Question for WB curators: Are we missing data types? 
 
*Question for WB curators: Are there data types that we don't curate (e.g. domain analysis) but for which we could share mining information with other groups, e.g. UniProt?
 
*Question for WB curators: Are there data types we should no longer mine?
 
*Question for Wen: Does it make sense to include an afp flag for expression cluster?
 
*Question for Karen: What afp tables are populated from the Genetics/G3 pipeline?
 
**journal first pass forms : http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?
 
Details here: http://wiki.wormbase.org/index.php/Genetics_Markup_by_Textpresso_and_First_Pass
 
 
 
[[File:Screen_Shot_2017-12-07_at_9.28.56_AM.png]]
 
 
 
==== Thoughts ====
 
* Could we add to citation index score as incentive to provide validation/data?
 
* How do we make sure we are not turning off/overwhelming users/participants/authors?
 
* Can gamify the forms; give points?
 
* Create an app that has user-friendly data submission tools/forms?
 
 
 
 
 
== December 14, 2017 ==
 
 
 
=== MOD data in Data Commons ===
 
* NIH Data Commons wants DB data in common formats
 
* We've wanted APIs or web service to allow access all data in standard format
 
* Good to have one place for all data from all MODs
 
* This should be a work product of the AGR
 
* Consider priority: data site, website
 
* AGR working groups work towards data standards
 
* GAF (gene association file) could be used now; what can we learn about the GAF: development, usage?
 
* GAF hasn't changed much in last ~10 years
 
 
 
=== Contingency plan for Juancarlos' vacation ===
 
* If Tazendra crashes/dies, we can perform a hard drive swap at Caltech
 
 
 
=== Essential genes ===
 
* David Fay asked for list of essential C. elegans genes
 
* David asked for genes whose mutants are inviable as homozygotes, ignoring RNAi
 
* Could we generate a file for users? Can we make a conservative list with ~no false positives
 
* What are all the criteria (that are curated) that could be used to decide whether a gene is essential?
 
* How does any user define "essential"? What are the cutoffs/thresholds?
 
* Best option may be to provide a table with all relevant phenotype annotation attributes and let user decide based on filters
 
 
 
=== Expression certainty ===
 
* Proteomics experiments have subjective thresholds
 
* How do we handle cases where one peptide maps to multiple genes?
 
* How should large scale expression data be incorporated into enrichment analysis?
 
 
 
=== GitHub tracker for Caltech curation issues ===
 
* "Caltech curation" repository - primarily used for creating curation tool set for Caltech team
 
* "WormBase curation" repository - last used in 2015
 
* Should there be a single repository for tracking? Yes, use the "wormbase-curation" repository
 
* Curators will start submitting tickets in the "wormbase-curation" repository
 
* OA code is in there now
 
* We should add OA dumper code as well
 
 
 
 
 
== December 21, 2017 ==
 
 
 
=== Upload ===
 
* Jan 19th citace upload to Hinxton
 
* Get .ace files to Wen by Tuesday Jan 16th, 10am PST
 
 
 
=== Wen's outreach slides ===
 
* Wen created slides for presenting at San Diego area worm meeting
 
* Meeting on Jan 12, 2018
 
* Wen recreated her slides based on feedback from Sternberg lab meeting
 
* Slides cover step-by-step tutorials for using Wormbase website
 
* For future outreach meetings/presentations, we can reach out ahead of time to find out what topics would be of greatest interest
 
* Curators, review Wen's slides and send her feedback
 
* Can take a look at WormBase YouTube channel for existing videos
 
** https://www.youtube.com/user/WormBaseHD/featured
 
* Chris can send WormMine tutorial videos and/or slides to Wen to share with audience
 
 
 
=== Karen and Daniela going to Bay Area meeting ===
 
* January meeting
 
* WormBase outreach, micropublications
 
 
 
=== April will start in January ===
 
* Will start on transcriptional regulation
 
* May work on allele phenotypes
 
 
 
=== Any curation issues (tool bugs etc.) ===
 
* While Juancarlos is gone, submit tickets to GitHub
 
* Raymond will take a look at
 

Revision as of 16:52, 4 January 2018