Difference between revisions of "WormBase-Caltech Weekly Calls"

From WormBaseWiki
(147 intermediate revisions by 8 users not shown)
  
  
GoToMeeting link: https://www.gotomeet.me/wormbase1
 
  
 
= 2020 Meetings =
 
 
[[WormBase-Caltech_Weekly_Calls_March_2020|March]]
 
[[WormBase-Caltech_Weekly_Calls_April_2020|April]]
 
[[WormBase-Caltech_Weekly_Calls_May_2020|May]]
 
== April 2, 2020 ==
 
 
=== Community phenotype requests ===
 
* March 9-28
 
* 2,548 emails went out; 89 bounced; 6 resent; 13 backup; 2,478 successful emails
 
* 361 annotations overall
 
* 48 of the requested papers received curation (2% response rate)
 
* 53 distinct papers overall (5 papers without request)
 
* 53 distinct persons overall
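
The email counts above appear to reconcile arithmetically. A minimal sketch, assuming that resent and backup-address emails count as successful deliveries and that the 2% response rate is curated papers divided by successful emails (the notes do not state either definition):

```python
# Figures copied from the notes above.
sent = 2548
bounced = 89
resent = 6    # assumed: bounced emails that were successfully resent
backup = 13   # assumed: emails delivered via a backup address
papers_curated_on_request = 48

# Assumed reconciliation: successes = sent - bounced + resent + backup
successful = sent - bounced + resent + backup

# Assumed definition of response rate: curated papers / successful emails
response_rate = papers_curated_on_request / successful

print(successful)
print(f"{response_rate:.1%}")
```

Under these assumptions the computed total matches the 2,478 successful emails reported, and the rate rounds to the 2% quoted.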
 
 
 
=== Community curation volunteers ===
 
* Tracking volunteers [https://docs.google.com/spreadsheets/d/1ldECC44PXMilcDO6ctz-8AkRZntfDoV0Wtc4F-T_Zvg/edit?usp=sharing here]
 
* 14 volunteers so far, all have been assigned a WBPerson ID
 
* Chris will set up a webinar tutorial in the coming week or two
 
 
 
=== AFP pipeline ===
 
* Will resend email requests to authors that haven't already responded
 
* May also send out for older papers
 
* May work with people to help
 
* Does the old AFP form still work? It should
 
* If someone has a link to the old form, they won't get one for the new form
 
* Maybe could set up an automatic redirect from the old form to the new form
 
* Received many submissions recently (>20% response rate)
 
 
 
=== Ontology Annotator ===
 
* Need to work on Genotype OA dumper
 
* Turns out semicolons are problematic (currently in genotypes and transgenes) for object names (ontology fields)
 
* Ampersands (&) are also problematic for object names in the OA
 
** 20237  | Is[Pgcy-5::daf-2a::venus; Punc-122::mCherry]                          | 2014-10-08 10:32:45.874519-07
 
** 20239  | Ex[Pgcy-5::casy-1::venus; Pgcy-5::aman-2::mCherry; Punc-122::mCherry] | 2014-10-08 10:45:23.202362-07
 
** 20238  | Is[Pgcy-5::daf-2c::venus; Punc-122::mCherry]                         | 2014-10-08 10:38:19.859078-07
 
** 25249  | Ex[Prheb-1::rheb-1::GFP; unc-119(+]                                  | 2018-06-29 10:16:40.784295-07
 
** 16283  | [hlh-13::GFP;unc-119(+)]                                              | 2013-02-07 17:43:22.384819-08
 
** 26131  | Ex[pedc-3EDC-3::DsRed;pRF4]                                          | 2019-08-14 08:44:49.91063-07
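
Object names like those listed above could be flagged with a simple scan. This is only an illustrative sketch in Python, not the OA's own code; the function name `find_problem_names` and the row layout (id, name) are assumptions, and the third sample row is hypothetical.

```python
import re

# Characters the notes report as problematic in OA object names.
PROBLEM_CHARS = re.compile(r"[;&]")

def find_problem_names(rows):
    """Return (id, name, chars) for every name containing ';' or '&'."""
    flagged = []
    for row_id, name in rows:
        found = sorted(set(PROBLEM_CHARS.findall(name)))
        if found:
            flagged.append((row_id, name, found))
    return flagged

rows = [
    (20237, "Is[Pgcy-5::daf-2a::venus; Punc-122::mCherry]"),
    (16283, "[hlh-13::GFP;unc-119(+)]"),
    (99999, "Ex[hypothetical::GFP]"),  # hypothetical name with no problem characters
]
flagged = find_problem_names(rows)
for row_id, name, chars in flagged:
    print(row_id, name, chars)
```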
 
 
 
=== Use Slack More ===
 
* Slack is a good tool for quick communication among team members; would be good for all curators to join Slack to enable efficient communication
 
 
 
 
 
== April 9, 2020 ==
 
 
 
=== Volunteer curators ===
 
* Have sent out emails to schedule tutorials
 
* Chris had one tutorial with Michael Davies (Alyson Ashe's lab) yesterday
 
* One already scheduled for next Monday with Wilber and Stephanie from Paul's lab
 
* Two others already scheduled for next Tuesday with Lina Dahlberg and Colin Dolphin
 
 
 
=== TAGC is virtual (April 22-25, 2020) ===
 
FYI in case you missed it
 
*You still have to register (it's free), if you hadn't before
 
https://genetics-gsa.org/tagc-2020/registration/
 
 
 
=== Summer students ===
 
* Caltech SURF students (and other summer students worldwide) now are looking for projects
 
* Maybe they could curate for WormBase
 
* In addition to phenotype, they could curate:
 
** Allele/lesion sequence curation (using Allele Sequence form); maybe Paul Davis could make a tutorial video?
 
** Anatomy function, looking for novel info; opportunity to program/code
 
 
 
=== OA semicolon issue ===
 
* Juancarlos has fixed the issues on sandbox
 
* Curators should test on Mangolassi
 
 
 
=== Textmining/automation ===
 
* Daniela will discuss with Christina Zorn from Xenbase
 
* Will discuss SVM, AFP, Textpresso, etc.
 
 
 
=== Retracted WBPapers ===
 
* Jae & Kimberly put in GitHub ticket to make retractions clear on WormBase site
 
* https://github.com/WormBase/website/issues/7637
 
* Can we systematically detect retractions? Yes
 
* What about finding papers that cite retractions? Maybe, but likely tricky
 
 
 
 
 
== April 16, 2020 ==
 
  
=== Community Phenotype Curation Tutorials ===
* Chris has run 6 tutorials, recorded 4
 
* MPG files are saved on Dropbox; ask Chris for access
 
* Plan to edit videos to make tutorial video to post on WB YouTube channel
 
  
=== Author First Pass ===
* May run a webinar and use Zoom to record
 
* May make a short tutorial video
 
* Jae: Is there documentation for terminology used in the form?
 
  
=== Zoom accounts ===
 
* People can try to use Caltech Zoom account
 
  
== April 23, 2020 ==
  
=== Community Phenotype Curation Tutorials ===
 
* Chris has finished first round of tutorials; 8 tutorials, 6 video recordings
 
* There are ~8 new volunteers; will set up tutorials for them soon
 
  
=== ECO code implementation ===
* ?ECO_term to replace ?GO_code in ACEDB models
 
* GAF files with three-letter codes can still be generated by mapping
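
The mapping step mentioned above could look roughly like the following Python sketch. The table is deliberately partial and should be checked against the official ECO-to-GO evidence code mapping before any use; the function name `gaf_evidence_code` is hypothetical and is not part of the ACEDB models or any existing dumper.

```python
# Partial ECO -> three-letter GO evidence code table (illustrative only;
# verify each entry against the official mapping before relying on it).
ECO_TO_GAF = {
    "ECO:0000314": "IDA",
    "ECO:0000315": "IMP",
    "ECO:0000316": "IGI",
    "ECO:0000353": "IPI",
    "ECO:0000270": "IEP",
    "ECO:0000501": "IEA",
}

def gaf_evidence_code(eco_id):
    """Return the three-letter code for an ECO ID, or raise if unmapped."""
    try:
        return ECO_TO_GAF[eco_id]
    except KeyError:
        raise ValueError(f"No GAF evidence code mapped for {eco_id}") from None

print(gaf_evidence_code("ECO:0000314"))
```

Raising on an unmapped ID, rather than returning a default, keeps a GAF dump from silently carrying a wrong evidence code.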
 
  
=== Simplemine for Alliance ===
* Wen has presented proposal to Search group
* Plan is to have a link to the Alliance Simplemine prototype from the Alliance web page
  
=== Venn diagram tool ===
* Conceived by Jae, implemented by Sibyl
* Currently used for interactions data
 
* Could use for other data types like phenotype (e.g. comparing RNAi vs. allele phenotype)
 
* Could also use for Expression data, e.g. comparing results from different methods
 
* Could maybe use for disease data
 
  
=== AFP tutorial ===
* Daniela, Kimberly, Valerio will run through the AFP form with Nikita from Gupta lab tomorrow
* May record in the future to make a tutorial video
 
  
=== Expression markers ===
* SURF student projects: Identifying good expression markers? Maybe, but may require more curation experience
* Wen looked at expression cluster data; hard to find good, very specific (i.e. neuron) markers
* Daniela may (re-)start curating markers for relevant expression patterns
* Wen noticed that many tissue markers are artificial (not necessarily endogenous sequence)
 
* Already have an "Expression markers" widget on anatomy term pages
 
* Could combinations of genes (e.g. cGal) act as markers?
 

Revision as of 21:01, 13 August 2020

Previous Years

2009 Meetings

2011 Meetings

2012 Meetings

2013 Meetings

2014 Meetings

2015 Meetings

2016 Meetings

2017 Meetings

2018 Meetings

2019 Meetings



2020 Meetings

January

February

March

April

May

June

July


August 6th, 2020

Experimental conditions data flow into Alliance

  • Experimental conditions in disease annotations: WB has inducers (used to recapitulate the disease condition) and modifiers (a modifier can ameliorate, exacerbate, or have no effect on the disease condition)
  • We use the WB Molecule CV for inducers and modifiers in disease annotation
  • Experimental conditions in phenotype annotations are free text (captured in remarks); we will probably need to formalize them later on
  • For data flow into the Alliance:
    • In the short term we will load the Molecule CV into the Alliance (Ranjana and Michael P. will work on this)
    • In the near future, groups will switch to a common data model that works for all, with a common ontology or ontologies
  • How do we handle genetic sex? Part of condition?
    • Condition has been intended for external/environmental conditions, whereas genetic sex is inherent to the organism of study
    • Expression pattern curation needs genetic sex; needs a model at the Alliance for capturing sex


August 13, 2020

Species in Postgres and ACEDB/Datomic

  • Want to dump "Affected By Pathogen" fields in Phenotype OA and RNAi OA
  • Want to be sure that what gets dumped aligns with species loaded into ACEDB
  • Currently one species annotated not in WS277: Streptococcus gallolyticus subsp. gallolyticus
  • We currently have multiple Postgres tables for storing species lists:
    • pap_species_index (used by "Affected By Pathogen" fields, AFP); Kimberly uses it to assign species to papers and occasionally adds new ones
    • obo_name_ncbitaxonid
    • obo_name_taxon (original, smaller list)
    • h_pap_species_index (history for pap_species_index)
  • How do species get loaded into ACEDB? Dumps from Postgres? Which table(s)?
  • WS277 has 7,906 species (1,936 have no NCBI Taxon ID)
  • Kimberly has occasionally uploaded a species.ace file in the context of GO curation, but Hinxton otherwise handles it; we should ask them
  • New species are associated with paper objects, but otherwise no additional data for those species come from Caltech
  • It might be useful to have species pages in WB that at least list papers for which we have species associations, maybe include other information?
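
The alignment check described above (annotated species versus species loaded into ACEDB) amounts to a set difference. A minimal Python sketch, assuming both lists are available as plain name strings; the function name and the sample lists are hypothetical stand-ins, not dumps from the Postgres tables or WS277:

```python
def species_missing_from_acedb(annotated, loaded):
    """Return annotated species names absent from the loaded list, sorted.

    Names are compared case-insensitively after trimming whitespace.
    """
    loaded_normalized = {name.strip().lower() for name in loaded}
    return sorted(
        name for name in set(annotated)
        if name.strip().lower() not in loaded_normalized
    )

# Hypothetical sample data standing in for the real dumps.
annotated = [
    "Streptococcus gallolyticus subsp. gallolyticus",
    "Pseudomonas aeruginosa",
]
loaded = ["Pseudomonas aeruginosa", "Caenorhabditis elegans"]

missing = species_missing_from_acedb(annotated, loaded)
print(missing)
```

Matching on names alone is a simplification; since many loaded species lack an NCBI Taxon ID, matching on taxon IDs would only cover part of the list.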

WS279 Citace upload

  • When is it happening? Not sure; not on release schedule right now

SOLR server security (IMSS)

  • IMSS network security blocked network access to our server because of its open SOLR web access.
  • SOLR is part of the AMIGO stack (a very old version); it drives our ontology browser directly, and SObA and the enrichment tools indirectly.
  • Added a firewall/URL filter, and IMSS has reopened the network (for now). IMSS still complains that the service is open to the world.
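
The notes do not say how the URL filter was configured. As a rough illustration of the idea only, the Python sketch below allows a read-only query path and rejects everything else; the path `/solr/select` and the function name `is_allowed` are assumptions, and a real filter would live in the firewall or a reverse proxy rather than in application code.

```python
from urllib.parse import urlsplit

# Assumed allowlist: only the read-only query endpoint is reachable.
ALLOWED_PATHS = {"/solr/select"}  # hypothetical path

def is_allowed(url):
    """Return True only if the URL path exactly matches an allowed path."""
    path = urlsplit(url).path.rstrip("/")
    return path in ALLOWED_PATHS

print(is_allowed("http://example.org/solr/select?q=*:*"))
print(is_allowed("http://example.org/solr/update"))
```

An exact-match allowlist is used here instead of a blocklist so that any endpoint not explicitly listed (update or admin handlers, for example) is denied by default.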

Alzheimer's disease portal

  • Supplement grant awarded to Alliance for an Alzheimer's disease portal
  • Could involve automated/concise descriptions, interactions, etc.
  • Could establish useful pipelines that could be reused in other contexts