Difference between revisions of "WormBase-Caltech Weekly Calls"

From WormBaseWiki
 
Line 19: Line 19:
 
[[WormBase-Caltech_Weekly_Calls_2018|2018 Meetings]]
 
[[WormBase-Caltech_Weekly_Calls_2018|2018 Meetings]]
  
 +
[[WormBase-Caltech_Weekly_Calls_2019|2019 Meetings]]
  
GoToMeeting link: https://www.gotomeet.me/wormbase1
+
[[WormBase-Caltech_Weekly_Calls_2020|2020 Meetings]]
  
 +
[[WormBase-Caltech_Weekly_Calls_2021|2021 Meetings]]
  
 +
= 2022 Meetings =
  
= 2019 Meetings =
+
[[WormBase-Caltech_Weekly_Calls_January_2018|January]]
  
 
+
= January 13th, 2022 =
== January 3, 2019 ==
+
== tm variation - gene associations ==
 
+
*Update on progress and some questions for the Caltech curators
=== WS270 Citace upload ===
+
*Background: not all variations were being associated with genes in the OA table because some of those associations are in WS but not in geneace, so weren't coming through in the nightly geneace dump.  Some variation-gene associations are made as part of the VEP pipeline during the build.
* Next Tuesday, Jan 8th, 10am Pacific
+
**https://github.com/WormBase/website/issues/8262
 
+
**https://wiki.wormbase.org/index.php/WBGene_information_and_status_pipeline
=== Gene descriptions ===
+
**https://wiki.wormbase.org/index.php/Source_and_maintenance_of_non-WBGene_info
* Valerio generated new files to ignore/filter-out problematic genes
+
**https://wiki.wormbase.org/index.php/Updating_Postgres_with_New_WS_Information
* Still need to validate new pipeline
+
*Wen now downloads several full ACeDB classes from the latest WS release in the form of .ace files so we can also have whatever information is in WS. Raymond wrote a script to sync those files to tazendra for further processing/use.
* Barring any major issues, will submit new files for WS270 (can load old files if needed)
+
*A few questions that we want to confirm before going forward:
* Maybe should define a test set (random sample) to test each release? Already have a test set
+
**In the WS variations file, there are 2,130,801 total variations (1,911,339 total Live) while in postgres there are currently 106,080.
 
+
***Only include Status = Live variations?
=== Protege Tutorial ===
+
***Include regardless of whether there is an associated gene (this seems to be the current practice?).
* Doodle poll open: https://doodle.com/poll/kn49rd3rggymn68g
+
***Currently, some variations with a given Method, e.g. Million_mutation, are NOT included.  We would continue this filtering.
* Please fill out poll if you are interested in attending; have responses from Kimberly and Gary S.
+
****SNP
 
+
****WGS_Hawaiian_Waterston
 
+
****WGS_Pasadena_Quinlan
==January 11th, 2019==
+
****WGS_Hobert
 
+
****Million_mutation
===WB workshop at IWM 2019===
+
****WGS_Yanai
Here's a draft, need to finalize as Jan 15th is the deadline
+
****WGS_De_Bono
<pre style="white-space: pre-wrap;
+
****WGS_Andersen
white-space: -moz-pre-wrap;
+
****WGS_Flibotte
white-space: -pre-wrap;
+
****WGS_Rose
white-space: -o-pre-wrap;
+
***Do we want other filters?
word-wrap: break-word">
+
**For genes, the ace file contains ALL the gene objects in WB regardless of species.
Possible Title 1: Data in WormBase and how to query it
+
***We've recently had an author request, via the Acknowledge pipeline, to associate genes of other, less well studied Caenorhabditis species, e.g. C. inopinata, to [https://academic.oup.com/g3journal/article/11/3/jkab022/6121926 their paper].
Possible Title 2: WormBase 2019 - Data, Tools and Community Curation
+
***Do we want all Caenorhabditis (and other nematode) species genes in our various gene tables, e.g. obo, paper? Any other species?
This workshop will be an interactive session with users in order to discuss the types of data in WormBase and how to query them using the right tools. We will discuss recent changes to WormBase community annotation forms and how to use them to contribute data to WormBase. We will also present updates to ParaSite, a portal to parasitic worm genomic data, and how to find cross-species data at the Alliance of Genome Resources.
+
***The effect on the autocomplete, if we include all, probably won't be a problem (1,018,332 vs 306,116).
 
+
***Some of the gene IDs from other species don't have 'WBGene' prefixes, e.g. Sp34_10109610.  Should we keep these in a separate table from genes with 'WBGene' prefixes?
1:00 pm 
 
Keep your widgets open: a wealth of gene-related data on the gene page
 
(This will be a quick walk-through of the gene page for orienting Users before we jump into the tools; can point to data related to Gene function, expression and disease models) - Ranjana Kishore
 
     
 
1:10 pm 
 
Use the right tool for the right data:
 
Get simple lists using SimpleMine - Wen Chen
 
Tissue enrichment analysis tools - Kimberly Van Auken
 
Tools for RNA seq data - Wen Chen
 
Get batch gene data using the WormBase Ontology Browser - Raymond Lee
 
Get the big picture: visualize annotations using the SOBA tool - Raymond Lee
 
     
 
1:50 pm  WormBase ParaSite: Exploring lots of genomes - Kevin Howe
 
 
 
2:00 pm  Find cross-species data at the Alliance of Genome Resources - Chris Grove
 
 
 
2:10 pm  Be a Community Curator: submit your data to WormBase - Daniela Raciti
 
 
 
2:10-2:30 pm  Open forum for questions
 
</pre>
 
 
 
=== Finalize Protege tutorial time ===
 
* Best final options:
 
** Wed, Jan 16th, 1pm Pacific/4pm Eastern
 
** Thurs, Jan 17th, 11am Pacific/2pm Eastern
 
** Thurs, Jan 17th, 1pm Pacific/4pm Eastern
 
* Propose we go with Wed, Jan 16th, 1pm Pacific/4pm Eastern
 

Latest revision as of 18:56, 13 January 2022

Previous Years

2009 Meetings

2011 Meetings

2012 Meetings

2013 Meetings

2014 Meetings

2015 Meetings

2016 Meetings

2017 Meetings

2018 Meetings

2019 Meetings

2020 Meetings

2021 Meetings

2022 Meetings

January

January 13th, 2022

tm variation - gene associations

  • Update on progress and some questions for the Caltech curators
  • Background: not all variations were being associated with genes in the OA table because some of those associations are in WS but not in geneace, so weren't coming through in the nightly geneace dump. Some variation-gene associations are made as part of the VEP pipeline during the build.
  • Wen now downloads several full ACeDB classes from the latest WS release in the form of .ace files so we can also have whatever information is in WS. Raymond wrote a script to sync those files to tazendra for further processing/use.
  • A few questions that we want to confirm before going forward:
    • In the WS variations file, there are 2,130,801 total variations (1,911,339 total Live) while in postgres there are currently 106,080.
      • Only include Status = Live variations?
      • Include regardless of whether there is an associated gene (this seems to be the current practice?).
      • Currently, some variations with a given Method, e.g. Million_mutation, are NOT included. We would continue this filtering.
        • SNP
        • WGS_Hawaiian_Waterston
        • WGS_Pasadena_Quinlan
        • WGS_Hobert
        • Million_mutation
        • WGS_Yanai
        • WGS_De_Bono
        • WGS_Andersen
        • WGS_Flibotte
        • WGS_Rose
      • Do we want other filters?
    • For genes, the ace file contains ALL the gene objects in WB regardless of species.
      • We've recently had an author request, via the Acknowledge pipeline, to associate genes of other, less well studied Caenorhabditis species, e.g. C. inopinata, to their paper.
      • Do we want all Caenorhabditis (and other nematode) species genes in our various gene tables, e.g. obo, paper? Any other species?
      • The effect on the autocomplete, if we include all, probably won't be a problem (1,018,332 vs 306,116).
      • Some of the gene IDs from other species don't have 'WBGene' prefixes, e.g. Sp34_10109610. Should we keep these in a separate table from genes with 'WBGene' prefixes?
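
The filtering questions above can be illustrated with a minimal Python sketch. It is not the actual tazendra script. The simplified .ace layout (one object per blank-line-separated paragraph, with `Status` and `Method` tag lines), the sample objects, and the helper names `parse_ace`, `keep_variation`, and `split_gene_ids` are all assumptions made for illustration. Only the excluded Method names and the `Sp34_10109610` example come from the notes.

```python
import re
from typing import Dict, Iterable, Iterator, List, Tuple

# Methods listed in the notes as currently NOT included.
EXCLUDED_METHODS = frozenset({
    "SNP", "WGS_Hawaiian_Waterston", "WGS_Pasadena_Quinlan", "WGS_Hobert",
    "Million_mutation", "WGS_Yanai", "WGS_De_Bono", "WGS_Andersen",
    "WGS_Flibotte", "WGS_Rose",
})


def parse_ace(text: str) -> Iterator[Dict[str, str]]:
    """Yield one dict per object, assuming blank-line-separated paragraphs."""
    for block in re.split(r"\n\s*\n", text.strip()):
        lines = [ln.strip() for ln in block.splitlines() if ln.strip()]
        if not lines or ":" not in lines[0]:
            continue
        cls, _, name = lines[0].partition(":")
        record = {"class": cls.strip(), "id": name.strip().strip('"')}
        for ln in lines[1:]:
            parts = ln.split(None, 1)
            value = parts[1].strip('"') if len(parts) > 1 else ""
            # Keep only the first value seen for each tag.
            record.setdefault(parts[0], value)
        yield record


def keep_variation(record: Dict[str, str]) -> bool:
    """Keep Live variations whose Method is not on the exclusion list."""
    return (
        record.get("class") == "Variation"
        and record.get("Status") == "Live"
        and record.get("Method") not in EXCLUDED_METHODS
    )


def split_gene_ids(ids: Iterable[str]) -> Tuple[List[str], List[str]]:
    """Separate IDs with the 'WBGene' prefix from all other gene IDs."""
    wbgene, other = [], []
    for gene_id in ids:
        (wbgene if gene_id.startswith("WBGene") else other).append(gene_id)
    return wbgene, other


# Made-up sample objects in the assumed simplified layout.
SAMPLE = '''
Variation : "WBVar00000001"
Status "Live"
Method "Deletion_allele"

Variation : "WBVar00000002"
Status "Live"
Method "Million_mutation"

Variation : "WBVar00000003"
Status "Dead"
Method "Deletion_allele"
'''

kept = [rec["id"] for rec in parse_ace(SAMPLE) if keep_variation(rec)]
print(kept)
print(split_gene_ids(["WBGene00000001", "Sp34_10109610"]))
```

Under these assumptions only the first sample variation passes both the Status and Method filters. The association with a gene is deliberately not checked, matching the "include regardless of associated gene" option. Whether non-WBGene IDs end up in a separate table is left open, as in the notes, and the sketch only shows how the two groups could be told apart.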