Difference between revisions of "Specifications for data submission to the Alliance"

From WormBaseWiki
Jump to navigationJump to search
(Created page with "We need to explicitly submit annotations to all entities—meaning the directly annotated entity and the inferred entities. Otherwise they will not show on the relevant page/...")
 
(17 intermediate revisions by the same user not shown)
Line 1: Line 1:
 +
==Data that needs to be submitted (March 2020)==
 
We need to explicitly submit annotations to all entities—meaning the directly annotated entity and the inferred entities.  Otherwise they will not show on the relevant page/table in the Alliance.  
 
We need to explicitly submit annotations to all entities—meaning the directly annotated entity and the inferred entities.  Otherwise they will not show on the relevant page/table in the Alliance.  
  
All Strains, Alleles, Transgenes and Genes associated with disease need to be included in the JSON as long as the Alliance is ready to take them in.  (At this point, only transgenes are not ready to go into the Alliance. Also additional data such as modifiers are not ready to go into Alliance; so any disease annotation with modifiers needs to be withheld from the JSON.
+
Strains, Alleles, Transgenes and Genes associated with disease need to be included.
  
1. When AGM/strain is the object of the primary annotation, this is when you find all three --gene, strain and allele in a single annotation WB annotation (note that we always choose Strain as the primary object annotated to over allele, though you can’t tell by just the acedb tag names; acedb tags: Modeled_by  Strain, Modeled_by  Variation, and Modeled_by  Disease_relevant_gene) there should be 3 submitted annotations for the JSON as follows:
+
====Disease data submission for Alliance JSON====
 
+
{|Class="wikitable"
objectID: GeneID <br/>
+
|+Mapping of acedb tags to objects in JSON
primaryGeneticEntityIDs: strain
+
|-
 
+
! !!Acedb Tags in Annotation <br/> (From ?Disease_model_annotation class)!!Primary Object!!Number of Objects<br/>with annotations!!JSON Objects
objectID: AlleleID . <br/>
+
|-
primaryGeneticEntityIDs: strain
+
|1||Modeled_by  Disease_relevant_gene||Gene||1||objectID: GeneID
 
+
|-
objectID: strainID
+
|2||Modeled_by  Variation<br/>Modeled_by  Disease_relevant_gene||Variation||2||objectID: GeneID <br/> primaryGeneticEntityIDs: AlleleID
 
+
|-
 
+
|3||Modeled_by Strain||Strain||1||objectID: strainID
2. When AGM/strain is the object of the primary annotation, this is when there is only strain in a single WB annotation (acedb tag: Modeled_by Strain), there should be 1 submitted annotation for the JSON as follows:
+
|-
 
+
|4||Modeled_by Strain<br/>Modeled_by Disease_relevant_gene||Strain||2||objectID: strainID<br/>
objectID: strainID
+
objectID: GeneID <br/>primaryGeneticEntityIDs: strainID
 
+
|-
 
+
|5||Modeled_by Transgene||Transgene||1||objectID: TransgeneID
 
+
|-
3. When gene is the object of the primary annotation, this is when you find only gene in a single WB annotation (acedb tag: Modeled_by  Disease_relevant_gene) there should be 1 submitted annotation for the JSON:
+
|6||Modeled_by  Transgene<br/> Modeled_by  Disease_relevant_gene||Transgene||2||objectID: Transgene ID <br/>
 
+
objectID: GeneID <br/>primaryGeneticEntityIDs: TransgeneID
objectID: GeneID
+
|-
 
+
|7||Modeled_by Strain<br/>Modeled_by Variation<br/>Modeled_by Disease_relevant_gene||Strain||3||objectID: strainID
 
+
objectID: GeneID <br/>primaryGeneticEntityIDs: strainID <br/>
4. When allele is the object of the primary annotation, this is when you find allele and gene in a single WB annotation ((acedb tags: Modeled_by Variation, and Modeled_by Disease_relevant_gene), there should be 2 submitted annotations for the JSON:
+
 
+
objectID: AlleleID <br/>primaryGeneticEntityIDs: strainID <br/>
objectID: AlleleID
 
 
 
 
 
objectID: GeneID .  <br/>
 
primaryGeneticEntityIDs: AlleleID
 
  
 +
|}
  
 
Note:  
 
Note:  
 
*‘InferredGeneAssociation’ is not needed anymore because Alliance is not using this in the pipeline.
 
*‘InferredGeneAssociation’ is not needed anymore because Alliance is not using this in the pipeline.
 
*The old style ‘Experimental model’ annotations not be submitted to the Alliance at all.  In our curation database at Caltech, the new style ‘Disease_model_annotation’ and the old style ‘Experimental model’ annotations are the same data, we just take the genes and dump them as ‘Experimental models’ for display sake in WormBase.
 
*The old style ‘Experimental model’ annotations not be submitted to the Alliance at all.  In our curation database at Caltech, the new style ‘Disease_model_annotation’ and the old style ‘Experimental model’ annotations are the same data, we just take the genes and dump them as ‘Experimental models’ for display sake in WormBase.
 +
 +
==Data that should be excluded from Alliance submissions (March 2020)==
 +
*Exclude entire annotation if it has data for any one of the below tags (from the ?Disease_model_annotation class)
 +
**Interacting_variation
 +
**Interacting_transgene
 +
**Interacting_gene
 +
**RNAi_experiment
 +
**Qualifier_not
 +
**Inducing_chemical
 +
**Inducing_agent
 +
**Modifier_transgene
 +
**Modifier_variation
 +
**Modifier_strain
 +
**Modifier_gene
 +
**Modifier_molecule
 +
**Other_modifier

Revision as of 23:02, 9 March 2020

Data that needs to be submitted (March 2020)

We need to explicitly submit annotations to all entities—meaning the directly annotated entity and the inferred entities. Otherwise they will not show on the relevant page/table in the Alliance.

Strains, Alleles, Transgenes and Genes associated with disease need to be included.

Disease data submission for Alliance JSON

Mapping of acedb tags to objects in JSON
Acedb Tags in Annotation
(From ?Disease_model_annotation class)
Primary Object Number of Objects
with annotations
JSON Objects
1 Modeled_by Disease_relevant_gene Gene 1 objectID: GeneID
2 Modeled_by Variation
Modeled_by Disease_relevant_gene
Variation 2 objectID: GeneID
primaryGeneticEntityIDs: AlleleID
3 Modeled_by Strain Strain 1 objectID: strainID
4 Modeled_by Strain
Modeled_by Disease_relevant_gene
Strain 2 objectID: strainID

objectID: GeneID
primaryGeneticEntityIDs: strainID

5 Modeled_by Transgene Transgene 1 objectID: TransgeneID
6 Modeled_by Transgene
Modeled_by Disease_relevant_gene
Transgene 2 objectID: Transgene ID

objectID: GeneID
primaryGeneticEntityIDs: TransgeneID

7 Modeled_by Strain
Modeled_by Variation
Modeled_by Disease_relevant_gene
Strain 3 objectID: strainID

objectID: GeneID
primaryGeneticEntityIDs: strainID

objectID: AlleleID
primaryGeneticEntityIDs: strainID

Note:

  • ‘InferredGeneAssociation’ is not needed anymore because Alliance is not using this in the pipeline.
  • The old style ‘Experimental model’ annotations not be submitted to the Alliance at all. In our curation database at Caltech, the new style ‘Disease_model_annotation’ and the old style ‘Experimental model’ annotations are the same data, we just take the genes and dump them as ‘Experimental models’ for display sake in WormBase.

Data that should be excluded from Alliance submissions (March 2020)

  • Exclude entire annotation if it has data for any one of the below tags (from the ?Disease_model_annotation class)
    • Interacting_variation
    • Interacting_transgene
    • Interacting_gene
    • RNAi_experiment
    • Qualifier_not
    • Inducing_chemical
    • Inducing_agent
    • Modifier_transgene
    • Modifier_variation
    • Modifier_strain
    • Modifier_gene
    • Modifier_molecule
    • Other_modifier