Difference between revisions of "Specifications for data submission to the Alliance"

From WormBaseWiki
Jump to navigationJump to search
 
(15 intermediate revisions by the same user not shown)
Line 1: Line 1:
==Data that needs to be submitted (March 2020)==
+
==Old stuff--Data that needs to be submitted (March 2020)==
 
We need to explicitly submit annotations to all entities—meaning the directly annotated entity and the inferred entities.  Otherwise they will not show on the relevant page/table in the Alliance.  
 
We need to explicitly submit annotations to all entities—meaning the directly annotated entity and the inferred entities.  Otherwise they will not show on the relevant page/table in the Alliance.  
  
All Strains, Alleles, Transgenes and Genes associated with disease need to be included in the JSON as long as the Alliance is ready to take them in.
+
Strains, Alleles, Transgenes and Genes associated with disease need to be included.
  
1. When you find all three --gene, strain and allele in a single annotation WB annotation, note that we always choose Strain/AGM as the primary object annotated to, though you can’t tell by just the acedb tag names; acedb tags: Modeled_by  Strain, Modeled_by  Variation, and Modeled_by  Disease_relevant_gene) there should be 3 submitted annotations for the JSON as follows, as we want to explicitly submit annotations to the allele and gene as well:
+
====Disease data submission for Alliance JSON====
 
+
{|Class="wikitable"
objectID: GeneID
+
|+Mapping of acedb tags to objects in JSON
primaryGeneticEntityIDs: strain
+
|-
 
+
! !!Acedb Tags in Annotation <br/> (From ?Disease_model_annotation class)!!Primary Object!!Number of Objects<br/>with annotations!!JSON Objects
objectID: AlleleID
+
|-
primaryGeneticEntityIDs: strain
+
|1||Modeled_by  Disease_relevant_gene||Gene||1||objectID: GeneID
 
+
|-
objectID: strainID
+
|2||Modeled_by  Variation<br/>Modeled_by  Disease_relevant_gene||Variation||2||objectID: GeneID <br/> primaryGeneticEntityIDs: AlleleID
 
+
|-
 
+
|3||Modeled_by Strain||Strain||1||objectID: strainID
2. When AGM/strain is the object of the primary annotation, this is when there is only a strain in a single WB annotation (acedb tag: Modeled_by  Strain), there should be 1 submitted annotation for the JSON as follows:
+
|-
 
+
|4||Modeled_by Strain<br/>Modeled_by Disease_relevant_gene||Strain||2||objectID: strainID<br/>
objectID: strainID
+
objectID: GeneID <br/>primaryGeneticEntityIDs: strainID
 
+
|-
 
+
|5||Modeled_by Transgene||Transgene||1||objectID: TransgeneID
3. When gene is the object of the primary annotation, this is when you find only gene in a single WB annotation (acedb tag: Modeled_by Disease_relevant_gene) there should be 1 submitted annotation for the JSON:
+
|-
 
+
|6||Modeled_by Transgene<br/> Modeled_by  Disease_relevant_gene||Transgene||2||objectID: Transgene ID <br/>
objectID: GeneID
+
objectID: GeneID <br/>primaryGeneticEntityIDs: TransgeneID
 
+
|-
 
+
|7||Modeled_by Strain<br/>Modeled_by Variation<br/>Modeled_by Disease_relevant_gene||Strain||3||objectID: strainID
4. When you find allele and gene in a single WB annotation ((acedb tags: Modeled_by Variation, and Modeled_by Disease_relevant_gene), allele is the object of the primary annotation, there should be 2 submitted annotations for the JSON, because we want to submit explicitly to the gene as well:
+
objectID: GeneID <br/>primaryGeneticEntityIDs: strainID <br/>
 
+
objectID: AlleleID
+
objectID: AlleleID <br/>primaryGeneticEntityIDs: strainID <br/>
 
 
objectID: GeneID
 
primaryGeneticEntityIDs: AlleleID
 
 
 
 
 
5. When Transgene is the object of the primary annotation, this is when you only find Transgene in a single WB annotation (aced tag: Modeled_by Transgene) there should be one annotation in the JSON:
 
 
 
objectID: TransgeneID
 
 
 
 
 
6. When you find Transgene and gene in a single WB annotation (aced tags: Modeled_by Transgene, Modeled_by  Disease_relevant_gene), Transgene is the object of the primary annotation, so there should be two annotations in our JSON, again because we want to submit the primary annotation to the Transgene but we also want to explicitly submit the gene connection as well.
 
 
 
objectID: Transgene ID
 
 
 
objectID: GeneID
 
primaryGeneticEntityIDs: TransgeneID
 
  
 +
|}
  
 
Note:  
 
Note:  
Line 50: Line 35:
 
*The old style ‘Experimental model’ annotations not be submitted to the Alliance at all.  In our curation database at Caltech, the new style ‘Disease_model_annotation’ and the old style ‘Experimental model’ annotations are the same data, we just take the genes and dump them as ‘Experimental models’ for display sake in WormBase.
 
*The old style ‘Experimental model’ annotations not be submitted to the Alliance at all.  In our curation database at Caltech, the new style ‘Disease_model_annotation’ and the old style ‘Experimental model’ annotations are the same data, we just take the genes and dump them as ‘Experimental models’ for display sake in WormBase.
  
==Data excluded from Alliance submissions==
+
==Data that should be excluded from Alliance submissions (March 2020)==
*Annotations with data for tags (from the ?Disease_model_annotation class)
+
*Exclude entire annotation if it has data for any one of the below tags (from the ?Disease_model_annotation class)
 
**Interacting_variation
 
**Interacting_variation
 
**Interacting_transgene
 
**Interacting_transgene
 
**Interacting_gene
 
**Interacting_gene
 
**RNAi_experiment
 
**RNAi_experiment
 +
**Qualifier_not
 +
**Inducing_chemical
 +
**Inducing_agent
 
**Modifier_transgene
 
**Modifier_transgene
 
**Modifier_variation
 
**Modifier_variation

Latest revision as of 17:43, 26 October 2023

Old stuff--Data that needs to be submitted (March 2020)

We need to explicitly submit annotations to all entities—meaning the directly annotated entity and the inferred entities. Otherwise they will not show on the relevant page/table in the Alliance.

Strains, Alleles, Transgenes and Genes associated with disease need to be included.

Disease data submission for Alliance JSON

Mapping of acedb tags to objects in JSON
Acedb Tags in Annotation
(From ?Disease_model_annotation class)
Primary Object Number of Objects
with annotations
JSON Objects
1 Modeled_by Disease_relevant_gene Gene 1 objectID: GeneID
2 Modeled_by Variation
Modeled_by Disease_relevant_gene
Variation 2 objectID: GeneID
primaryGeneticEntityIDs: AlleleID
3 Modeled_by Strain Strain 1 objectID: strainID
4 Modeled_by Strain
Modeled_by Disease_relevant_gene
Strain 2 objectID: strainID

objectID: GeneID
primaryGeneticEntityIDs: strainID

5 Modeled_by Transgene Transgene 1 objectID: TransgeneID
6 Modeled_by Transgene
Modeled_by Disease_relevant_gene
Transgene 2 objectID: Transgene ID

objectID: GeneID
primaryGeneticEntityIDs: TransgeneID

7 Modeled_by Strain
Modeled_by Variation
Modeled_by Disease_relevant_gene
Strain 3 objectID: strainID

objectID: GeneID
primaryGeneticEntityIDs: strainID

objectID: AlleleID
primaryGeneticEntityIDs: strainID

Note:

  • ‘InferredGeneAssociation’ is not needed anymore because Alliance is not using this in the pipeline.
  • The old style ‘Experimental model’ annotations not be submitted to the Alliance at all. In our curation database at Caltech, the new style ‘Disease_model_annotation’ and the old style ‘Experimental model’ annotations are the same data, we just take the genes and dump them as ‘Experimental models’ for display sake in WormBase.

Data that should be excluded from Alliance submissions (March 2020)

  • Exclude entire annotation if it has data for any one of the below tags (from the ?Disease_model_annotation class)
    • Interacting_variation
    • Interacting_transgene
    • Interacting_gene
    • RNAi_experiment
    • Qualifier_not
    • Inducing_chemical
    • Inducing_agent
    • Modifier_transgene
    • Modifier_variation
    • Modifier_strain
    • Modifier_gene
    • Modifier_molecule
    • Other_modifier