Difference between revisions of "WormBase gene association file"

From WormBaseWiki
Jump to navigationJump to search
Line 47: Line 47:
 
  !date-generated: 2020-05-29
 
  !date-generated: 2020-05-29
 
  !project-URL: https://wormbase.org
 
  !project-URL: https://wormbase.org
  !specification-URL:  
+
  !specification-URL: https://wiki.wormbase.org/index.php/WormBase_gene_association_file
 +
!project-release: WS277
 
  !Contact Email: help@wormbase.org
 
  !Contact Email: help@wormbase.org
 +
 +
The WormBase anatomy association file generally follows the GAF 2.0 format with the following exceptions:
 +
 +
* The column 4 "Qualifier" will be one of four values specific to anatomical expression annotation: Certain, Uncertain, Partial, Enriched
  
 
=== Development (life stage) association file ===
 
=== Development (life stage) association file ===

Revision as of 19:30, 23 November 2020

This page represents the current information about the WormBase gene association file. Click here for an archive of outdated/obsolete information about the WormBase gene association file.

Gene Association File (GAF) format

The original Gene Association File (GAF) format was specified within the Gene Ontology consortium to specify how gene associations to Gene Ontology (GO) terms would be reported in a tab-delimited download file.

To view the (now deprecated) GAF 2.0 format specification, visit:

http://geneontology.org/docs/go-annotation-file-gaf-format-2.0/

To view the (now stale, but not quite deprecated) GAF 2.1 format specification, visit:

http://geneontology.org/docs/go-annotation-file-gaf-format-2.1/

To view the latest GAF 2.2 format specification, visit:

https://github.com/geneontology/geneontology.github.io/blob/issue-go-annotation-2917-gaf-2_2-doc/_docs/go-annotation-file-gaf-format-22.md

WormBase ontology gene association files

WormBase provides a gene association file for each of the various ontologies that WormBase uses, essentially providing a single annotation associating a gene to an ontology term on a single row of the tab-delimited output file.

Current WormBase gene association files can always be found on the WormBase FTP site here:

ftp://ftp.wormbase.org/pub/wormbase/releases/current-production-release/ONTOLOGY/

Anatomy association file

The anatomy association file (associating genes to anatomical entities where the gene product has been reported to be expressed) has a general name like:

anatomy_association.WSXXX.wb

where WSXXX refers to the relevant release number. The file for the WS277 release of WormBase is called:

anatomy_association.WS277.wb

The current header for the anatomy association file is

!gaf-version: 2.0
!Project_name: WormBase
!Contact Email: help@wormbase.org

The proposal (Nov 2020) is to update this header to:

!gaf-version: 2.0
!generated-by: WormBase
!date-generated: 2020-05-29
!project-URL: https://wormbase.org
!specification-URL: https://wiki.wormbase.org/index.php/WormBase_gene_association_file
!project-release: WS277
!Contact Email: help@wormbase.org

The WormBase anatomy association file generally follows the GAF 2.0 format with the following exceptions:

  • The column 4 "Qualifier" will be one of four values specific to anatomical expression annotation: Certain, Uncertain, Partial, Enriched

Development (life stage) association file

The development (life stage) association file (associating genes to life stages when the gene product has been reported to be expressed) has a general name like:

development_association.WSXXX.wb

where WSXXX refers to the relevant release number. The file for the WS277 release of WormBase is called:

development_association.WS277.wb


Disease association file

The disease association file (associating genes to human diseases the genes have been implicated in) has a general name like:

disease_association.WSXXX.wb

where WSXXX refers to the relevant release number. The file for the WS277 release of WormBase is called:

disease_association.WS277.wb


Gene Ontology (GO) gene association file

The Gene Ontology gene association file (associating genes to Gene Ontology terms) has a general name like:

gene_association.WSXXX.wb

where WSXXX refers to the relevant release number. The file for the WS277 release of WormBase is called:

gene_association.WS277.wb


Phenotype association file

The phenotype association file (associating genes to phenotype terms) has a general name like:

phenotype_association.WSXXX.wb

where WSXXX refers to the relevant release number. The file for the WS277 release of WormBase is called:

phenotype_association.WS277.wb


RNAi phenotype association file

The RNAi phenotype association file (associating genes to RNAi phenotypes) has a general name like:

rnai_phenotypes.WS277.wb

where WSXXX refers to the relevant release number. The file for the WS277 release of WormBase is called:

rnai_phenotypes.WS277.wb