Difference between revisions of "Phasing out the manual annotations"

From WormBaseWiki

Revision as of 23:07, 18 February 2016

The plan is to group the annotations by date and/or by their text, so that each group can be phased out of the concise descriptions dump that is submitted for upload, and possibly out of Postgres as well.

Note: This covers only the Description Type 'Concise_description'; Provisional descriptions are not considered.

  • Those that have a 'Date last updated' of 2004-06-17 in the OA (3075 genes)
  • Those genes that have the description below (also from 2004-06-17)
This gene encodes a protein containing an F-box, a motif predicted to mediate protein-protein
interactions either with homologs of yeast Skp-1p or with other proteins.

Examples: fbxb genes, fbxa genes, a few sdz genes, a number of unnamed (uncloned?) genes; total 295 genes

  • Those genes with the description below (most from 2004-06-17)
The protein product of this gene is predicted to contain a glutamine/asparagine (Q/N)-rich ('prion')
domain, by the algorithm of Michelitsch and Weissman (as of the WS77 release of WormBase, i.e., in  
wormpep77).

Examples: pqn genes, some abu genes, some unnamed (uncloned?) genes, total 72 genes

  • Those genes that have the word 'disease' in the description (113 genes)
hex-1 encodes a beta-N-acetylhexosaminidase that is orthologous to the human gene CERVICAL CANCER PROTO-ONCOGENE 7
(HEXB; OMIM:606873), which when mutated leads to disease.
  • Those genes that have the word 'syndrome' in the description (85 genes)
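
The groups listed above could be assigned with a small script along the following lines. This is only a sketch under stated assumptions: the function name `classify`, the group labels, and the idea of passing in a date and a description string are all hypothetical and do not reflect the actual OA or Postgres layout. The boilerplate prefixes are copied from the descriptions quoted above.

```python
import re
from datetime import date

# Criteria taken from the list above; everything else here is illustrative.
LEGACY_DATE = date(2004, 6, 17)
FBOX_PREFIX = "This gene encodes a protein containing an F-box"
QN_PREFIX = ("The protein product of this gene is predicted to contain "
             "a glutamine/asparagine (Q/N)-rich")
KEYWORDS = ("disease", "syndrome")


def classify(last_updated, description):
    """Return the set of phase-out groups a concise description falls into.

    `last_updated` is a datetime.date and `description` is the description
    text (hypothetical inputs, not the real table columns).
    """
    groups = set()
    if last_updated == LEGACY_DATE:
        groups.add("date_2004-06-17")
    # Collapse line breaks and repeated spaces before comparing text.
    text = " ".join(description.split())
    if text.startswith(FBOX_PREFIX):
        groups.add("f_box")
    if text.startswith(QN_PREFIX):
        groups.add("qn_rich")
    for word in KEYWORDS:
        # Whole-word, case-insensitive match (an assumption; a plain
        # substring search would also catch plurals such as "diseases").
        if re.search(rf"\b{word}\b", text, re.IGNORECASE):
            groups.add(word)
    return groups
```

A description can land in more than one group (for example, an F-box description last updated on 2004-06-17), so the function returns a set rather than a single label. Counts produced this way may differ from the gene totals quoted above, depending on how the keyword match is defined.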