Difference between revisions of "Genetics Markup by Textpresso and First Pass"

From WormBaseWiki
Jump to navigationJump to search
m
m
 
(23 intermediate revisions by 2 users not shown)
Line 1: Line 1:
 
[[http://www.wormbase.org/wiki/index.php/How_to_FirstPass back]]
 
[[http://www.wormbase.org/wiki/index.php/How_to_FirstPass back]]
 
----
 
----
=[[Instructions for Genetics]]=
+
==[[Mark up policy]]==
[http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 Sample GENETICS Firstpass form]
 
  
[http://www.wormbase.org/wiki/index.php?title=Instructions_for_Genetics&action=edit Note to GENETICS]
+
==[[Mark Up Work Flow]]==
 +
[[Media:slide1.jpg]]
  
[http://www.wormbase.org/wiki/index.php?title=Instructions_for_Genetics&action=edit Note to authors]
+
this file diagrams the most recent and most detailed workflow we established, which includes editorial roles at Dartmouth. The steps below is a brief overview of the process. <br>
  
[[jfp postgres table details]]
+
These following steps outline the work flow for Textpresso markup starting with the acceptance of the paper by GENETICS and through to the incorporation of the paper into WB.<br> 
----
 
  
==Data types used in the first-pass forms==
+
''The responsible party is noted before each step.'' <br>
  
'''''C. elegans''''' (default checked): Postgres character name: '''celegans.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
Genetics (1) The paper gets accepted.
  
'''''C. elegans'' other than Bristol: '''''  Data is present for ''C. elegans'' isolates other than Bristol, such as Hawaiian, CB4855, etc. Postgres character name: '''  cnonbristol.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
Genetics (2) A DOI is assigned to the paper
  
'''Nematodes other than ''C. elegans'': '''  Data is presented for Caenorhabditis sister species e.g., briggsae, remanei,  and/or related nematodes, parasitic nematodes.  Postgres character name: '''  nematode.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
Genetics (3) A WBPaperID and URL are acquired through the ticketing form at: http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_paper_ticket.cgi
  
'''Non-nematode species: '''  Data is presented for Human, Mouse, SGD, Dog, Plant, etc. genes/proteins, other.  Postgres character name: '''  nonnematode.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
Genetics (4) The WBPaperID from the ticket issuer is attached to the XML
  
'''Enter genes studied in this paper: '''  List of gene(s) studied in the paper.  Postgres character name: '''  genestudied.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
Genetics (5) the author is sent the generated URL to access the author form
  
'''Enter genes cloned in the paper : '''  List of genes newly identified, named, cloned, reassigned etc. Postgres character name: '''   genesymbol.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
Author  (6) Authors provide the final source files and fills out the author first pass form within 48 hours of acceptance to GENETICS.
 +
* Author data populates journal first-pass tables in postgres, new objects are automatically added to the appropriate lexicon.
 +
* New object data from the journal first-pass form is sent to Karen, who makes sure the information is entered correctly
 +
* New data goes to the appropriate data curator for acedb object creation.
 +
* All other data populates author first-pass tables and paper is placed in the pipeline for first-pass curation (Juancarlos will make a filter so these papers will be prioritized for first-pass).  
 +
 +
DJS (Dartmouth Journal Services) (~5-8) sends the final source file (in XML format) to Arun - ''deposits XML on a web service?'' http://textpresso-dev.caltech.edu/gsa/worm/incoming_xml/
 +
* An e-mail is sent to the QC curator that a paper is coming and to expect a follow up e-mail with a link to the linked paper and entity table.  
  
'''Enter new alleles: '''  Alleles reported in the paper that don't exist in WBPostgres character name: '''  extvariation.''' Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp],  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
Automatic (8) the paper is run through the linking script.   
 +
* The script links all the objects in wormbase and the ones provided by the author first pass form
 +
* The linked file is deposited here http://textpresso-dev.caltech.edu/gsa/worm/html/
 +
* An initial entity link table is created and deposited in http://textpresso-dev.caltech.edu/gsa/worm/first_pass_entity_link_tables/
 +
* QC curator receives a link to the linked XML in a QCFast interface and a link to the entity table.  
  
'''Enter new strains: '''  Strains reported in the paper that are or will be made publicly available in the CGC. Postgres character name: '''  newstrains.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
Juancarlos (~9) gets XML from Arun, populates bibliography information for WormBase.''?''
  
'''Genetic mapping data : '''  Determination of the location of the gene using genetic recombination, e.g., 2-factor recombination, 3-factor interval linkage, Df breakpoints, etc.  Postgres character name: '''  mappingdata.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
Script (10) sends markup back to DJS, this action is prompted by a submit button on QCFast.  
 +
* A final entity table is created http://textpresso-dev.caltech.edu/gsa/worm/entity_link_tables/
 +
* An e-mail is sent to the curator that the file has been deposited and includes a link to the final entity table.  
  
'''Enter new balancers: '''  List of balancers reported in paper that don't exist in WB.  Postgres character name: '''  newbalancers .'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
--[[User:Kyook|kjy]] 19:44, 19 April 2012 (UTC)
  
'''Phenotype analysis : '''  Indicates the paper reports phenotypes of mutants or phenotypic analysis of strains.  Postgres character name: '''  newmutant.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
==Instructions for Genetics==
 +
The following are instructions to the journal and to the authors.
  
'''Overexpression phenotype : '''  Indicates the paper reported phenotypes due to the overexpression of transgenes.  Postgres character name: '''  overexpr.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
===Note to authors:===
 +
The following message is sent to authors by GENETICS upon acceptance of their paper and after the DOI has been assigned and WBPaperID retrieved.  
  
'''Chemicals : '''  Indicates if chemicals or drug treatments were used to analyze strain behavior,  physiology, gene function, etc. of mutant or 'normal' worms. Postgres character name: '''  chemicals.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
"GENETICS is working with textpresso (www.textpresso.org) and WormBase (www.wormbase.org) to create links between genetic and genomic objects that are in your paper to the appropriate page in WormBase. These links will be included in both the online full text and PDF formats of your paper.
  
'''Small-scale RNAi (less than 100 experiments reported) : '''  Indicates that gene function was assayed by RNA interferencePostgres character name: '''  rnai.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
If you want any genes, alleles, transgenes, CGC-destined strains, anatomy terms, etc., discovered or described in your paper to be linked to WormBase please enter the names of these objects using the form at the following link:
 +
http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00040237&passwd=1317224998.8903801
  
'''Large-scale RNAi (more than 100 experiments reported) : '''  Indicates that gene function was assayed by large RNA interference screens. Postgres character name: '''  lsrnai.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
Follow the examples carefully as your submitted data will be processed automatically. If you would rather upload a file, please contact kyook@caltech.edu.
  
'''Mosaic analysis : '''  Indicates that gene function was assayed in specific cells using lineage analysis. Postgres character name: '''  mosaic.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
Thank you for your help."
  
'''Tissue or cell site of action : '''  Indicates that gene function was assayed in specific cells or tissues, such as in the case where gene function was rescued by cell/tissue-specific expression of the gene.  Postgres character name: '''  siteaction.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
[http://www.wormbase.org/wiki/index.php/Instructions_for_Genetics previous notes to authors]<br>
 +
--[[User:Kyook|kjy]] 19:40, 2 December 2011 (UTC)
  
'''Time of action: '''  Indicates that timing of gene function in development was assayed.  Postgres character name: '''  timeaction.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
===Note to GENETICS:===
  
'''Molecular function of a gene product : '''  Indicates that a new/novel molecular function or aspect of mol function for a gene was identified.  Postgres character name: '''  genefunc.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
This form (link sent) represents a user-friendly interface to a postgres database table stored at WormBase Caltech.
  
'''Homolog of a human disease-associated gene. : '''  Indicates that a gene studied in the paper is a homolog of a human gene, which is directly associated with a disease.  Postgres character name: '''  humdis.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
Once the authors hit 'flag' (the button at the bottom of the page) their data will:
  
'''Genetic interactions : '''  Indicates the paper reports results from experiments where genes were assayed for effect on the function of another gene.  Often this is made apparent by the analysis of double, triple, etc. mutants, or with the use of experiments where RNAi was used concurrent with other RNAi-treatment or mutations. Postgres character name: '''  geneint.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
* be added to Arun's object markup list, so new objects can be marked up in the paper along with known WB objects. NOTE: these objects will need to be entered correctly and cleanly as everything that appears on his markup list will be linked.  
 +
* populate the postgres table and the corresponding first pass paper curation form used by fp curators
 +
* be marked in our first pass curation list as having author data, therefore marked for priority curation
  
'''Functional complementation : '''  Indicates the paper reports results from experiments functional redundancy between separate genes, e.g., the rescue of gen-A by overexpression of gen-B, or any other extragenic sequence.  Also indicated by the rescue of gene function by a gene from another species.  Postgres character name: '''  funccomp.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
The WB QC curator will receive an e-mail alerting them to the submitted data, at which point it will be checked over to make sure it is in the proper format for Arun's scripts and corrected if needed.
  
'''Gene product interactions : '''  Indicates the paper reports data on protein-protein, RNA-protein, DNA-protein, or Y2H interactions, etc.  Postgres character name: '''  geneprod.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
We won't be able to take uploaded tables for the data types; the parsing of this data will be problematic for Texptresso as we anticipate people uploading the data in many different formats. Nonetheless, if they do want to upload a file, for data that doesn't exist in their supplemental materials, they are welcome to contact one of us (kyook@caltech.edu, for now) to do that.  
  
'''New expression pattern for a gene : '''  Indicates that the paper reports new temporal or spatial (e.g., tissue, subcellular, etc.) data on the pattern of expression of any gene in a wild-type background. You can include reporter gene analysis, antibody staining, In situ hybridization, RT-PCR, Western or Northern blot data.  Postgres character name: '''  otherexpr.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
--[[User:Kyook|kjy]] 19:53, 2 December 2011 (UTC)
  
'''Alterations in gene expression by genetic or other treatment : '''  Indicates that the paper reports changes or a lack of changes in gene expression levels or patterns in response to genetic, chemical, temperature, or any other experimental treatment.  Postgres character name: '''  genereg.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
==Journal first-pass form (jfp) : GENETICS papers only==
 +
This is not a postgres table but is a form that collects extra data from Genetics authors, which is then stored in our author first pass table (afp).
 +
 +
The form URL is generated by Genetics through the doi ticket form.  
  
'''Regulatory sequence features : '''  Indicates that the paper reports any gene expression regulatory elements, e.g., DNA/RNA elements required for gene expression, promoters, introns, UTR's, DNA binding sites, etc.  Postgres character name: '''  seqfeat.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
The purpose of the genetics first pass form is to collect data objects that do not exist in WB already so that Arun can mark-up the Genetics paper and provide links for these objects.<br>
  
'''Position frequency matrix (PFM) or Position weight matrix (PWM): '''  Indicates that the paper reports PFMs or PWMs, which are typically used to define regulatory sites in genomic DNA (e.g., bound by transcription factors) or mRNA (e.g., bound by translational factors or miRNA). PFMs define simple nucleotide frequencies, while PWMs are scaled logarithmically against a background frequency.  Postgres character name: '''  matrices.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
New entities (postgres table) are collected for the following data types:
 +
*gene (genesymbol)
 +
*variations (extvariation)
 +
*CGC-sent strains (newstrains)
 +
*integrated transgene (transgene)
 +
*balancers
 +
*cell/anatomy(newcell)
 +
*phenotype
  
'''Microarray: ''' Indicates that the paper reports microarray-derived dataPostgres character name: '''  microarray .'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
All of these fields, except genesymbol, do not show on the normal afp_form. We opted to make a hybrid of the afp_form for the Genetics authors so that they would not be requested to fill out another WB generated form for us after their paper was published and because we needed this extra information from them asap.   
  
'''Enter new protein: '''  Lists proteins not found in WormBase.  Postgres character name: '''  newprotein.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
It is also my understanding that these authors would be required to fill out the form as part of the publication process, so this was an opportunity to have 100% author feedback for paper flagging.  
  
'''Protein analysis in vitro: '''  Indicates that the paper reports any in vitro protein analysis such as kinase assays, agonist pharmacological studies, reconstitution studies, etc.  Postgres character name: '''  invitro.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
Data curators are automatically alerted to the new data when the form is submitted.
  
'''Domain analysis: '''  Indicates that the paper reports results of experimentation done on a particular domain within a protein to assay the function of that domain.  Postgres character name: '''  domanal.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
==[http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 Sample GENETICS Firstpass form]==
  
'''Covalent modification : '''  Indicates that the paper reports on post-translational modifications of a gene product, as assayed by mutagenesis or in vitro analysis.  Postgres character name: '''  covalent.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
==[[jfp postgres table details]]==
  
'''Structural information: '''  Indicates that the paper reports protein structural analysis, through NMR, X-Ray crystallography, etc.  Postgres character name: '''  structinfo.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
==[[Data types used on first-pass forms]]==
  
'''Mass spectrometry : '''  Indicates that the paper reports protein mass analysis through any mass spectrometry analysis (MS/MS, LCMS, HRMS).  Some Mass spec analysis programs include MASCOT, SEQUEST, X!Tandem, OMSSA, MassMatrix.  Postgres character name: '''  massspec.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 
  
'''Enter C. elegans antibodies : '''  List of antibodies generated in a noncommercial laboratory, against a C. elegans gene product.  Postgres character name: '''  antibody.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi? action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 
  
'''Enter integrated transgenes  : '''  List of integrated transgenes used in this paper that doesn't exist in WormBase already, especially if the transgene does not have a canonical name: '''  .  Postgres character name: '''  transgene.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 
  
'''Transgenes used as tissue markers: '''  List of reporters (integrated transgenes)  used to mark certain tissues, subcellular structures, or life stages, etc. as a reference to assay site of action of gene function or location.  Postgres character name: '''  marker.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
+
[[Category:Curation]]
 
+
[[Category:GSA_markup]]
'''Gene structure correction : '''  Indicates that the paper reports a gene structure that is different from the one in WormBase, e.g., different splice-site, SL1 instead of SL2, etc.  Postgres character name: '''  structcorr (this use to be two different fields).'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 
 
 
'''Sequencing mutant alleles : '''  Indicates that the paper reports sequence data for any mutation.  Postgres character name: '''  seqchange.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 
 
 
'''New SNPs : '''  List of SNPs reported in the paper that don't exist in WormBase already.  Postgres character name: '''  newsnp.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 
 
 
'''Enter new cell/anatomy term: '''  List of C. elegans cells or anatomy parts reported in the paper that doesn't exist in WormBase already. Currently unknown if cells or anatomy parts of other nematodes will be collected here.  Postgres character name: '''  newcell'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 
 
 
'''Ablation data : '''  Indicates that the paper reports data from an assay involving any cell or anatomical unit being ablated by laser or by other means (e.g., by expressing a cell-toxic protein).  Postgres character name: '''  ablationdata.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 
 
 
'''Cell function : '''  Indicates that the paper reports a function for any anatomical part (e.g., cell, tissue, etc.), which has not been indicated elsewhere on this form.  Postgres character name: '''  cellfunc.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 
 
 
'''Phylogenetic data: '''  Indicates that the paper discusses evolutionary relationships between or among genes or gene products.  Postgres character name: '''  phylogenetic.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 
 
 
'''Other bioinformatics analysis: '''  Indicates that the paper reports other bioinformatic data not indicated anywhere else on this form.  In general, this may include alignments.  Postgres character name: '''  othersilico.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 
 
 
'''Supplemental materials : '''  In the Author First-Pass form, checking this box indicates that supplementary material is attached to the paper.  In the curator first-pass checking the box indicates that the Supplementary Materials are missing.  Postgres character name: '''  supplemental.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 
 
 
'''NONE of the aforementioned data types are in this research article : '''  This is used as a default category for any paper where the author checked "here" for review or non primary research paper.  Postgres character name: ''' nocuratable.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 
 
 
'''Enter authors: '''  List of authors for updating the paper details and author paper connections.  Postgres character name: '''  authors.'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 
 
 
'''Feedback: '''  This is used to record thoughts, notes, comments about the form etc.  Postgres character name: '''  comment .'''  Postgres table:  [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/first_pass.cgi?action=Curate&paper=00000003&passwd=1228446342.8668923 afp], [http://tazendra.caltech.edu/~Postgres/cgi-bin/curator_first_pass.cgi cfp], [http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00000001&passwd=1241223658.1418967 jfp]. <br>
 

Latest revision as of 20:36, 23 July 2013

[back]


Mark up policy

Mark Up Work Flow

Media:slide1.jpg

this file diagrams the most recent and most detailed workflow we established, which includes editorial roles at Dartmouth. The steps below is a brief overview of the process.

These following steps outline the work flow for Textpresso markup starting with the acceptance of the paper by GENETICS and through to the incorporation of the paper into WB.

The responsible party is noted before each step.

Genetics (1) The paper gets accepted.

Genetics (2) A DOI is assigned to the paper

Genetics (3) A WBPaperID and URL are acquired through the ticketing form at: http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_paper_ticket.cgi

Genetics (4) The WBPaperID from the ticket issuer is attached to the XML

Genetics (5) the author is sent the generated URL to access the author form

Author (6) Authors provide the final source files and fills out the author first pass form within 48 hours of acceptance to GENETICS.

  • Author data populates journal first-pass tables in postgres, new objects are automatically added to the appropriate lexicon.
  • New object data from the journal first-pass form is sent to Karen, who makes sure the information is entered correctly
  • New data goes to the appropriate data curator for acedb object creation.
  • All other data populates author first-pass tables and paper is placed in the pipeline for first-pass curation (Juancarlos will make a filter so these papers will be prioritized for first-pass).

DJS (Dartmouth Journal Services) (~5-8) sends the final source file (in XML format) to Arun - deposits XML on a web service? http://textpresso-dev.caltech.edu/gsa/worm/incoming_xml/

  • An e-mail is sent to the QC curator that a paper is coming and to expect a follow up e-mail with a link to the linked paper and entity table.

Automatic (8) the paper is run through the linking script.

Juancarlos (~9) gets XML from Arun, populates bibliography information for WormBase.?

Script (10) sends markup back to DJS, this action is prompted by a submit button on QCFast.

--kjy 19:44, 19 April 2012 (UTC)

Instructions for Genetics

The following are instructions to the journal and to the authors.

Note to authors:

The following message is sent to authors by GENETICS upon acceptance of their paper and after the DOI has been assigned and WBPaperID retrieved.

"GENETICS is working with textpresso (www.textpresso.org) and WormBase (www.wormbase.org) to create links between genetic and genomic objects that are in your paper to the appropriate page in WormBase. These links will be included in both the online full text and PDF formats of your paper.

If you want any genes, alleles, transgenes, CGC-destined strains, anatomy terms, etc., discovered or described in your paper to be linked to WormBase please enter the names of these objects using the form at the following link: http://tazendra.caltech.edu/~azurebrd/cgi-bin/forms/journal/journal_first_pass.cgi?action=Curate&paper=00040237&passwd=1317224998.8903801

Follow the examples carefully as your submitted data will be processed automatically. If you would rather upload a file, please contact kyook@caltech.edu.

Thank you for your help."

previous notes to authors
--kjy 19:40, 2 December 2011 (UTC)

Note to GENETICS:

This form (link sent) represents a user-friendly interface to a postgres database table stored at WormBase Caltech.

Once the authors hit 'flag' (the button at the bottom of the page) their data will:

  • be added to Arun's object markup list, so new objects can be marked up in the paper along with known WB objects. NOTE: these objects will need to be entered correctly and cleanly as everything that appears on his markup list will be linked.
  • populate the postgres table and the corresponding first pass paper curation form used by fp curators
  • be marked in our first pass curation list as having author data, therefore marked for priority curation

The WB QC curator will receive an e-mail alerting them to the submitted data, at which point it will be checked over to make sure it is in the proper format for Arun's scripts and corrected if needed.

We won't be able to take uploaded tables for the data types; the parsing of this data will be problematic for Texptresso as we anticipate people uploading the data in many different formats. Nonetheless, if they do want to upload a file, for data that doesn't exist in their supplemental materials, they are welcome to contact one of us (kyook@caltech.edu, for now) to do that.

--kjy 19:53, 2 December 2011 (UTC)

Journal first-pass form (jfp) : GENETICS papers only

This is not a postgres table but is a form that collects extra data from Genetics authors, which is then stored in our author first pass table (afp).

The form URL is generated by Genetics through the doi ticket form.

The purpose of the genetics first pass form is to collect data objects that do not exist in WB already so that Arun can mark-up the Genetics paper and provide links for these objects.

New entities (postgres table) are collected for the following data types:

  • gene (genesymbol)
  • variations (extvariation)
  • CGC-sent strains (newstrains)
  • integrated transgene (transgene)
  • balancers
  • cell/anatomy(newcell)
  • phenotype

All of these fields, except genesymbol, do not show on the normal afp_form. We opted to make a hybrid of the afp_form for the Genetics authors so that they would not be requested to fill out another WB generated form for us after their paper was published and because we needed this extra information from them asap.

It is also my understanding that these authors would be required to fill out the form as part of the publication process, so this was an opportunity to have 100% author feedback for paper flagging.

Data curators are automatically alerted to the new data when the form is submitted.

Sample GENETICS Firstpass form

jfp postgres table details

Data types used on first-pass forms