Life Stage

From WormBaseWiki
Jump to navigationJump to search

Life_Stage Data Model

////////////////////////////////////////////////////////////////////////////////////

?Life_stage   Public_name UNIQUE Text   //added by Daniela 120509
              Contains Cell ?Cell XREF Life_stage
                   Cell_group ?Cell_group XREF Life_stage
                   Anatomy_term ?Anatomy_term XREF Life_stage
                   Sub_stage ?Life_stage XREF Contained_in
              Contained_in ?Life_stage XREF Sub_stage            // changed for Wen 010516
              Preceded_by ?Life_stage XREF Followed_by
              Followed_by ?Life_stage XREF Preceded_by
              Expr_pattern ?Expr_pattern XREF Life_stage
              Remark	?Text #Evidence               
               Definition UNIQUE ?Text #Evidence// added for Wen [020607 krb]
               Other_name ?Text // added for Wen [020607 krb]
               Reference ?Paper XREF Life_stage 
               Curated_by UNIQUE Text // added for Sylvia [010928 krb]
              
            
///////////////////////////////////////////////////////////////////////////////////

obo file

https://raw.github.com/draciti/Life-stage-obo/master/worm_development.obo

Life_Stage

Originally Life_stage objects were stored as names. In May 2012 Caltech decided to switch Life_stage objects from names to IDs. In order to do that we need a model change to store the original name (see model).

All data types that were using Life_stage had to be modified and dumped into Ids instead of names

  • Phenotype -Karen
  • Process -Karen
  • RNAi -Chris
  • gene regulation -Xiaodong
  • gene expression -Daniela
  • Anatomy_term -Raymond
  • Expression Cluster -Wen
  • Condition -Wen & Gary W
  • Strain -Mary Ann

Life_Stage tags

tags used in Life_stage extracted from the Life_Stage. ace dump May 9th 2012

  • Definition
  • Anatomy_term
  • Reference
  • Other_name
  • Expr_pattern
  • Contained_in -> Change data into IDs
  • Cell_group
  • Sub_stage -> Change data into IDs
  • Cell
  • Followed_by -> Change data into IDs
  • Remark -> goid: WBls could be deleted because will be redundant with IDs
  • Preceded_by -> Change data into IDs


Action items

.ace file to modify located here: /home/acedb/draciti/Life_stage and is named Life_stageWS230.ace

Modify the .ace dump as follows

1) Object should be replaced with ID instead of name using the mapping table

2) A new tag called Public_name should be added and should contain the name that was previously the object name

3) the following tags should be replaced with ID instead of name

  • Contained_in
  • Followed_by
  • Preceded_by
  • Sub_stage

4) The lines containing Remark "goid: WBls:#######" should be deleted


Mapping table is here http://tazendra.caltech.edu/~azurebrd/var/work/lifestage

script at : /home/acedb/draciti/Life_stage/parse.pl output goes to script. Errors from lack of mappings are in the file. current output in file "out"


the script that updates the OA runs at 8pm every day and is at /home/postgres/work/pgpopulation/obo_oa_ontologies/update_obo_oa_ontologies.pl

gets the data from here https://raw.github.com/draciti/Life-stage-obo/master/worm_development.obo

The script runs when the date in the obo file changes.

Deleting terms

In WS230 the following terms were used in Citace Minus but were not present in the obo file:

L1 Larva Male L2 Larva male L3 Larva male

the last term generated was L4 Larva male:

[Term]
id: WBls:0000073
name: L4 larva male
def: "The fourth stage larva male. At 25 Centigrade\, it ranges 40-49.5 hours after fertilization\, 26-35.5 hours after hatch." [wb:wjc]
is_a: WBls:0000038 ! L4 larva


Raymond suggested to leave the sex distinction out of the ontology. We are therefore NOT adding these new terms. Checked in the Life stage WS230 dump -> no data associated with these terms, we could delete them from the obo file and modify citace minus accordingly. Waiting for Wen's feedback -> OK. Deleted the 3 L1 Larva Male, L2 Larva male, L3 Larva male, L4 larva hermaphrodite, mixed, mixed stage terms from Citace Minus

Deposited on spica (Data for CitaceMinus) the file Life_stageD with the corrections and the Name_to_ID change. (05152012DR) Create terms for one cell embryo and oocyte?

Adding terms

Added 11-15 days post-L4 adult hermaphrodite using obo edit as per Wen Chen request (WBPaper38462).

id: WBls:0000074
name: 11-15 days post-L4 adult hermaphrodite
def: "At 20 Centigrade\\: 11-15 days after L4-adult molt. 14-18 days after first
cleavage." [wb:dr]
is_a: WBls:0000057 ! adult hermaphrodite
created_by: danielaraciti
creation_date: 2012-05-22T14:35:04Z


Aligning worm_development_obo with acedb

Daniela will hand over the Life_stage ontology to Raymond (February 2019) As per their discussion there will be no model changes to align the worm_development_obo file with acedb. The only change was to add A UNIQUE ans #Evidence hash in the Definition, in order to capture the paper evidence.

1) Is_a and part_of relationships

Is_a 
and
Part_of 
relationships will both convert into Contained_in

2) Preceded_by and Immediately_preceded_by  relationships

Preceded_by
and
Immediately_preceded_by 
relationships will both convert into Preceded_by

3) Synonym will convert into Other_name

See the section: 'Added synonyms in the obo file' in this wiki (March 21st 2017)

4) Comment will convert into Remark
These lines we will –D on CitaceMinus, as they will come from the obo file converted into .ace:
Public_name                                
Sub_stage ?Life_stage XREF Contained_in 
Contained_in ?Life_stage XREF Sub_stage  
Preceded_by ?Life_stage XREF Followed_by               
Followed_by ?Life_stage XREF Preceded_by   
Remark	?Text #Evidence                               
Definition ?Text 
Other_name ?Text // added for Wen [020607 krb]  

conversion from obo to .ace

Added preceded_by relationships in the obo file using the dump "LS_Preceded_by.ace" provided by Wen -DR. Added additional preceded_by relationships that were not in the file-DR. The file is located on Lario here /Users/danielaraciti/Desktop/Life_stage/Relationships.

The conversion from obo to .ace should be as follows


OBO
id: WBls:0000007
name: 2-cell embryo
def: "0-20min after first cleavage at 20 Centigrade.  Contains 2 cells." [wb:wjc]
is_a: WBls:0000005 ! blastula embryo
relationship: preceded_by WBls:0000006 ! 1-cell embryo


.ace
Life_stage : "WBls:0000007"
Public_name	 "2-cell embryo"
Definition	 "0-20min after first cleavage at 20 Centigrade.  Contains 2 cells."
Contained_in	 "WBls:0000005"
Preceded_by	 "WBls:0000006"


  • script at :

/home/acedb/draciti/Life_stage/lifestageAceFromObo.pl

  • generates :

/home/acedb/draciti/Life_stage/lifestage.ace

Here's the relevant matching :
    if ($obj =~ m/id: (WBls:\d+)/) { $id = $1;
      $ace .= qq(Life_stage : "$id"\n);
      if ($obj =~ m/name: (.*)/) { $ace .= qq(Public_name\t"$1"\n); }
      if ($obj =~ m/def: "(.*)"/) { $ace .= qq(Definition\t"$1"\n); }
      if ($obj =~ m/is_a: (WBls:\d+)/) { $ace .= qq(Contained_in\t"$1"\n); }
      if ($obj =~ m/relationship: preceded_by (WBls:\d+)/) { $ace .=
qq(Preceded_by\t"$1"\n); }
      $ace .= qq(\n);

meaning :
- the id is the WBls
- the name is everything
- the def is what's in quotes
- the is_a is the WBls
- the relationship is the WBls and it must say "relationship: precededby WBls:\d+"


It always downloads the .obo file from the URL https://raw.github.com/draciti/Life-stage-obo/master/worm_development.obo, and always regenerates the .ace file.


Broad synonyms

Ref Issue #72 on Textpresso github Nematoda Life Stage Category - Term Names not Optimal for Searching #72

Juancarlos automatically scripted in a broad synonym for all the terms that had Ce as suffix, so to have the life stages more amenable for textpresso searches.

Script here:/home/acedb/draciti/Life_stage/Broad_synonym/parse_worm_development_obo.pl

output:worm_development.obo.out

Committed on Git on July 13th 2016

Added synonyms in the obo file

Since we are going to dump all the ontology data from the obo file and convert into .ace, we will -D the Other_name lines on Citace minus, need to transfer the missing synonyms into the obo:

Added synonyms in the obo file, 9 instances
they were listed in the .ace as Other_name

1)
Life_stage : "WBls:0000006"
Public_name	 "1-cell embryo Ce"
Other_name	 "zygote"

2)
Life_stage : "WBls:0000011"
Public_name	 "51-cell embryo Ce"
Other_name	 "50-70 cell embryo Ce"

3)
Life_stage : "WBls:0000012"
Public_name	 "88-cell embryo Ce"
Other_name	 "90 cell embryo Ce"
Did not add it as synonym in the obo as they seem to be 2 different stages


4)
Life_stage : "WBls:0000020”
Public_name	 "3-fold embryo Ce"
Other_name	 "pretzel embryo"

5)
Life_stage : "WBls:0000021”
Public_name	 "fully-elongated embryo Ce"
Other_name	 "pre-hatched embryo Ce"

other synonyms were already accounted for in the .ace

-D Citace Minus

Citace minus dumps back ups here: Users/danielaraciti/Desktop/Life-stage-obo/Citace minus back ups February 2019

Discuss with Wen what is best to do for the -D file to submit to CitaceMinus. Daniela dumped the Life stage class from citace (file life_stage111213.ace) and -D all the lines that were present in the lifestage.ace generated by Juancarlos' script. All files are on lario here: /Users/danielaraciti/Desktop/Life_stage/DashD_november_2013 (life_stage111213.ace, lifestage.ace, and DashD_life_stage111213.ace)

If you decide to move away from citace minus completely, remark in Citace Minus could be stored in Comment in OBO edit and the comment in OBO edit should then be dumped as remark from Juancarlos' script. Same thing for othername, could go into synonyms and then dumped as other name.

One option would be to remove the whole class from citace minus. In this way we can rely just on the obo file and run the script that converts into .ace, then upload the lifestage.ace on citace at each upload.

In order to do that we should check that all tags are being converted in the .ace file

  • Definition ->  in OBO file
  • Public_name ->  in OBO file
  • Contained_in ->  in OBO file
  • Preceded_by ->  in OBO file
  • Followed_by ->  populated via acedb
  • Sub_stage -> populated via acedb
  • Anatomy_term -> populated through the Anatomy_term class?
  • Reference -> populated how?
  • Expr_pattern  -> populated through the Expression pattern class?
  • Cell_group is this coming from the cell group class?
  • Cell is this coming from the cell class?

There are few tags not accounted in the obo file (Anatomy_term, Reference, Cell_group, Cell). So we have decided to keep the class in citace minus and add-remove as needed.


Synonym -> Other_name
synonym: ""
should convert in
Other_name	 ""
Comment -> Remark
comment: 
should convert in 
Remark	 ""

part_of should convert into contained_in starts_at_the_end_of should convert into preceded_by

some of the changes to apply to the script if we decide to delete the whole class are:

I need to modify an existing script
the script is here:
/home/acedb/draciti/Life_stage/lifestageAceFromObo.pl*

and generates /home/acedb/draciti/Life_stage/lifestage.ace 

it takes data from this obo file:
 https://raw.github.com/draciti/Life-stage-obo/master/worm_development.obo

and creates an .ace.



similarly as 
if ($obj =~ m/is_a: (WBls:\d+)/) { $ace .= qq(Contained_in\t"$1"\n); }


and starts_at_the_end_of should convert into Immediately_preceded_by

similarly as:
if ($obj =~ m/relationship: starts_at_the_end_of (WBls:\d+)/) { $ace .=
qq(Immediately_receded_by\t"$1"\n); }

The comment field should be dumped as remark

Comment -> Remark
comment: 
should convert in 
Remark	 ""

and synonym into Other_name

Synonym -> Other_name
synonym: ""
should convert in
Other_name	 ""
Done-> See the section: Added synonyms in the obo file in this wiki


alt_IDs

There are only 2 entry that have alt_IDs in the obo file:

[Term]
id: WBls:0000077
name: Brugia sheathed microfilaria
alt_id: WBls:0000662

and 

[Term]
id: WBls:0000663
name: Brugia unsheathed microfilaria
alt_id: WBls:0000078

Only Expr11949 was using the term WBls:0000078: changed in WBls:0000663 in expression_OA

Gene association file for Life_stage

Specifications for a GAF for gene-life_stage data

other species

generic terms for other species -i.e. non C.elegans, non B. Malayi


Life_stage : "WBls:0000101"
Public_name	"all stages"
Definition	"All developmental stages, including embryo, larva and adult stage."
Contained_in	"WBls:0000075"

Life_stage : "WBls:0000102"
Public_name	"embryo"
Definition	"The whole period of embryogenesis, from the formation of an egg till its hatch."
Contained_in	"WBls:0000101"

Life_stage : "WBls:0000103"
Public_name	"postembryonic"
Definition	"The stage after hatch till death."
Contained_in	"WBls:0000101"
Preceded_by	"WBls:0000102"

Life_stage : "WBls:0000104"
Public_name	"adult"
Definition	"The stage after an animal is fully-developed and reaches maturity."
Contained_in	"WBls:0000103"
Preceded_by	"WBls:0000105"

Life_stage : "WBls:0000105"
Public_name	"larva"
Definition	"From the time after hatch till becomes adult."
Contained_in	"WBls:0000103"

Life_stage : "WBls:0000106"
Public_name	"L1 larva"
Definition	"The first stage larva."
Contained_in	"WBls:0000105"
Preceded_by	"WBls:0000110"

Life_stage : "WBls:0000107"
Public_name	"L2 larva"
Definition	"The second stage larva."
Contained_in	"WBls:0000105"
Preceded_by	"WBls:0000106"

Life_stage : "WBls:0000108"
Public_name	"L3 larva"
Definition	"The third stage larva."
Contained_in	"WBls:0000105"
Preceded_by	"WBls:0000107"

Life_stage : "WBls:0000109"
Public_name	"L4 larva"
Definition	"The fourth stage larva."
Contained_in	"WBls:0000105"
Preceded_by	"WBls:0000108"

Life_stage : "WBls:0000110"
Public_name	"sheathed microfilaria"
Definition	"Microfilaria is the stage which develops from the egg and precedes the L1 stage. The larvae are surrounded by a membrane thought to be derived from the vitelline membrane."
Contained_in	"WBls:0000105"

Further additions

please list below the life stages that have been described or used in the literature for other species. This list will be periodically revised in order to expand the life stage ontology. If there is a formal description of the life stage i.e. definition of "sheathed microfilaria", "enclosing embryo" etc.. please include it with a reference (WBPaperID).

  • C. briggsae
    • ...
  • C. japonica
    • ...
  • C. remanei
    • ...
  • C. sp11
    • ...
  • P. pacificus
    • ...
  • A. suum
    • ...


Nematode Lifecycles

C elegans life cycle.jpg

Entomopathogenic nematodes

Entomopathogenic.png