WormBase Model:Construct

From WormBaseWiki
Jump to navigationJump to search

back to WormBase_Models

Proposed model changes

Purpose: the technology of engineering mutations and gene replacement has been developed in C. elegans. With these new research tools, capture and display of the molecular information of these alleles needs to be updated. We propose a new class, Construct, to capture the specifics of the DNA tool used to perform the replacement or engineering, while the Variation model gets updated to record the engineering event itself and its impact on the genome. As a side benefit to the creation of the Construct model, we can use this new class to also capture the genomic arrays used create transgenes.

Variation

Proposed additions

?Variation
Variation_type 
    Engineered_allele 
Variation_summary ?Text//to house final engineered construct
Derived_from ?Construct XREF Engineered_variation
Corresponing_transgene Unique ?Transgene XREF Corresponding_variation
Method 
    Homologous_recombination //Homologous_recombination
    NHEJ //Non-homologous DNA end-joining, imprecise DNA repair
    MosSci
    TALENs
    CRISPR_Cas9
    ZFN-NHEJ repair //Zinc-finger nuclease
    ZFN-HR repair
Expr_pattern ?Expr_pattern XREF Variation #Evidence

notes on variation model changes

see discussion tab

WBPaper00045772 Paix et al., 2014
Discusses Homology directed repair (HDR) vs Nonhomologous end-joining (NHEJ). It would seem that distinguishing the method used to create engineered alleles is important. However, the tags should perhaps be modified to distinguish double stranded break method and repair method.

break_made_by
  Mos_excision
  CRISPR/Cas9
  TALENs
  Zinc_finger_endonuclease
  OTHER

repaired_by
  nonhomologous_end_joining_NHEJ
  homologous_direct_repair_HDR


--Kyook (talk) 21:17, 16 October 2014 (UTC)

Construct (new)

Class itself is new

?Construct Evidence #Evidence
	   Curator_confirmed	?Person
	   Public_name ?Text
	   Other_name ?Text
	   Summary ?Text                      //genotype like [Pmyo-2::GFP]
	   Sequence_feature ?Feature XREF Associated_with_construct
	   Driven_by_gene ?Gene XREF Drives_construct #Evidence
	   Gene ?Gene XREF Construct_product #Evidence
	   3_UTR ?Gene #Evidence
	   Fusion_reporter ?Text              //fluorescent proteins GFP, RFP, mCherry, etc.
	   Other_reporter ?Text               //to add reporters, tags that aren�-F¢t included in model�-A
	   Purification_tag ?Text             //FLAG, HA, Myc, TAP, etc.
	   Recombination_site ?Text           //LoxP, FRT
	   Type_of_construct  Chimera
			      Domain_swap
			      Engineered_mutation
			      Fusion
			      Complex                    // complex changes
			      Transcriptional_fusion
			      Translational_fusion
			      Nterminal_translational_fusion
			      Cterminal_translational_fusion
			      Internal_coding_fusion
	   Selection_marker ?Text
	   Construction_summary  ?Text        // Backbone vector, mol bio
	   DNA_text Text                      // for mapping, can include entire construct sequence
	   Clone ?Clone XREF Construct
	   Used_for Transgene_construct ?Transgene XREF Construct
		    Transgene_coinjection ?Transgene XREF Coinjection
		    Engineered_variation ?Variation XREF Derived_from_construct
		    Topic_output_indicator ?WBProcess XREF Marker_construct
		    Expression_pattern ?Expr_pattern XREF Construct
		    Interactor ?Interaction
	   Reference ?Paper
	   Person ?Person
	   Laboratory ?Laboratory #Lab_Location
	   Historical_gene ?Gene #Evidence
	   Remark ?Text #Evidence

notes on construct model

see discussion tab

#Interactor_info

 Construct ?Construct XREF Interactor

Interaction

 Unaffiliated_construct   ?Construct
 Detection_method    Construct

Expression_pattern

proposed addition

Variation ?Variation XREF Expression_pattern
Construct ?Construct XREF Expression_pattern

Gene

proposed change

Drives_Transgene ?Transgene XREF Driven_by_gene
change to 
Drives_construct ?Construct XREF Driven_by_gene

Transgene

Proposed changes: Many of the transgene tags have been moved to the proposed ?Construct model, the remaining tags as well as some additions are shown below

?Transgene      Evidence #Evidence
		Public_name UNIQUE ?Text
		Summary UNIQUE ?Text
		Synonym ?Text
		Corresponding_variation UNIQUE ?Variation XREF Corresponding_transgene //put in to unambiguously associate the allele/transgene
		Construction Construct   ?Construct XREF Transgene_construct
     			     Coinjection ?Construct XREF Transgene_coinjection
     			     Coinjection_other ?Text //for coinjection markers that are not specified as a construct
     			     Integration_method UNIQUE ?Text
     			     Integrated_from ?Transgene XREF Transgene_derivation
     			     Laboratory ?Laboratory #Lab_Location
     			     Author ?Author
		Construction_summary ?Text
		Genetic_information Extrachromosomal
				    Integrated
				    Map ?Map  #Map_position  //needed for transgenes with no granular mapping, e.g., just mapped to a LG
                                    Mapping_data 2_point ?2_point_data      //deleted for WS245, rolled back for WS246
				    Map_evidence #Evidence
				    Phenotype ?Phenotype XREF Transgene #Phenotype_info
				    Phenotype_not_observed ?Phenotype XREF Not_in_Transgene #Phenotype_info
		Used_for Transgene_derivation ?Transgene XREF Integrated_from
			 Expr_pattern ?Expr_pattern XREF Transgene
			 Marker_for   ?Text #Evidence
			 Interactor ?Interaction
		Associated_with Marked_rearrangement ?Rearrangement XREF By_transgene
				Strain ?Strain XREF Transgene
		Reference ?Paper XREF Transgene
		Species UNIQUE ?Species
		Remark ?Text #Evidence
                Historical_gene   ?Gene #Evidence

WBProcess

    Marker_construct ?Construct XREF Topic_output_indicator

Test data

Variation objects with constructs

LP132 nmy-2(cp7[nmy-2::gfp + LoxP unc-119(+) LoxP]) I; unc-119(ed3) I
Variation : "WBVar020000000"
Public_name "cp7"
Engineered_allele 
Variation_summary "[nmy-2::gfp + LoxP unc-119(+) LoxP]"
Derived_from "WBConstruct00000010"
Derived_from "WBConstruct00000011"
Homologous_recombination

Construct : "WBConstruct00000010"
Summary "[nmy-2::gfp]
Driven_by_gene "WBGene00003777"
Fusion_reporter "GFP"
N-terminal_translational_fusion

Construct : "WBConstruct00000011"
Summary "[LoxP unc-119(+) LoxP]
Recombination_site "LoxP"
Gene "WBGene00003777"

bus-50(e5000[T110E]) = An engineered missense mutation

bus-50(e5001[bus-50::gfp]) aka bus-50::gfp = 
 An engineered fusion of GFP to the C-terminus of BUS-50
Variation : "WBVar0200000001"
Public_name "e5001"
Engineered_allele 
Variation_summary "[bus-50::gfp]"
Derived_from "WBConstruct00000012"
CRISPR-Cas9

Construct : "WBConstruct00000012"
Summary "[bus-50::gfp]"
Gene "WBGene0020000001"
Fusion_reporter "GFP"
C-terminal_translational_fusion


bus-50(e5002[bus-50::gfp + loxP unc-119(+) loxP]) 
 An engineered insertion of GFP plus the unc-119(+) selectable marker, flanked by loxP sites.
Variation : "WBVar020000003"
Public_name "e5002"
Engineered_allele 
Variation_summary "[bus-50::gfp + loxP unc-119(+) loxP]"
Derived_from "WBConstruct00000012"
Derived_from "WBConstruct00000011"
HR

Construct : "WBConstruct00000012"
Summary "[bus-50::gfp]"
Gene "WBGene0020000001"
Fusion_reporter "GFP"
C-terminal_translational_fusion


Construct : "WBConstruct00000011"
Summary "[LoxP unc-119(+) LoxP]
Recombination_site "LoxP"
Gene "WBGene00003777"


bus-50(e5003[bus-50::gfp +loxP]) aka bus-50(e5003) = 
 bus-50(e5002) following Cre-mediated recombinase removal of unc-119(+) leaving a single loxP site
Variation : "WBVar020000004"
Public_name "e5003"
Engineered_allele 
Variation_summary "[bus-50::gfp + loxP]"
Derived_from "WBConstruct00000012"
Derived_from "WBConstruct00000011"
HR

Construct : "WBConstruct00000012"
Summary "[bus-50::gfp]"
Gene "WBGene0020000001"
Fusion_reporter "GFP"
C-terminal_translational_fusion

Construct : "WBConstruct00000011"
Summary "[LoxP unc-119(+) LoxP]
Recombination_site "LoxP"
Gene "WBGene00003777"

Variation object with transgene

eIs2002 = eIs2002[unc-119::gfp] = eIs2002[unc-119::gfp, III:2992500]  
 Engineered insertions in apparent intergenic region with optional 
 descriptors (nature of the insertion or position in the genome)
Variation : "WBVar020000005"
Public_name "eIs2002"
Engineered_allele 
Variation_summary "[unc-119::gfp]"
Derived_from "WBConstruct00000013"
Identical_transgene "WBTransgene00024514"
MosSci

Construct : "WBConstruct00000013"
Summary "[unc-119::gfp]"
Gene "WBGene0020000001"
Fusion_reporter "GFP"
Translational_fusion

Transgene : "WBTransgene00024514"
Public_name "eIs2002"
Summary "[unc-119::GFP]"
Construct "WBConstruct00000013"


ozIs909, or ozIs909[unc-119::mCherry *eIs2002] = 
  Engineered changes to existing Is (or Si) insertions, which should 
  receive new Is numbers using originating lab’s prefix. The original 
  Is insertion is indicated in brackets with a preceding asterisk (*), 
  in order to allow searches for all derivatives from a given insertion. 
  In this case, an engineered change from GFP to mCherry in eIs2002
Variation : "WBVar020000006"
Public_name "ozIs909"
Engineered_allele 
Variation_summary "[unc-119::mCherry *eIs2002]"
Derived_from "WBConstruct00000015"
Identical_transgene "WBTransgene00024515"
MosSci

Construct : "WBConstruct00000015"
Summary "[unc-119::mCherry]"
Gene "WBGene0020000001"
Fusion_reporter "mCherry"
Translational_fusion

Transgene : "WBTransgene00024515"
Public_name "ozIs909"
Summary "[unc-119::mCherry]
Construct "WBConstruct00000015"

Transgene with construct

The following depicts how current transgenes would be redistributed into the proposed Construct and Transgene models

(Original)Transgene : "WBTransgene00000001"
Public_name	"adEx1256"
Summary	"[egl-19::sGFP-NLS + lin-15(+)]"
Reporter_product	"GFP"
Driven_by_gene	"WBGene00001187"
Strain	"DA1256"
Reference	"WBPaper00029359"
Reporter_type	"Transcriptional fusion"
Synonym	"[C48A7.1::gfp]"


(New)Transgene : "WBTransgene00000001"
Public_name	"adEx1256"
Summary	"[egl-19::sGFP-NLS + lin-15(+)]"
Construct "WBConstruct00000001"
Coinjection_other "lin-15(+)"
Extrachromosomal
Strain	"DA1256"
Reference	"WBPaper00029359"

(New)Construct : WBConstruct00000001
Public_name	"adEx1256"
Other_name	"[C48A7.1::gfp]"
Summary	"[egl-19::sGFP-NLS]"
Fusion_reporter	"GFP"
Driven_by_gene	"WBGene00001187"
Reference	"WBPaper00029359"
Transcriptional_fusion


(Original)Transgene : "WBTransgene00000011"
Public_name	"adIs1240"
Summary	"[lin-15(+) eat-4::sGFP]"
Reporter_product	"GFP"
Driven_by_gene	"WBGene00001135"
Strain	"DA1240"
Strain	"DA1243"
Map	"X"
Map_evidence	Paper_evidence	"WBPaper00038205"
Reference	"WBPaper00030960"
Reference	"WBPaper00032252"
Reference	"WBPaper00035265"
Reference	"WBPaper00036277"
Reference	"WBPaper00036704"
Reference	"WBPaper00037626"
Reference	"WBPaper00038205"
Reference	"WBPaper00044482"
Synonym	"[eat-4::gfp]"

(New)Transgene : "WBTransgene00000011"
Public_name	"adIs1240"
Summary	"[lin-15(+) eat-4::sGFP]"
Strain	"DA1240"
Strain	"DA1243"
Map	"X"
Map_evidence	Paper_evidence	"WBPaper00038205"
Coninjection_other "lin-15(+)"
Reference	"WBPaper00030960"
Reference	"WBPaper00032252"
Reference	"WBPaper00035265"
Reference	"WBPaper00036277"
Reference	"WBPaper00036704"
Reference	"WBPaper00037626"
Reference	"WBPaper00038205"
Reference	"WBPaper00044482"

(New)Construct : "WBConstruct00000011"
Public_name	"adIs1240"
Other_name	"[eat-4::gfp]"
Summary	"[eat-4::sGFP]"
Fusion_reporter	"GFP"
Driven_by_gene	"WBGene00001135"
Reference	"WBPaper00030960"


(Original) Transgene : "WBTransgene00000017"
Public_name	"ajIs1"
Summary	"[pgp-5::gfp]"
Coinjection_marker	"pRF4[rol-6(su1006)]"
Construction_summary	"Integrated from BC10030 sEx864."
Reporter_product	"GFP"
Driven_by_gene	"WBGene00003999"
Driven_by_gene	"WBGene00006767"
Integration_method	"Gamma_ray"
Integrated
Reference	"WBPaper00002968"
Reference	"WBPaper00031023"

(New) Transgene : "WBTransgene00000017"
Public_name	"ajIs1"
Integration_method	"Gamma_ray"
Integrated
Reference	"WBPaper00002968"
Reference	"WBPaper00031023"
Integrated_from "WBTransgene00002030"
Construction_summary	"Integrated from BC10030 sEx864."

(New) Transgene : "WBTransgene00002030"
Public_name "sEx864"
Synonym "[pgp-5::gfp]"
Synonym	"[C05A9.1::gfp]"
Summary "[rCesC05A9.1::GFP + pCeh361]"
Extrachromosomal
Construct "WBConstruct00000017"
Construct "WBConstruct00000018"
Construct "WBConstruct00000002"

(New) Construct : "WBConstruct00000018"
Public_name "pRF4"
Summary "[rol-6(su1006)]"
Gene "WBGene00004397"

(New) Construct : "WBConstruct00000017"
Public_name "[pgp-5::gfp]"
Summary	"[rCesC05A9.1::GFP]"
Reporter_product	"GFP"
Transcriptional_fusion
Driven_by_gene	"WBGene00003999"

(New) Construct : "WBConstruct00000002"
Public_name "pCeh361"
Summary "[pCeh::DPY-5]"
Clone "pGEM-5
Construction_summary "A 3.3-kb Nco I fragment containing a predicted cuticle 
  collagen gene was isolated from the cosmid F27C1, and cloned into the Nco I 
  site of pGEM-5 to generate the clone pCeh361"
Reference	"WBPaper00027361"

(New) Transgene : "WBTransgene00000600"
Public_name	"hIs2"
Summary	"[DPY-5::GFP + rol-6(su1006) + pBluescript]
Construct  "WBConstruct00000003"//pCeh358
Coinjection "WBConstruct00000004"//PCes1943
Coinjection "WBConstruct00000005"//pBluescript KS
Construction_summary	"Transgenic animals were generated by microinjection of 
  pCeh358 (5 ng/ul) and pBluescript KS (100 ng/ul), or in combination with 50 ng/ul 
  pCes1943, which carries a dominant rol-6 mutation [rol-6(su1006)] used as a 
  morphological marker for successful transformation"

(New) Construct : "WBConstruct00000003"
Public_name	"pCeh358"
Summary	"[dpy-5::gfp]"
Driven_by_gene	"WBGene00001067"
Gene	"WBGene00001067"
Clone "pPD95.69"
Translational_fusion
Construction_summary	"The dpy-5::gfp reporter construct pCeh358 was 
  generated by insertion of a 750 bp Sph I fragment from pCeh361 into the 
  Sph I site of the gfp expression vector pPD95.69 (kindly provided by A. Fire). 
  This fragment contains 5' sequences from an Sph I site in the polylinker of 
  pCeh361 to a site 30 bp downstream from the predicted DPY-5 initiator methionine, 
  resulting in an in-frame fusion of the first 12 codons of dpy-5 with gfp."
Reference	"WBPaper00027361"

(New) Construct : "WBConstruct00000004"
Public_name	"pCes1943"
Summary "[rol-6(su1006)]

(New) Construct : "WBConstruct00000005"
Public_name	"pBluescript KS"


Expression objects with construct

Examples of two expression objects pertaining to sequence feature


Example 1:

Expr_pattern : "Expr11377"
Anatomy_term	"WBbt:0005813" Certain
Anatomy_term	"WBbt:0008588" Certain
Anatomy_term	"WBbt:0008589" Certain
Gene	"WBGene00001948"
Life_stage	"WBls:0000003"
Life_stage	"WBls:0000041"
Pattern	"enh-1 was preferentially active in the posterior C and D lineages 
 in the embryo and in body wall musculature in the adult."
Reference	"WBPaper00032967"
Reporter_gene
Construct	"WBConstruct00000020"
Associated_feature "WBsf047531"

Construct : "WBConstruct00000020"
Public_name "enh-1 reporter"  
Summary	"[hlh-1.enh-1::pPD107.94]"  
Clone "pPD107.94"                     //added clone back to the construct model
Sequence_feature "WBsf047531"
Gene	"WBGene00001948"
Fusion_reporter	"GFP"
Construction_summary "Enhancer region enh-1 for hlh-1 cloned into the 
 (del)Pes-10 basal promoter vector pPD107.94"
Reference	"WBPaper00032967"



Example 2:

Expr_pattern : "Expr11284"
Anatomy_term	"WBbt:0006894"
Gene	"WBGene00001185"
Pattern	"The distal enhancer activity was observed in P6.p and its 
 descendants from the two-cell stage and increased with time. Distal 
 enhancer activity persisted into much later stages than the proximal 
 enhancer did."
Reference	"WBPaper00005841"
Reporter_gene
Construct	"WBConstruct00000021"
Associated_feature "WBsf919543"


Construct : "WBConstruct00000021"
Public_name "[Distal egl-17 enhancer reporter]" 
Summary	"[egl-17.distal::pPD122.53]" 
Clone "pPD122.53"                        //added clone back into model
Sequence_feature "WBsf919543"
Gene	"WBGene00001185"
Fusion_reporter	"GFP"
Construction_summary "Distal egl-17 enhancer inserted into the pPD122.53 vector."
Reference	"WBPaper00005841"



Expression objects described with reporter fusions that do not have a classical Ex transgene designation


Expr_pattern : "Expr11598"
Anatomy_term	"WBbt:0003681" Certain
Anatomy_term	"WBbt:0005772" Certain
Anatomy_term	"WBbt:0006749" Certain
Gene	"WBGene00001753"
Life_stage	"WBls:0000023"
Life_stage	"WBls:0000041"
Pattern	"gst-5 was expressed in intestine, pharynx, and circumpharyngeal neurons. 
 Expression is seen from L1 to adult stages."
Reference	"WBPaper00037704"
Reporter_gene	 
Construct	"WBConstruct00000021"


Construct : "WBConstruct00000021"  
Public_name	"Expr11598_Ex"  
Summary	"[GST-5::GFP]"
Driven_by_gene	"WBGene00001753"
Gene	"WBGene00001753"
Fusion_reporter	"GFP"
Translational_fusion
Reference	"WBPaper00037704"

Interaction/Regulation objects pertaining to construct

Interaction/Regulation object contains sequence feature. I am not sure shall we add construct tag in our model? --Xiaodong


Interaction : "WBInteraction000502404"
Change_of_expression_level
Interactor_overlapping_gene      "WBGene00004077" Trans_regulator
Interactor_overlapping_gene      "WBGene00004077" Variation "WBVar00241166"
Interactor_overlapping_gene      "WBGene00009560" Trans_regulated
Interactor_overlapping_gene      "WBGene00009560" Expr_pattern "Expr4287"
Feature_interactor       "WBsf034247" Cis_regulator
Interaction_summary      "psa-3 expression was induced in T.p after the T cell division, and it accumulated in the posterior granddaughters. psa-3 expression in the T.p lineage was much lower in a pop-1 hypomorphic allele, q645."
Reporter_gene    "[psa-3::gfp], translational fusion"  //This information seems like it can be captured by transgene or construct now, I am missing why this tag is needed. 
//Xiaodong's comments: this tag has been in the model from the beginning. it's just curator's simple notes on reporter construct. if you notice, I don't even have the precise reporter description. 
//We can leave as it is now, and let user have a rough idea of what reporter looks like. if they want, they can go to check construct for detailed info.
//If the construct model goes in, the webteam can port the construct summary to this interaction page, likewise with the transgene summary info. But I leave it to you.
Construct       "WBConstruct00000021"
Transcriptional 
Positive_regulate        Anatomy_term "WBbt:0006996"
Positive_regulate        Anatomy_term "WBbt:0006997"
Positive_regulate        Anatomy_term "WBbt:0007000"
Positive_regulate        Anatomy_term "WBbt:0007007"
Positive_regulate        Anatomy_term "WBbt:0007012"
Positive_regulate        Anatomy_term "WBbt:0007016"
Paper    "WBPaper00027741"Remark   "POP-1 function is known to be required in T.p to determine the neural fate. POP-1 regulates psa-3 expression in T.p through the POP-1 binding site."


Construct : "WBConstruct00000021"
Public_name "osEx113"
Summary	"[Ppsa-3::psa-3::gfp]"
Construction_summary	"psa-3 promoter and the entire coding region."
Reporter_product	"GFP"
Driven_by_gene	"WBGene00009560"
Gene	"WBGene00009560"
Translational_fusion
Reference	"WBPaper00040473"
Reference	"WBPaper00027741"