Nematode resequencing and diversity
This is a community page for registering information about future and pending nematode (re)sequencing projects. Please describe briefly the species, population or isolate you will be (re)sequencing, the technology you will be using, the status of the project, and contact information.
Contents
Solexa Resequencing of two Wild C. elegans Isolates
- Rationale
- Understanding natural population structure in C. elegans. This is a pilot for a larger study.
- Source material
- Two C. elegans isolates isolated from geographically distinct locations.
- JU258 (Madeira, Portugal; collected by M.A. Felix)
- ED3040 (Johannesburg, South Africa; collected by E. Dolgin).
- Technology
- Solexa sequencing, 35 cycles. Roughly 5x coverage.
- Data Release
- Complete as of October 2007. We are running another roughly 5x for each strain to determine the effects of read depth on alignment/assembly quality.
- Contact
- Lincoln Stein
- Asher Cutter
Solexa Resequencing of C. elegans CB4858
- Rationale
- Get a strain other than CB4856 for SNP mapping, for behavioral (or other) phenotypes that cannot be mapped using CB4856.
- Source material
- CB4858 (fify-eight)
- Technology
- Solexa sequencing, roughly 7x coverage
- Status
- Complete
- Contact
- Elaine Mardis, Washington University Genome Sequencing Center, St. Louis
Solexa Resequencing of C. elegans CB4856
- Rationale
- Source of SNPs for behavioral (or other) phenotypes.
- Source material
- CB4856 (Hawaiian)
- Technology
- Solexa sequencing, 5-7x coverage
Status
- In progress, data coming in as of 7/10/2007.
- Contact
- Marco Marra, University of British Columbia
Solexa Resequencing of C. elegans PB306
- Rationale
- missing
- Source material
- PB306 (North America)
- Technology
- Solexa sequencing
- Status
- 5x coverage complete, as of April 2007. This run had very high error rates associated with our old Solexa machine's technical problems. We have since received a replacement machine from Illumina that is working much better. We do not plan to re-do PB306, however, until after we have completed sequencing from mutation-accumulation lines.
- Contact
- Dee Denver, Oregon State U.
Solexa Resequencing of C. elegans N2 and C. elegans DR1350
- Rationale
- Mark Viney in Bristol has been mapping quantitative trait loci in recombinant inbred lines between N2 and DR1350. In order to identify the likely quantitative trait nucleotides underpinning the QTL in the RIL crosses, we sequenced DR1350 to ~16 fold depth. As some of the DR1350 genome (large segments of chromosomes) appear to be derived from an N2-like background, we also sequences N2 to 16 fold to affirm single nucleotide polymorphism calls in these regions, and to assess the reliability of indel and SNP calls in the DR1350 data.
- Source material
- DR1350 (a PA3 derivative) and N2 (Mark Viney's copy of this strain from CGC; we call this N2Viney, as N2Bristol would cause far too much confusion!)
- Technology
- Solexa sequencing using a GAI instrument, initially with 36 base reads and latterly with 44 base reads.
- Status
- We aimed for equal coverage over both genomes, and achieved a mean of 14 fold depth. These data were complete as of June 2008; we are currently (Nov 2008) in the process of analysing them. If anyone wants a look see, email me (Mark Blaxter)
- Contact
- Mark Blaxter, University of Edinburgh, UK
Other Information
(This is quoted from a letter from Marie-Anne Felix dated April 11, 2007; it is a placeholder until this page grows.)
After a round of e-mailing, taking account of available data, especially sequencing and SNP data from Dee Denver, Elie Dolgin, Asher Cutter and Matt Rockman (most data are unpublished), it seems that a consensus for resequencing C. elegans isolates is something like the following, in decreasing order of priority:
- CB4856/Hawaii is apparently being done by Waterston (info by Jim Thomas).
- JU258 Madeira
- MY2 Germany
- KR314 Vancouver, BC, Canada
- MY6 Germany
- AB1 Australia
- PB306 N America (exact origin unknown)
- ED3040 South Africa
- PS2025 Altadena, CA, USA
- MY3 Germany
- JU322 France
I can give a justification for this set if it is of any use. It basically maximizes diversity. It also covers four continents, but basically there is no large-scale geographic structure in the C. elegans species. (just found one elegans strain in Japan, but no sequence data yet)
Tell me if you need any information (like a general rationale for resequencing, or for choosing the strains). I can provide the strains which are not at CGC (I think ED3040 is the only one).
Sequencing of new Caenorhabditis species
A proposal to sequence up to seven additional genomes and cDNAs of Caenorhabditis species has been submitted to NIH as part of the modENCODE project, organized by Fabio Piano (worms) and Peter Cherbas (flies), with contribution from K. Kiontke, P. Sternberg, R. Waterston, D. Fitch, A. Cutter and M.-A. Félix. The Priority Group 1 species sequencing has been funded (October 2008).
Priority Group 1:
Caenorhabditis sp. 9 (JU1325)
Caenorhabditis sp. 11 (JU1373)
Caenorhabditis sp. 7 (JU1199)
[Priority Group 2 was:Caenorhabditis sp. 5 (JU727), Caenorhabditis sp. 10 (JU1333), two species among: Caenorhabditis sp. 6, C. sp. 3, C. drosophilae, C. sp. 8, C. sp. 2.]