Difference between revisions of "Software Life Cycle: 1. Updating The Development Server"

Revision as of 21:25, 6 January 2009

This is a quick description of how to update the development server with a new release of the database. In general, the development process involves mirroring a large number of files from Sanger, unpacking, and in many cases, massaging into a format suitable for driving the website.

Development Server

The current development server is

 brie3: brie3.cshl.org / dev.wormbase.org

Update Pipeline Code

The update pipeline code is available in the WormBase admin module:

 brie3> svn co svn+ssh://brie3.cshl.org/usr/local/svn/admin

Change into the development directory:

 brie3> cd admin/update/development

The contents are:

  logs/      -- the logs directory for each step/update
  README.txt -- nothing 
  steps/     -- Perl scripts that launch each step
  Update.pm  -- the top level module for the update process
  Update/    -- Perl modules corresponding to each step
  updatelog.conf  -- a configuration file for the update process
  update.sh  -- master script that fires off each step of the pipeline
  util/       -- various helper scripts for the update process

A single shell script fires off all steps of the process.

 brie3> ./update.sh

The steps it performs are:

Purge disk space
Create necessary directories
Mirror and unpack ACeDB from Sanger
Mirror ontology files from Sanger
Compile ontology resources for the site
Create BLAST databases for available species
Create BLAT database for available species
Create ePCR databases for select species
Load genomic GFF databases for available species
Build and load GFF patches
Convert GFF3 into GFF3
Create a GBrowse-driven genetic map
Create a GBrowse-drive physical map
Create dump files of common datasets
Load the CLUSTALW database
Mirror annotation files from Sanger to the FTP site

Each step is described below.

Update Steps

Purge Disk Space

Remove obsolete files from the (staging) FTP site. These have already been mirrored to the production FTP site and do not need to be maintained on the development server.

 Usage : ./steps/purge_disk_space.pl

Create necessary directories

Create staging directories for the update process.

 Usage : ./steps/create_directories.pl {WSRELEASE}

Mirror and unpack ACeDB from Sanger

Mirror and unpack the new release of the database from Sanger. Add in appropriate control files for the new acedb database: serverpasswrd.wrm, passwrd.wrm, serverconfig.wrm pulled from the checked out development source (/usr/local/wormbase/wspec).

Files will be mirrored and unpacked to /usr/local. Please make sure that there is sufficient space in this directory! You will most likely need approximately 25 GB of disk space. Possible places to free up disk space:

 /usr/local/mysq/data
 /usr/local/acedb/tmp
 ~{you}/mp3s

 Usage : ./steps/create_directories.pl {WSRELEASE}

Note: This can take a *long* time. You might to run this in a screen:

 brie3> screen
 brie3> ./steps/mirror_acedb.pl WS198
   (to disconnect your screen)
 brie3> ^a ^d
   (to resume your screen)
 brie3> screen -r

When complete, check to make sure that the following directory and symlink exist:

 brie3> /usr/local/acedb/elegans -> elegans_{WSRELEASE}

 file checklist at the end of the run:

Mirror ontology from Sanger

Mirror OBO files from Sanger.

 Usage: ./steps/mirror_ontology_files.pl {WSRELEASE}

Compile ontology resources for the site

Take the mirrored files and compile them into the databases for the ontology searches.

 Usage: ./steps/compile_ontology_resources.pl {WSRELEASE}

Resulting files list in /usr/local/wormbase/databases/{WSRELEASE}/ontology:

anatomy_association.WS197.wb
gene_association.WS197.wb.ce
gene_ontology.WS197.obo
name2id.txt
search_data.txt
anatomy_ontology.WS197.obo
gene_association.WS197.wb.cjp
id2association_counts.txt
parent2ids.txt
gene_association.WS197.wb
gene_association.WS197.wb.ppa
id2name.txt
phenotype_association.WS197.wb
gene_association.WS197.wb.cb
gene_association.WS197.wb.rem
id2parents.txt
phenotype_ontology.WS197.obo

Create BLAST databases for available species

Notes...

 Usage: ./steps/create_blast_databases.pl {WSRELEASE}

Create BLAT databases for available species

Notes...

 Usage: ./steps/create_blat_databases.pl {WSRELEASE}

Create ePCR databases for available species

Notes...

 Usage: ./steps/create_epcr_databases.pl {WSRELEASE}

Load genomic GFF DBs for available species

Notes...

 Usage: ./steps/load_genomic_gffdb.pl {WSRELEASE}

Build and Load GFF patches

Notes...

Convert GFF2 into GFF3

Notes...

 Usage: ./steps/convert_gff2_to_gff3.pl {WSRELEASE}

Create a GBrowse-driven genetic map

Notes...

 Usage: ./steps/load_gmap_gffdb.pl {WSRELEASE}

Create a GBrowse-driven physical map

Notes...

 Usage: ./steps/load_pmap_gffdb.pl {WSRELEASE}

Create dump files of common datasets

Notes...

Load the CLUSTALW database

Notes...

 Usage: ./steps/load_clustal_db.pl {WSRELEASE}

Mirror annotation files from Sanger to the FTP site

Notes...

 Usage: ./steps/mirror_annotations.pl {WSRELEASE}

Create BLAST databases for available species
Create BLAT database for available species
Create ePCR databases for select species
Load genomic GFF databases for available species
Build and load GFF patches
Convert GFF2 into GFF3
Create a GBrowse-driven genetic map
Create a GBrowse-driven physical map
Create dump files of common datasets
Load the CLUSTALW database
Mirror annotation files from Sanger to the FTP site

@@ Line 106: / Line 106: @@
    Usage: ./steps/compile_ontology_resources.pl {WSRELEASE}
-  resulting files list:
+Resulting files list in /usr/local/wormbase/databases/{WSRELEASE}/ontology:
 *anatomy_association.WS197.wb
-*gene_association.WS197.wb.ce, gene_ontology.WS197.obo, name2id.txt, search_data.txt, anatomy_ontology.WS197.obo, gene_association.WS197.wb.cjp, id2association_counts.txt, parent2ids.txt,
+*gene_association.WS197.wb.ce
-gene_association.WS197.wb, gene_association.WS197.wb.ppa, id2name.txt, phenotype_association.WS197.wb, gene_association.WS197.wb.cb, gene_association.WS197.wb.rem, id2parents.txt, phenotype_ontology.WS197.obo
+*gene_ontology.WS197.obo
+*name2id.txt
+*search_data.txt
+*anatomy_ontology.WS197.obo
+*gene_association.WS197.wb.cjp
+*id2association_counts.txt
+*parent2ids.txt
+*gene_association.WS197.wb
+*gene_association.WS197.wb.ppa
+*id2name.txt
+*phenotype_association.WS197.wb
+*gene_association.WS197.wb.cb
+*gene_association.WS197.wb.rem
+*id2parents.txt
+*phenotype_ontology.WS197.obo
 === Create BLAST databases for available species ===

Difference between revisions of "Software Life Cycle: 1. Updating The Development Server"

Revision as of 21:25, 6 January 2009

Contents

Development Server

Update Pipeline Code

Update Steps

Purge Disk Space

Create necessary directories

Mirror and unpack ACeDB from Sanger

Mirror ontology from Sanger

Compile ontology resources for the site

Create BLAST databases for available species

Create BLAT databases for available species

Create ePCR databases for available species

Load genomic GFF DBs for available species

Build and Load GFF patches

Convert GFF2 into GFF3

Create a GBrowse-driven genetic map

Create a GBrowse-driven physical map

Create dump files of common datasets

Load the CLUSTALW database

Mirror annotation files from Sanger to the FTP site

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools