Hinxton 2015.08- Meeting minutes
From WormBaseWiki
Jump to navigationJump to searchContents
August 2015
August 14, 2015
In attendance: KH, TD, PD, GW, BB, MP
Database migration
- Security discussion
- Script will authenticate to the application using self-signed SSL certificates
- What level of granularity for the certificates? Per center? Per user? Per update type (e.g. a separate cert for each build step)
- Transaction log
- Web app now allow unrolling of recent transactions
- Functionality added to detect when unrolling of a T impacts on Ts made subsequently (e.g. T1 makes object, T2 refers to object, user attempts to undo T1).
- Environment
- GW had trouble previously getting Datomic running on EBI machine on nfd-mounted storage.
- The free storage engine with Datomic assumes/requires low-latency (ideally directly attached) storage
- GW to try again on acedb.ebi.ac.uk using /nfs/acedb/vol1 (purportedly low-latency storage)
- 2-way XEFS with different attached # models at either end
- e.g. Gene -> Variation (#Evidence), Variation -> Gene (#Molecular_change)
- New db puts #model annotation of the edges of the graph, so does not currently support this.
- Discussed possible workarounds
- Loading Ace into Datomic
- Some testing done by GW, works well
Build 250
- Status
- BLAST - all 9 species run, dumped and loaded
- Compara - finished and loaded
- RNASeq - in progress
- General problems/fire-fighting
- Transcript builder (mysteriously did not complete for two genomes)
- InterProScan (found a bug in the Ensembl code, fixed and provided patch)
- BLAST dumping (exceeded MySQL max connections. Needed to tweak server conf)
ParaSite
- Redesign of homepage
- Beta version demonstrated. Will circulate some other possible layouts
- New blog for ParaSite
- wbparasite.wordpress.com
Data integration / curation
- Brugia
- Produced initial annotation of new assembly.
- MP presented to Brugia analysis group in conference call
- C. elegans
- DB_remark clean up -> Brief_identification, pseudogenes (will go in as a note in ENA submissions)
- Classification pseudogenes. Will need a new unitary pseudogene tag in ?Pseudogene model
- Consolidation of UniProt product names into the database
- SVM paper => gene curation
- Proteomics data from Lamond lab
August 6, 2015
In attendance: KH, PD, MP, TD, BB
Build 250
- In progress. MP doing all 9 core species (no precedent for this).
- First build from github code base (no problems so far)
- How long will BLAST for all 9 species take (all BLASTX and BLASTP will need to be re-done, since all of the worm proteomes have changed)
- Problems
- RNAis with bad DNA_text. Cleaned and reported to Caltech
- NCBI taxonomy database at EBI changed location, config needed a patch
Data integration / curation
- B. malayi
- WS248 Brugia submission now live at ENA
- Gene prediction on new assembly
- Looking at RATT transfers from old assembly
- Using HAL mapping to project genes also works quite well.
- AUGUSTUS to fill in the gaps
- Handover will be Monday 10th August (during Brugia conference call)
- C. remanei
- Bad data. Submitter does not have time right now to clean. Will be deferred from this build
- General
- Hawaain strain - will go in as T3. BLASTable.
- Cleaning up T2. Getting rid of in-frame STOPs for P. pacific us
- Cleaning up Brief_identifcation for protein-coding and pseudogenes.
Database migration
- New interface for uploading ace files.
- Discussed potential strategies for schema updates
ParaSite
- New release out
- BLAST
- Fixed glitches in new service
- Discussed potential minor improvements to font page
- Analytics
- Redesign of the home page. Make it more WormBase compliant.
- Search box confusion