WBConfCall 2014.07.03-Agenda and Minutes

From WormBaseWiki
Jump to navigationJump to search


Enter Topics Here

Apologies from Mary Ann

WS245 Schedule

Proposed Hinxton release Friday 29th August

Proposed upload date Friday 1st August (- Sunday 3rd)

Models 3 weeks prior Friday 11th July

Person evidence vs Publication evidence

(Karen, Kimberly)
Questions have come up about what to do with user submitted evidence that can also be extracted from a publication. In particular, when users submit data through an online form, that information is captured and entered with the submitter of the data as person evidence.

Q1. If those data are then published by the same person, should the person evidence be deleted and replaced by the publication as evidence, or should those data retain the person as evidence as well as having the publication attached?

[Mary Ann - here are my thoughts. I support the latter, though looking at the Evidence attached to CGC_name is does not look like the website is displaying it anyway (a separate issue). In any case, I think there is no harm in leaving Person_evidence there. It adds weight to our curation.]

Q2. If and when person and publication are both used as evidence, are there ways to highlight publication evidence over person evidence in our various data output modes (website, GAF files, WormMine queries, scripted concise descriptions, etc.)?

[Mary Ann - I think this is a good idea, though it must be obvious to users that publications take precedence over people. But if you're talking about ensuring that Papers are listed at the top of a list of Evidence values then yes, I support that]

Resurrection of the second all-hands conference call?

A few months ago, we changed the second conference call of the month to a more focussed smaller-group meeting on a specific topic. This seems to have failed (we've only had one of these). The results of a poll at Hinxton suggested a feeling that communication has become worse since we went to a once-a-month schedule. Proposal then: reinstate the second all-hands conference call. Topic-based calls for smaller groups can be arranged outside the main calls when the need arises.

Quick Models Update

3 Models for WS245 addition (Will be tagged next Friday 11th as this is 3 weeks prior to upload for WS245 if this is the agreed schedule)

1) The long awaited ?Construct class - Karen

Is it much different to the working version we had for WS244? http://cvs.sanger.ac.uk/cgi-bin/viewvc.cgi/wormbase/wspec/models.wrm?revision=1.404&root=ensembl&view=markup

[Karen - not much different at all.]

2) Clone Class additional ?Sequence connections to remove ambiguity - Paul

 Sequence ?Sequence
 End_sequence ?Sequence

3) Interaction class - Chris

Addition of "DNase_I_footprinting" to detection methods.


WS245 release schedule looks good.

CVS Tagged next Friday July 11

Person evidence vs Publication evidence

Karen : Keep person evidence + subsequent publication evidence ?

Michael P : Yes, may as well keep all evidence.

Karen : Can we prioritize one evidence over another ?

Kevin : Not in the schema.

Todd : Sure on the website, can sort by date.

Karen : Prefer publications taking precedence.

Todd : We can do that as well.

Kevin : Evidence hash is a tree of tags, shouldn't be a problem.

Karen : Can we prioritize display of data based on the type of evidence it has ?

Kimberly : How to communicate to users data that only has person communication as evidence. How should users cite WormBase ? How should we make it clear to users that some is published and some isn't.

Todd : Tooltip for evidence, show it's provenance.

Michael P : Take care if the Person with evidence is not the author of the Publication.

Paul S : How many cases of someone communicating data and not publishing for years ? (we don't know)

Kimberly : Author just as Paper or also Person evidence ?

Paul D : Use both and also accession evidence.

Karen : User submission forms are personal communication ?

Kimberly : Yes, if there is no paper.

Karen : Sort by timestamp clearer ?

Raymond : Doesn't seem right for the paper not to get priority. We don't need to worry about ordering, keep it simple.

Todd : Agree with Raymond, rely on users to look at data + evidence and make their own decisions.

Kevin : We'll need to show them the date it was added then.

Todd : Yes

Kimberly : And also tell them what does Person_evidence means.

Raymond : To display add prefix to say Personal Communication.

Kimberly : Caltech has used the tag that way, but maybe not everyone has ?

Kevin : Can add a new Personal_communication tag to #Evidence. Pop-up / tooltip seems good.

Todd : Pop-up / tooltip would be good for all #Evidence tags.

Karen : To summarize, new tag Personal_communication, Caltech will dump to that. Web display will be clearer and not prioritized.

Kimberly : Also explain what Curator_confirmed means. Do we create different Phenotype objects when publication is different from prior personal communication ?

Paul D : If they publish something different, the personal communication was probably not correct, ask them for clarification.

Karen : Need to keep both because another user might have used the original data based on personal communication.

Paul D : If different could treat as conflicting data from different people.

Gary : In Phenotype might mean that the data is variable and need to keep both.

Kevin : How do we deal with retractions when the data is no longer correct ? We have timestamps but don't show them to users.

Raymond : People often give us new data, but don't retract the previous data.

Paul D : What should we do when they do retract it ?

Ranjana : Add a 'retracted' label.

Paul S : How often does this happen ? It might disincentivize user submissions. We're not changing meeting abstract information.

Karen : For Phenotypes it will work itself out, but for Transgenes they have to be corrected and have remarks.

Raymond : Keep all previous WS website releases so users can see why they used that.

Paul S : Not optimal to keep so many releases running.

Raymond : Good issue to consider for database migration. There's a difference between tracking typos and real data changes, would be good to have a way to tag.

Kimberly : Date_last_updated tag to track significance.

Daniela : Change OA to track it.

Juancarlos : We could, but need the future database to support it.

Raymond : Talk more at Caltech meeting.

Kimberly : Add to wiki definitions of evidence tags that people can edit.

Resurrection of the second all-hands conference call

Kevin : Hinxton thinks we could use an extra call.

Paul S : Yes, let's do it.

Paul K : Why hasn't the database call been happening.

Kevin : People have been working on different things, need someone dedicated to work on the future database.

Paul K : Hinxton is stuck with the duty.

Todd: notes added after minutes being posted: I don't recall Paul K saying that Hinxton is stuck with the duty. If he did in fact say that it is 100% untrue. In fact, OICR and Caltech have dedicated substantive resources to the project. Neither team currently has the luxury of devoting an FTE to the project as they already have other duties to attend to.

Paul S : New person starting at Hinxton in August. Have 2nd call on the third week, and discuss database afterward.

Models Update


Paul D : Construct class, how is it going ?

Karen : It's great. Small tweak compared to cvs. Mary Ann okayed it.

Paul D : A lot of tags removed.

Karen : Need more rigorous testing, change a tag name. Can get it by the 11th.

Paul D : Will incorporate changes, submit tweaks next week.


Paul D : Component sequences are end sequence, want an end sequence tag.

Chris : Good idea.


Paul D : Adding a new Detection Method, looks fine.


Michael P : More crossreference to Genes + CDS + similar data, to link to Ensembl. Is there a wiki for external databases ?

Paul S : External resources at the bottom ?

Raymond : External links widget.

Chris : Page with more information somewhere.

Todd : Friends link on footer. External page would get lost.

Raymond : Database class links to external database front page.

Todd : Good idea to have overview of what we do and what resources we trust.

Raymond : Better if maintenance-free.

Kevin : UCSC supports genome hub in their genome browser, gives us control of nematode sequence releases.