Welcome to sgenomics.org
Had a few discussions in the lab today where we required an ontology or set of Gene Ontology terms to describe a protein complex rather than its constituents. I've seen ontologies for constituent parts of the cell but not for more transient complexes. In part the need for this should be guided by the likes of the systems biology graphical notation framework where the idea of units of activity come together to form distinct activities in cooperation. However, I have still haven't seen an actual ontology of protein complexes.
SLiMSearch is now available with a faster implementation time. The new service is available on the bioware dev server - ping me for details on access. The new service utilises a lookup table of pre-compiled results. Keep an eye out for the SLiMSearch paper coming out in the next month or two.
The manuscript describing the work of the ProteomeBinders subgroup on data standards and exchange formats has finally been published. This paper focuses on the minimum information required for a protein affinity reagent experiment. The work was led by Sandra Orchard and Henning Hermajakob at the EBI. I've been interested in ontologies and data standards for some time, so its great to be able to put this into practise and help out on an important and emerging data format. The work is based off the well established PSI-MI (Molecular Interaction) data standards. This is built on to encompass the exchange of information about proteomics experiments that use affinity reagents such as antibodies.
The paper is available here.
I have seen a bunch of papers by the same author recently where I felt that the text was repeated between papers. In order to test out this idea I ran the text of these papers through some compression algorithms to see if the text was repeated significantly between papers.
The results will appear here again soon.
Quite a few people in the new Lab need access to the ELM Server. Thankfully, they provide a Web Service to connect to their database. We have developed a java client to this service and provided some example code to go along with it. At the moment all the code is in an eclipse project so we are just sharing the whole project with all of the required library jars and axis generated code. However, in the future we'll try to tidy that up a bit.
The code is available at UCD SVN Repo.
Its the time of year again where I visit Cambridge and learn about the changes in the Distributed Annnotation System. The conference was hosted as always at the Genome Campus. This year there isn't as much changing in the system as much as there is consolidation of existing systems. The talks focused on client development this year and it seems much more focus is on the delivery of data. Many of the talks introduced client libraries and talked about data representation. This is an encouraging direction for the system to take. The scale of some of the new DAS servers is pretty incredible. The enCore project brings together around 20 different bioinformatics groups in Europe. These massive EU framework programs can be pretty scary. However, given the amount of money that they attract they are able to provide services that are not possible for other smaller projects or groups to acheive. One such example is the easyDAS project. They offer free hosting for small projects that want to share data via a DAS server.
Late last year I reviewed a book entitled "What a time I am having - Selected letters of Max Perutz" for the journal BioEssays. The review is out now and available from here Link out to Bioessays site. I won't bother adding much more here since its available there. But it was fun reviewing the book and I would recommend the process (and the book) to others.
The book is available on Amazon.
I gave my first talk yesterday at the UCD Bioinformatics Seminar Series. A couple of points. One I must remember to start using some kind of slide sharing service in order to preserve my talks. I'm impressed by the ability of others to maintain a record of their public speaking events.
The other major point that emerged from this was a discussion of Open Data and the challenges that opens up. I've already expressed an interest in hosting DAS services or Webservices on cloud infrastructure and I'll hopefully get some time to work on this in a couple of months time. I came across this article today on the topic of Open Data and it reminded me to restart a couple of data sharing projects. Hosting these in the cloud is really attractive for researchers since it dramatically lowers the costs and most of the time the amount of usage expected is lower than the pay threshold for these services.
I'll be putting the SLiMFinder code up on a svn server at some point in the near future. Hopefully that will spur some increased development of that code. There is a mailing list on googlegroups as well if people are interested.
I've been following the work of Vincent Rouilly at the Parts Registry in MIT and trying to get the DAS server dazzle working on the Google App Engine. So far it seems to work though I can't seem to get the datasources working correctly. More updates to follow.
The Advanced Interfaces Group in Manchester has been developing tools for biologists for some time. The structure viewer Cinema and tools like Ambrosia have been demonstrated to be useful in the past. I thought I would draw attention to their latest offering Utopia Documents available from here:
It's pretty powerful and for my money should be your default reader for PDF's replacing preview or Acrobat.