Latest

Presentation slides from Evolution 2013 available

Open Tree of Life at meetings

The Open Tree of Life project is one of the many phylogeny projects that are featured during the Evolution 2013 meeting that currently takes place in Snowbird (UT). The presentation slides from Karen Cranston, the principal investigator of Open Tree of Life, are available online (LINK). Presentation slides from other investigators are added here in the upcoming days.

Evolution 2013 is the joint annual meeting of the Society for the Study of Evolution (SSE), the Society of Systematic Biologists (SSB), and the American Society of Naturalists (ASN). The conference meets jointly with the iEvoBio conference. Open Tree of Life is represented at both events. About 1400 participants are expected to share their research in evolution, systematics, biodiversity, software, and mathematics.

Free webinar: Putting all species in a graph database

Biology + Technology = OTOL

Neo4j screenshotOne of the developers of the Open Tree of Life demonstrates Thursday, during a free webinar, how graph databases are used to construct a tree of life. The lecture is organized by Neo Technology, which is the maker of Neo4j, an open-source database that is used for OTOL.

Stephen Smith, an ecology and evolutionary biology professor at the University of Michigan, is going to explain how Neo4j and other digital technologies are assisting in constructing the tree of life. Starting at 10:00 PDT (19:00 CEST), he will also discuss other aspects of the interface of biology with next generation technologies.

“Our project is building the tools with which scientists in the community can continually improve the tree of life as we gather new information. Neo4j allows us to not only store trees in their native graph form, but also allows us to map trees to the same structure, the graph. So in fact, we are facilitating the construction of the graph of life,” says Smith.

Neo4j approached the Open Tree of Life team to present a webinar because it is a project that utilizes the Neo4j graph database to represent the interconnectedness of biological data. The company considers the project a great example of how a graph database can better model the natural world.

The online lecture is intended for a broad audience including beginner computer programmers, advanced hackers, data scientists, natural scientists, and anyone interested in the cross-section of science and technology, especially data modeling. Over 150 people have already registered online.

The registration form: LINK

Update: The video from this webinar is available on vimeo: http://vimeo.com/67870035

Karen Cranston talks about open licensing and Dryad

Making scientific data available

 

In case you haven’t seen the video yet, Open Tree of Life investigator Karen Cranston talked about sharing research data in open access data repositories during the Creative Commons 10-year celebration in Raleigh, North Carolina, last December. She mainly focused on the use of CC0 for Dryad, which is a curated general-purpose repository that makes the data underlying scientific publications discoverable and freely reusable. Cranston also mentioned that data availability leads to more citations, which is highly valued in the academic community. You can access the presentation slides as well (link).

Interview with Open Tree of Life investigator

Crandall featured on PeerJ blog

Keith Crandall for bio low resOpen Tree of Life investigator Keith Crandall is featured on the blog of PeerJ, which is a peer-reviewed, open access journal on the Internet. Crandall is an Academic Editor for PeerJ and is the director of the Computational Biology Institute at George Washington University. He was the editor for the “living fossil” manuscript that got much news media attention last week. Here’s the link to the interview.

Building an API for the Open Tree of Life database

Do you want an app for this?

Screen Shot 2012-08-29 at 9.22.20 PMThe developers of the Open Tree of Life would like to know from the phylogenetic community what kind of information they want to extract from its database when the first draft is released later this year. With those preferences, it is possible to develop an API that gives scientists the opportunity to build their own websites or software packages that use the data.

An API (application programming interface) is a digital tool that allows one website or software program to “talk” to another website to dig up certain pieces of data. For instance, a lot of people use Tweetdeck to navigate the ongoing bombardment of messages in the Twittersphere. In that case, Tweetdeck is connecting to Twitter, through its API, to receive and order the messages according to the preferences of the user.

In case of the Open Tree of Life, an API gives researchers advanced access to the data of about two million species, the phylogenies that have been created to illustrate possible relationships between them, and the underlying data and methods of synthesis. “For example, it will be possible to select smaller trees for specific species or find out how many studies there are for a particular node within the database,” says Karen Cranston, the lead investigator of the project. Read the rest of this page »

Connecting millions of data points in a graph database

Creating ‘Facebook’ for species

Neo4j screenshotThe Open Tree of Life database is not just a list with about two million species. Information is added about their special characteristics and possible relationships with others as well. “It may become tens or hundreds of million pieces of data when we are all done.”

Stephen Smith, an evolutionary biology professor at the University of Michigan, is working together with the other researchers of the Open Tree of Life project to develop the programs and tools that will be used to construct the full tree of life. Scientists from all over the world can then synthesize all the information in the database.

“We are currently building the back-end of the Open Tree of Life. We need to create software that allows us to put all our information in a graph network, so that we can easily retrieve the information that researchers are specifically looking for.” Read the rest of this page »

“We need a sense of ownership of phylogenetic trees”

Where are the fungi datasets?

FungiA couple thousand fungi phylogeny studies have been published in the past twelve years. Clark University postdoc researcher Romina Gazis has gone through all of them. Now she is working on a bigger challenge: finding all the trees and datasets that were the foundation of those studies.

Ideally, all scientists who publish a phylogenetic tree would also deposit the datasets they used to create such trees at a publicly available online database. That allow other researchers to synthesize data from different sources to advance the knowledge about relationships between certain species and their evolutionary history.

Unfortunately, most of those datasets are not publicly available. Gazis only found datasets for about a quarter of the two-thousand fungi articles she surveyed. “Around 600 studies had tree files available, but not necessarily complete,” she concluded. “Some scientists deposited one but not all the trees.” Read the rest of this page »

Tree of Life: Are big changes looming on the horizon?

All species like some gadgets

Photo by PublicDomainPictures (Creative Commons Deed CC0)While movie hero James Bond gets his spy gadgets from his loyal developer Q, almost every other species on Earth has to put a little more effort in armoring themselves. But that does not mean they cannot rely on some good ol’ friends to do so. In fact, the acquisition of genes from two or more species through lateral gene transfer can lead to innovations that at times can be painful—sometimes even deadly—to others.

One of those evolutionary novelties is noticeable for certain types of jellyfish that developed the ability to sting after their ancestors acquired a gene from a bacterium and incorporated that material in their own DNA. This gene transmission helped jellyfish to create an innovative defense tool to fend off other species that could endanger them. The result is quite frightening: more humans get killed by jellyfish than sharks. Read the rest of this page »

Open Tree of Life featured in weekly science journal

tree02Four investigators of the Open Tree of Life project and four postdoc researchers affiliated with the project have published an article in the latest issue of Nature, an international weekly journal of science. They have concluded that, unfortunately, most phylogenetic trees and nucleotide alignments from the past two decades have been irrevocably lost. (“Data deposition: Missing data mean holes in tree of life”)

Click here for the the website of Nature  (subscription required).

Small portion of phylogenetic data is stored publicly

‘The glass is still pretty empty’

Sometimes you wonder whether the glass is half full or half empty.

But when it is only filled for four percent—the other 96 percent is just air—there is only one conclusion: it is time for more.

At least that is what some scientists in the phylogenetic community argue, because only about four percent of all published phylogenies are stored in places such as TreeBASE or Dryad. Their message is quite simple: it is time to bring together more databases with estimations on how species are possibly related to each other.

Several journals in the evolutionary biology field recently adopted policies that encourage or require contributors to make their data publicly available online. Yet, this only leads to the storage of a very small percentage of ten-thousands of phylogenies that have been constructed in the past few decades.

Of course, there are also other ways to receive data that are not stored on the Internet, but those alternatives are commonly not the most efficient routes. For instance, it is possible to send an email to a scientist who published a phylogenetic tree and “sometimes wait for six months to maybe get a response—either with or without the data,” says Keith Crandall, one of the Open Tree of Life investigators and the founding director of the Computational Biology Institute at George Washington University.

Read the rest of this page »

%d bloggers like this: