Genomics and Evolution

By evolgen on June 15, 2006.

Andy Clark has written a review of comparative evolutionary genomics for Trends in Ecology and Evolution. His review deals with identifying functional regions of the genome and inferring which sequences are under positive or negative selection.

Clark is one of the leaders in the field of evolutionary genetics (and now genomics), actively participating in the analysis of both the human and Drosophila genomes. He also brings a solid understanding of biology, as well as an appreciation of statistical rigor. You can sense his excitement about the union of molecular biology and evolution in the following passage:

One of the most wonderful things about comparative genomics is that it has turned a whole generation of molecular biologists into evolutionists, full of excitement about the way that evolution has sculpted exquisite modifications to organismal genomes and eager to tell stories about it.

Clark is also cautious about the conclusions we can draw from the preliminary analyses completed thus far. He does not appear to be happy with the sloppy work of some investigators:

At the same time, one of its worst disasters is that it has created a hoard of genomics investigators who think that evolutionary biology is just fun, speculative story telling. Sadly, much of the scientific publication industry seems to respond to the herd as much as it does to scientific rigor, and so we have a bit of a mess on our hands. Fortunately, this is all a temporary aberration and, eventually, the noise will be separated from the signal, and progress will march on in understanding what genome sequence divergence really means.

I hope he is correct that a lack of rigorous analysis is a "temporary aberration". There are two explanations for the sloppy work Clark is describing: a lack of understanding of proper statistical procedures or a disregard for them. I hope that the problem stems from the former and not the latter. A lack of understanding can be overcome through education, whereas a disregard for scientific rigor requires a major shift in the entire field. We would have to convince researchers, reviewers, and publishers that research built on incomplete statistical analyses is not acceptable for publication until those analyses are completed.

Examples of such analyses are studies that look for conserved non-coding sequences. The authors of such studies argue that the conserved sequences are under purifying selection, but they fail to reject the alternative hypothesis that the sequences are conserved simply because of low mutation rates. Rejecting that alternative requires polymorphism data, and Clark applauds the researchers who are using polymorphism as well as divergence to detect selective constraint.
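To make the polymorphism-versus-divergence argument concrete, here is a minimal sketch in the spirit of a McDonald-Kreitman-style contingency test. It is not taken from Clark's review, and the counts are hypothetical. The reasoning it illustrates: a low mutation rate should reduce polymorphism and divergence by about the same proportion, so the ratio of polymorphic sites to fixed differences in the conserved region should resemble the ratio in a putatively neutral reference region. Weak purifying selection lets deleterious variants segregate but rarely fix, which inflates that ratio. The function name `fisher_exact_two_sided` and all variable names are my own.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact test p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    n = row1 + row2
    denom = comb(n, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    # Sum over every table that is no more probable than the observed one.
    return sum(p for p in map(prob, range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9))


# Hypothetical counts (assumptions for illustration, not real data):
# rows are the conserved non-coding region and a neutral reference region,
# columns are polymorphic sites and fixed differences between species.
conserved_poly, conserved_fixed = 30, 20
neutral_poly, neutral_fixed = 40, 120

p_value = fisher_exact_two_sided(
    conserved_poly, conserved_fixed, neutral_poly, neutral_fixed
)
print("polymorphism/divergence, conserved:", conserved_poly / conserved_fixed)
print("polymorphism/divergence, neutral:  ", neutral_poly / neutral_fixed)
print("two-sided p-value:", p_value)
```

A small p-value here would argue against the "mutational cold spot" explanation, since a cold spot alone predicts equal ratios. Real analyses also examine the allele frequency spectrum and must account for demography, which this sketch ignores.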

We also must be cautious when inferring positive selection using polymorphism data. Clark warns against using SNP data (such as that available in the HapMap project) because one falls victim to the problems associated with ascertainment bias. SNPs are fine for association studies, but the identification of genes under positive selection should be carried out using complete sequences. One must also control for the effects of demography on nucleotide sequence polymorphism by scanning multiple loci. Clark was a co-author on one such study which I briefly discussed here.
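The ascertainment problem can be illustrated with a short calculation. This is a sketch under simplifying assumptions, not a description of how HapMap SNPs were actually chosen: I assume the standard neutral expectation that the proportion of sites with derived-allele count i is proportional to 1/i, and I assume a SNP is "discovered" only if it is variable in a small discovery panel drawn at random from the full sample. The function names and the sample sizes are hypothetical.

```python
from math import comb


def neutral_sfs(n):
    """Expected neutral site frequency spectrum for n chromosomes.

    Entry i-1 is the proportion of segregating sites with derived count i.
    """
    weights = [1 / i for i in range(1, n)]
    total = sum(weights)
    return [w / total for w in weights]


def discovery_probability(i, n, m):
    """Chance that a site with derived count i (of n) varies in a panel of m."""
    only_ancestral = comb(n - i, m)  # panel happens to hold no derived allele
    only_derived = comb(i, m)        # panel happens to hold no ancestral allele
    return 1 - (only_ancestral + only_derived) / comb(n, m)


def ascertained_sfs(n, m):
    """Spectrum seen when only panel-discovered SNPs are typed in all n."""
    true = neutral_sfs(n)
    weighted = [
        p * discovery_probability(i, n, m) for i, p in enumerate(true, start=1)
    ]
    total = sum(weighted)
    return [w / total for w in weighted]


n_chromosomes = 40   # full sample size (assumed)
panel_size = 4       # discovery panel size (assumed)

true_spectrum = neutral_sfs(n_chromosomes)
seen_spectrum = ascertained_sfs(n_chromosomes, panel_size)
print("singleton share, full sequencing:", true_spectrum[0])
print("singleton share, ascertained SNPs:", seen_spectrum[0])
```

Because rare variants are unlikely to appear in a small discovery panel, the ascertained spectrum is depleted of low-frequency alleles. Tests for selection that rely on the frequency spectrum can be misled by that distortion, which is one reason to prefer complete sequences.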

Clark also describes how analyses of genome content (duplications, gene gain, gene loss, gene order, gene density) require evolutionary explanations, and he explains that complex models of mutation are necessary to describe the evolution of the genome. Mutation rates are heterogeneous and depend on the genomic region (both local and global) in which a nucleotide is found.

Genomics has become a huge enterprise, both in the public and private sectors. Almost all of the research in this area is carried out in a comparative evolutionary framework. In a sense, nothing in genomics makes sense except in the light of evolution.
