State of sequencing technology in 2010

Dan Koboldt has a very nice recap of the various sequencing technologies presented at last week's Advances in Genome Biology and Technology meeting. I totally agree with his central point:

"Something had been bothering me about the sequencing-company presentations this year, and I finally realized what it was. During AGBT 2009, every player was gunning to take over the world. This year it seems like every sequencing platform has a niche in mind."

The recent proliferation of sequencing technologies - each with its own characteristic profile of strengths and weaknesses - has been bewildering, especially given the excessive hype being sprayed around as companies seek to raise venture capital and drown out their competitors. However, I think Dan's right that the market is now openly segmenting as each platform seeks to find the applications that best fit its strength/weakness profile.
As one notable example, it's very clear now that the third-generation single molecule sequencing technology developed by Pacific Biosciences - originally touted as being a replacement for second-generation platforms - will be restricted to niche applications (rapid confirmation of variants discovered by another technology, and supplementing second-gen sequencing in the assembly of novel genomes) for the foreseeable future due to its low yield and high error rate.
Anyway, if you're interested in how the sequencing field is starting to play out, go and read Dan's post.

I doubt PacBio will find much use for validating small variants found with other platforms; why would you use a less reliable system to validate a more reliable one?

On the other hand, in addition to being useful in sequencing novel genomes (or cleaning up previously sequenced ones), the very long read technologies -- even with high point substitution & indel rates -- could be very useful for elucidating detailed structural variation in human (both normal and cancer) and other well-studied genomes. It's clear that many of the processes generating structural variation make reference-guided alignment problematic (because the breakpoint regions may have stretches looking nothing like the reference), & so you end up doing assembly -- and having long reads will be valuable for that.
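The breakpoint problem can be illustrated with a toy sketch (hypothetical sequences and coordinates, not real data): a short read straddling a novel insertion matches nothing in the reference, while a long read spanning the whole event still carries alignable flanks on both sides.

```python
# Toy illustration of why breakpoint-spanning reads defeat reference-guided
# alignment: the inserted sequence looks nothing like the reference, so only
# reads long enough to anchor on both flanks can place the event.

reference = "ACGTACGTTTGACCGATTACAGGCATCGTA"
novel     = "GGGGCCCCGGGGCCCC"   # inserted sequence absent from the reference
breakpoint = 15
sample = reference[:breakpoint] + novel + reference[breakpoint:]

short_read = sample[10:22]   # 12 bp read straddling the breakpoint
long_read  = sample[5:45]    # 40 bp read spanning the entire insertion

# The short breakpoint read aligns nowhere in the reference...
assert short_read not in reference

# ...but the long read still has alignable flanks on both sides of the
# novel sequence, anchoring the insertion despite its unalignable middle.
left_flank  = long_read[:breakpoint - 5]
right_flank = long_read[breakpoint - 5 + len(novel):]
assert left_flank in reference and right_flank in reference
```

Real structural variant calling is of course messier (repeats, sequencing errors, imprecise breakpoints), but the principle is the same: the longer the read, the more anchor sequence survives on each side of the event.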

Hi Keith,

I was also skeptical about the applications for validation, but at least one major genome facility is already doing just that. I gather the plan is to pull down fragments spanning a whole set of candidate variants, circularise them, and then do multiple-pass sequencing (i.e. rolling circle) with PacBio. It looks like the errors in PacBio are almost exclusively randomly distributed indels, so if you get five-pass coverage of a given fragment your error rate will be pretty low - and importantly for validation, the PacBio error mode is entirely orthogonal to the dominant error mode in Illumina or SOLiD.

And yes, you're absolutely right about structural variants; the strobe reads may well prove useful for resolving these. However, I'm holding off until I've seen some raw data from the platform before getting too excited about this.