Finding influenza: the data are out there, let's get them, activity 3

I was pretty impressed to find the swine flu genome sequences, from the cases in California and Texas, already for viewing at the NCBI.

You can get them and work them, too. It's pretty easy. Tomorrow, we'll align sequences and make trees.

Activity 3: Getting the swine flu sequence data

1. Go to the NCBI, find the Influenza Virus Resource page and follow the link to:

04/27/2009: Newest swine influenza A (H1N1) sequences.

2. You'll see a page that looks like this:

i-8f46c115d6bc63eb88bd5cf05666d87f-Picture 19.png

Each column heading is a name of a segment of the influenza genome. You can see there are eight of these. Each segment codes for different proteins.

For those of you who are used to thinking about genomes being in one piece or being DNA, flu is kind of fun. Not only is the genome broken up into 8 pieces, those pieces are RNA. And, a copy of that RNA has to be made before the information can be translated into protein.

Anyway, you can get anyone of the sequences, either the protein or the nucleic acid, by clicking the linked accession number.

In the next post, we'll see how to use these and make phylogenetic trees.

More like this

No more delays! BLAST away! Time to blast. Let's see what it means for sequences to be similar.  First, we'll plan our experiment.  When I think about digital biology experiments, I organize the steps in the following way: 
Shotgun sequencing refers to the process whereby a genome is sequenced and assembled with no prior information regarding the genomic location of any of the DNA we sequence. There are quite a few steps that you have to go through before you have an assembled genome sequence.
A few weeks back, we published a review about the development and role of the human reference genome. A key point of the reference genome is that it is not a single sequence.
What tells us that this new form of H1N1 is swine flu and not regular old human flu or avian flu? If we had a lab, we might use antibodies, but when you're a digital biologist, you use a computer.