Finding influenza: the data are out there, let's get them, activity 3

I was pretty impressed to find the swine flu genome sequences, from the cases in California and Texas, already for viewing at the NCBI.

You can get them and work them, too. It's pretty easy. Tomorrow, we'll align sequences and make trees.

Activity 3: Getting the swine flu sequence data

1. Go to the NCBI, find the Influenza Virus Resource page and follow the link to:

04/27/2009: Newest swine influenza A (H1N1) sequences.

2. You'll see a page that looks like this:

i-8f46c115d6bc63eb88bd5cf05666d87f-Picture 19.png

Each column heading is a name of a segment of the influenza genome. You can see there are eight of these. Each segment codes for different proteins.

For those of you who are used to thinking about genomes being in one piece or being DNA, flu is kind of fun. Not only is the genome broken up into 8 pieces, those pieces are RNA. And, a copy of that RNA has to be made before the information can be translated into protein.

Anyway, you can get anyone of the sequences, either the protein or the nucleic acid, by clicking the linked accession number.

In the next post, we'll see how to use these and make phylogenetic trees.

More like this