Original research

No more delays! BLAST away! Time to blast. Let's see what it means for sequences to be similar.  First, we'll plan our experiment.  When I think about digital biology experiments, I organize the steps in the following way:             A.  Defining the question B.  Making the data sets            C.  Analyzing the data sets D.  Interpreting the results I'm going intersperse my results with a few instructions so you can repeat the things that I've done below.  I've some people writing that only experts should be analyzing data.  But  I disagree with those who say that sequence…
In which we identify unknown human proteins. Yesterday, I wrote about using the BLOSUM 62 matrix to calculate a score for matches between two proteins. Those scores give us a good start on understanding how blastp determines whether two sequences are matching by chance or because they're more likely to be related. But that's not all there is to calculating a blast score, and there's at least one other statistic to consider as well, the E value. It all comes down to biochemistry The BLOSUM 62 matrix is based on the substitutions that really do or do not happen in real protein sequences. I…
Let's play anomaly! Most of this week, I've written about the fun time I had playing around with NCBI's Blink database and finding evidence that at least one mosquito, Aedes aegypti, seems to have been infected at some point with a plant paramyxovirus and that the paramyxovirus left one of its genes behind, stuck in the mosquito genome. During this process, I realized that the method I used works with other viruses, too. I tried it with a few random viruses and sure enough, I found some interesting things. You've got a week to give it a try. Let's see what you find! The method is…
Lots of bloggers in the DNA network have been busy these past few days writing about Google's co-founder Sergey Brin, his blog, his wife's company (23andme), and his mutation in the LRRK2 gene. I was a little surprised to see that while other bloggers (here, here, here, and here) have been arguing about whether or not the mutation really increases the risk to the degree (20-80%) mentioned by Brin, no one has really looked into the structure and biochemistry of the LRRK2 protein to see if there's a biochemical explanation for Parkinson's risk. I guess that task is up to me. Let's begin at…
Do mosquitoes get the mumps? Part V. A general method for finding interesting things in GenBank This is the last in a five part series on an unexpected discovery of a paramyxovirus in mosquitoes and a general method for finding other interesting things. In this last part, I discuss a general method for finding novel things in GenBank and how this kind of project could be a good sort of discovery, inquiry-based project for biology, microbiology, or bioinformatics students. I. The back story from the genome record II. What do the mumps proteins do? And how do we find out? III.…
Part IV. Assembling the details and making the case for a novel paramyxovirus This is the fourth in a five part series on an unexpected discovery of a paramyxovirus in a mosquito. In this part, we take a look at all the evidence we can find and try to figure out how a gene from a virus came to be part of the Aedes aegypti genome. image from the Public Health Library I. The back story from the genome record II. What do the mumps proteins do? And how do we find out? III. Serendipity strikes when we Blink. IV. Assembling the details of the case for a novel mosquito paramyxovirus V. A…
Part III. Serendipity strikes when we Blink In which we find an unexpected result when we Blink while looking at the mumps polymerase. This is the third in a five part series on an unexpected discovery of a paramyxovirus in mosquitoes. And yes, this is where the discovery happens. I. The back story from the genome record II. What do the mumps proteins do? And how do we find out? III. Serendipity strikes when we Blink. IV. Assembling the details of the case for a mosquito paramyxovirus V. A general method for finding interesting things in GenBank To paraphrase Louis Pasteur,…
Part II. What do mumps proteins do? And how do we find out? This is the second in a five part series on an unexpected discovery of a paramyxovirus in mosquitoes, and a general method for finding interesting things. I. The back story from the genome record II. What do the mumps proteins do? And how do we find out? III. Serendipity strikes when we Blink. IV. Assembling the details of the case for a mosquito paramyxovirus V. A general method for finding interesting things in GenBank In Part I, we looked at the NCBI SeqViewer, and found a new way to check out a genome map, and learn more…
Part I. The back story from the genome record Together, these five posts describe the discovery of a novel paramyxovirus in the Aedes aegyptii genome and a new method for finding interesting anomalies in GenBank. I. The back story from the genome record II. What do the mumps proteins do? And how do we find out? III. Serendipity strikes when we Blink. IV. Assembling the details of the case for a mosquito paramyxovirus V. A general method for finding interesting things in GenBank I began this series on mumps intending to write about immunology and how vaccines work to stimulate the immune…