Value Added Testing (or "Merry Christmas, Novak")

By drorzel on December 28, 2005.

One of the more contentious recurring topics around here over the years has been education policy, mostly centering around the question of teacher evaluation and teacher's unions. It's probably the subject for which there's the biggest gap between my opinions and those of some of my regular readers.

As this is a good time of year for peace and reconcilliation, let me point to this guest post at Calpundit Monthly, in which Paul Glastris talks about the problems of "gifted" kids under the "No Child Left Behind" system, and pushes a Washington Monthly article on Value-Added Testing. The idea here is to evaluate teachers and schools based on how much the students improve over the course of a year, rather than how well they meet a single absolute standard. This avoids penalizing schools and teachers who are stuck with student populations that come in poorly prepared, and also provides some incentive to push the more advanced students (who tend to be relatively neglected in a single-absolute-standard system).

It's not a perfect evaluation method-- it increases the number of required tests, and opens the possibility of rewarding schools that turn out lots of students who are below grade level, but not as far below as when they came in, which isn't really a stisfying outcome (not to mention perverse statistical dodges like telling kids to bomb the pre-tests, to inflate the "value added"). It's a step in the right direction, though, and some combination of absolute standards and value-added testing might lead to an acceptable assessment method.

The basic idea is pretty standard among people who do physics education research (see, for example, the well-known groups at Harvard, Washington, and NC State). I've done this with my intro classes the past few years-- you give a pre-test in the first week of class, and a post-test at the end, and measure student improvement using the "normalized gain," which is the post-test score mins the pre-test score, divided by the maximum score minus the pre-test score:

                     (Post-Pre)
               Gain = --------
                      (Max-Pre)

This is designed to account for the fact that a one-point gain for a student who starts with a 25/30 is a bigger achievement than a one-point gain for a student who started with a 5/30.

(The test we use is designed so that the scores are generally pretty lousy-- I think the pre-test average nationally is somewhere around 10/30-- and the gain from a "traditional" lecture-format course is usually around 25%. Classes based on "active learning" are found to do a better job, with gains of 40-60%.)

It's a reasonably effective technique, though you can question how much it actually proves about anything (and people do, at length). It's also fiendishly difficult to come up with good tests for this purpose, and it's not hard to find people who complain about even the well-established tests that are out there.

(Let me note, by the way, that I'm not prepared to whole-heartedly endorse everything that Glastris says in his post. In particular, I'm not especially disturbed by a drop in the fraction of students scoring as "advanced" in math as they move up in grade level-- it should be harder to score as "advanced" at the higher levels. I'm also a little skpetical of the oft-cited claim that some sizable fraction of high school drop-outs are really "gifted" kids who are too bored to continue-- in a lot of the anecdotes people use to illustrate this, the problem seems to be more a matter of psychological issues than boredom.

(I do agree, though, that the focus on bringing the bottom kids up to speed-- "triage, rather than teaching," in the phrase of one of the Washington Monthly commenters-- tends to lead to the better students being slighted, particularly in distracts with limited resources. The "gifted" program at my old school was an on-again-off-again thing while I was there (existing only when enough parents made enough noise to get it funded), and was shut down for good several years back. Programs for the "special needs" kids at the bottom end of the class are legally mandated, and have actually expanded over the same span.)

(Originally posted at my steelypips blog.)

More like this

Kids These Days: Is Our Learning Measure Valid?

Kevin Drum has done a couple of education-related posts recently, first noting a story claiming that college kids study less than they used to, and following that up with an anecdotal report on kids these days, from an email correspondent who teaches physics. Kevin's emailer writes of his recent…

Active Learning Experiment: Nearly the End

As noted in previous posts, I've been trying something radically different with this term's classes, working to minimize the time I spend lecturing, and replace it with in-class discussion and "clicker questions." I'm typing this while proctoring the final exam for the second of the two classes I'm…

More Evidence of How Value-Added Testing Fails at Teacher Evaluation

Last week, E.D. Kain took Megan McArdle to task for promoting the use of student testing as a means to evaluate teachers. This, to me, was the key point: ....nobody is arguing against tests as a way to measure outcomes. Anti-standardized-tests advocates are arguing against the way tests are being…

Advanced Placement: Threat or Menace?

Kevin Drum reports receiving an email from a professor of physics denouncing the Advanced Placement test in Physics: It is the very apotheosis of "a mile wide and an inch deep." They cover everything in the mighty Giancoli tome that sits unread on my bookshelf, all 1500 pages of it. They have seen…

Advertisment

Donate

ScienceBlogs is where scientists communicate directly with the public. We are part of Science 2.0, a science education nonprofit operating under Section 501(c)(3) of the Internal Revenue Code. Please make a tax-deductible donation if you value independent science communication, collaboration, participation, and open access.

You can also shop using Amazon Smile and though you pay nothing more we get a tiny something.

Science 2.0

Science Codex

More by this author

Go On Till You Come to the End; Then Stop

October 31, 2017

ScienceBlogs is coming to an end. I don't know that there was ever a really official announcement of this, but the bloggers got email a while back letting us know that the site will be closing down. I've been absolutely getting crushed between work and the book-in-progress and getting Charlie the…

Meet Charlie

October 30, 2017

It's been a couple of years since we lost the Queen of Niskayuna, and we've held off getting a dog until now because we were planning a big home renovation-- adding on to the mud room, creating a new bedroom on the second floor, and gutting and replacing the kitchen. This was quite the undertaking…

Physics Blogging Round-Up: August

September 1, 2017

Another month, another set of blog posts. This one includes the highest traffic I think I've ever seen for a post, including the one that started me on the path to a book deal: -- The ALPHA Experiment Records Another First In Measuring Antihydrogen: The good folks trapping antimatter at CERN have…

The Age Math Game

August 22, 2017

I keep falling down on my duty to provide cute-kid content, here; I also keep forgetting to post something about a nerdy bit of our morning routine. So, let's maximize the bird-to-stone ratio, and do them at the same time. The Pip can be a Morning Dude at times, but SteelyKid is never very happy to…

Kid Art Update

August 13, 2017

Our big home renovation has added a level of chaos to everything that's gotten in the way of my doing more regular cute-kid updates. And even more routine tasks, like photographing the giant pile of kid art that we had to move out of the dining room. Clearing stuff up for the next big stage of the…