Search News


Browse Archives

News

A Call for Assessment -- of the Right Kind

January 8, 2009

Share This Story

FREE Daily News Alerts

Advertisement

As the intensity of discussion and conflict about higher education accountability and assessment of student learning have grown in recent years, the Association of American Colleges and Universities has often sought to play the role of Switzerland. To the politicians and policy makers clamoring for colleges to prove their value, the liberal education group has cautioned against oversimplified measurements and the importance of institutional diversity.

But to college leaders who've advocated the rope-a-dope strategy, doing nothing while awaiting the passing of a fad or the end of a certain presidential administration, AACU has persistently prodded academic leaders to take seriously the need to look within and improve the teaching and learning that takes place in their classrooms.

As leaders of the group sought to say something now about where the accountability and assessment movement should go next, they found that a near-perfect statement of their views already existed -- and that they had made it five years ago.

Does the fact that the statement the AACU released Wednesday, "Our Students' Best Work: A Framework for Accountability Worthy of Our Mission," is based largely on a 2004 statement mean that nothing of note has happened in the intervening years? That there has been so little progress that the issues have not changed? That AACU was ahead of its time?

Perhaps all of the above. The initial statement came out at a time when state legislatures and the Bush administration, fresh off the imposition of the standardized-test-based No Child Left Behind law in elementary and secondary education, were raising the idea of taking a similar approach in higher education, Carol Geary Schneider, the association's president, said during a telephone news conference Wednesday. It was important at that time, she said, for the association to clearly state "that there was no way that standardized testing alone could provide a full picture" of the many kinds of learning colleges seek to impart to their students, and that was one of the 2004 report's two major themes, Schneider said.

But the other theme was just as important: making it clear that it would be a mistake for colleges to ignore the demand for more and better information about student outcomes, and that "too many institutions and programs still are unable to answer legitimate questions about what their students are learning in college."

Five years later, higher education has been through what Schneider called a "shallow discussion about assessment and accountability," prompted by the work of Education Secretary Margaret Spellings' Commission on the Future of Higher Education. That discussion left a divide at the extremes, she said, between some higher education leaders who believe college-level learning is too complex to be gauged and those who take a "send a number to somebody soon" approach, arguing that colleges must find some results from a nationally or at least widely comparable test (or tests) they can report publicly to satisfy the thirst for proof of their satisfactory performance.

AACU, in contrast, believes that colleges must "actually use assessment to improve student learning," and its re-released report, updated and reinforced in several ways, is meant to be a roadmap to help both the higher education establishment and individual colleges figure out how to do that. By arguing both for the idea that colleges must measure student learning and strongly against the use of "mass testing," which it argues would be an "enormous misuse of time and scarce resources," the AACU statement seeks to walk a middle path in the contentious debate over whether and how colleges should measure and report student learning.

The AACU Framework

Most of the statement will probably succeed in building a consensus. "The public has questions about the quality of education that colleges and universities are providing, and it deserves to know how well students are doing," the report says. "It is time for leaders of education to embrace a set of highly valued and widely affirmed educational goals, establish high standards for each, and assess their achievement across the curriculum."

That cannot be done, AACU says, through the use of "a particular kind of standardized testing -- multiple-choice, 'one-best-answer' tests" -- that advocates (and the testing industry) often champion, because "[a]ssessing what students have learned in colleges and universities requires a sophisticated understanding both of context and of how knowledge and skills are to be used.... Even if better tests continue to be developed, standardized tests alone are an inadequate and inappropriate strategy to foster advanced learning and accountability in higher education."

AACU's preferred approach, it writes, on which it (and other groups like the Teagle Foundation) have been working in recent years, is a several-step process in which (1) educators would define the "essential outcomes" that students should be expected to master from liberal education and the standards; (2) individual institutions would create a set of rubrics to establish goals and levels of performance for themselves and their students in achieving those outcomes, at the institutional, department and program levels; (3) colleges would build those goals and standards into their curriculums, particularly to be met by the time students do "culminating work" "that will both cultivate advanced knowledge and skill and demonstrate students' cumulative learning," most likely through the use of electronic portfolios of their academic work; and (4) create processes for making public the goals, performance expectations, and levels at which students performed.

It is on the last point -- the extent to which AACU envisions colleges using their student learning assessment as a way of proving their performance to their various publics -- that those who have been pushing for a more quantitative (and comparable) method of judging higher education are most likely to find fault with association's statement, which AACU officials more or less acknowledge.

"There are different purposes for assessment and AAC&U's focus is primarily on the 'improving learning and programs' and 'helping students be able to demonstrate their achievement for employment' purposes rather than the purpose of 'comparing institutions to make funding decisions or determine a 'return on investment' of public funds," Debra Humphreys, the association's vice president for communications and public affairs, said in an e-mail message. "We do, however, believe that it would be possible to use the methods we are recommending to create a public report that could go to a legislator and present a picture of aggregate accomplishment that a legislator could, indeed, use to compare to other institutions -- in the same way they could use data [from the National Survey of Student Engagement] to compare the existence of high-impact practices across institutions."

For instance, the report notes, a campus could, based on the standards it has set, determine that "75 percent of the students in the college of arts and sciences met a proficient standard for analytical skill and collaborative problem-solving;" it could then make that finding public along with information about how it collected the information and reached that conclusion.

"Like standardized testing, this method will allow for summarizing the outcomes of student learning with a few scores," the statement says. "But unlike tests based on quick responses to multiple-choice questions, these will be summaries of higher-order skills such as communication, analytic ability and integration of knowledge, and it will reflect meaningful educational projects judged by professionals."

Left Unsaid

The statement says nothing about two of the biggest developments that have occurred on the higher education assessment and accountability front since the 2004 statement: the emergence of the Collegiate Learning Assessment, in which small, representative groups of students typically complete a series of exercises to measure critical thinking, analytic thinking and written communication to measure how much an institution adds to their learning over the course of their careers there, and the creation of the Voluntary System of Accountability, which two groups of state universities have established to try to make their performance on a range of fronts more transparent to the public.

Despite the statement's silence, it implicitly criticizes both endeavors. While it is not the sort of "multiple-choice, 'one-best-answer' tests" directly panned in the AACU statement, the CLA is a standardized measure, in that it is designed to be used across institutions in ways that allow them to be compared. And by requiring participating colleges to use either the CLA or comparable standardized measures created by the Educational Testing Service and ACT, the Voluntary System of Accountability can be seen as an attempt to try to satisfy politicians' thirst for accountability -- to "send a number to somebody soon," as Schneider put it.

When asked about the VSA during the news conference, she went out of her way to praise the accountability system as "a good framework," to note that AACU is working with the two public university groups on a major Education Department grant, and to say that her colleagues at the associations "would be first to say that one test is not enough" to fully measure student learning.

David E. Shulenburger, vice president for academic affairs at the National Association of State Universities and Land-Grant Colleges, one of the VSA's sponsors, said in a brief e-mail message (sent while he was traveling abroad) that the groups encourage participants to "use a complete package of assessments," and thus "does not encourage universities to rely 'solely' on a single test." But he also noted, importantly, that "VSA's use of an outcomes test creates a rough comparability among participating universities and enhances public accountability" -- suggesting that the public university groups clearly see that as a more central priority than does the AACU statement.

The statement's emphasis on assessment for internal improvement purposes rather than external accountability purposes was also singled out as its biggest weakness by Charles Miller, the Texas businessman who headed the Spellings Commission and has argued the need for more higher education accountability.

Miller praised the AACU for continuing to raise these issues and credited its report for "accepting the idea that you can make measurements" of student learning. But he said he had trouble understanding from the statement how reams of term papers and other student work from electronic portfolios, much of which would be assessed subjectively, could be turned into the sort of "quantitative and comparable" evidence of student learning that would really serve the "public purposes" he sees.

"How are they going to tell us how well they did?" Miller said. "You may be able to use it internally for improving teaching and learning, but unless it helps everybody else -- the public, legislators, taxpayers -- it’s not going to answer the question. Defining the skills, setting the goals, measuring the performance -- that's all a step forward, for them. But without some kind of comparability and some kind of quantitative, specific results, it doesn't help the rest of us."

Miller said he was struck by the statement's repeated condemnation of standardized measures, given that colleges lean heavily on standardized tests to decide whom to enroll and whom to admit to graduate school. "They are used throughout the academic process for life and death decisions -- about students," he said. "But they refuse to consider the use of those kinds of tests to measure their own performance. Ironic."

See all postings »
Advertisement
Advertisement

Matching Jobs

Comments on A Call for Assessment -- of the Right Kind

  • Accountability is essential in life...that includes education.
  • Posted by feudi pandola on January 8, 2009 at 8:20am EST
  • Standardized testing is not an "imposition" as is stated in this article. For most science-based disciplines, there is no other rational method of measuring a student's knowledge base. Many fields of study, such as medicine, nursing, even accounting, require students to pass rigorous standardized testing at the completion of the program. This process provides, at least, a modicum of public confidence in the abilities of the professional class.

    When one considers the enormous resources used up in higher education, and the horrible completion rates we have, the question should not be the efficacy of standardized testing, but rather how we can expand it rationally to include all majors.
    This may be difficult in some fields due to the subjectivity of the material, but these difficulties can be overcome if we all put on our thinking caps and get to work rather than gaze at our navels...to steal a line from Sister Mary Magdalene.

  • The way to do it that both sides miss. . .
  • Posted by Cliff Adelman , Senior Associate at Institute for Higher Education Policy on January 8, 2009 at 10:05am EST
  • Is to follow the European example under the Bologna Process and develop degree qualification frameworks that spell out what students must "demonstrate" to earn a degree, and repeat the process, following the Bologna "Tuning" template, in each of the disciplines in which degrees are awarded. A parallel alternative is the benchmarking of outcomes in each discipline illustrated by the UK's Quality Assurance Agency. In all these cases, students who don't "demonstrate," who don't meet thresholds of public, criterion-referenced learning outcome statements, don't earn degrees. Period! Once such a structure is promulgated, you don't need summative tests of any kind, and certainly not those that rely on a sample of 100 volunteers. You can use formative assessments in the way AACU has advocated, an idea that's been around since we started taking assessment seriously in the 1980s.

    Does the Bologna Process and Tuning approach standardize curriculum or delivery? Does it make the chemistry or history major at Old Siwash a carbon copy of that at East Greentree? Hardly. The qualification frameworks set thresholds of performance. The Tuning process sets reference points for criterion-referenced learning statements of both subject matter knowledge and its application and generic cognitive skills that faculties then develop within their institutional contexts. Tuning has already been adapted in 182 universities, 12 disciplines, and 18 countries in Latin America---and that's in addition to the hundreds of institutions and 25 disciplines using it in Europe. Maybe we can learn something from the way other countries approach this challenge? Both sides of the argument have missed this to date, but we will shortly see the beginnings of its exploration in the U.S.

    For a more detailed description of all this, with examples of what degree qualification frameworks, Tuning, and benchmarking look like, look at the Institute for Higher Education Policy's "The Bologna Club: What U.S. Higher Education Can Learn from a Decade of European Reconstruction," at www.ihep.org/research/GlobalPerformance.cfm

  • An answer to Miller's question
  • Posted by David Shupe at eLumen Collaborative on January 8, 2009 at 10:05am EST
  • Charles Miller's central question -- how is it possible to generate comparable quantitative data out of the process that AAC&U is recommending? -- has already been answered and described in the professional literature of higher education -- most recently in two special issues of the journal On The Horizon (volume 15, no. 2, 2007; and volume 16, no. 2, 2008).

  • Posted by Ross Miller , Sr. Director on January 8, 2009 at 10:05am EST
  • Mr Pandola's reference to low graduation rates makes a better case for formative, locally developed assessments than for the continued use of poorly aligned, standardized tests. Low performing students need good teaching as a dose of motivation. A standardized test score, delivered a semester or two after adminstration, to the 100 or so students taking the test does little to help individuals graduate.

    Ross Miller
    AAC&U

  • What's missing...
  • Posted by Stephen Langendorfer , Director, general education at Bowling Green State University on January 8, 2009 at 10:05am EST
  • I offer two reactions/thoughts to Doug Lederman's provocative article. I was surprised that Doug mentions the CLA but overlooked the AAC&U's Valid Assessment of Learning in Undergraduate Education (VALUE) project that has identified 16 "metarubrics" that hold the promise of providing a nation-wide qualitative assessment process of student's work. I found myself equally perturbed at the comments from Charles Miller who plainly "doesn't get it!" His steadfast refusal to acknowledge that the assessment of student work especially in portfolios using objective measures derived from the VALUE metarubrics is, in fact, exactly the answer to his "How are they going to tell us how well they did?" question. In fact, as AAC&U illustrates, more locally based assessment processes are much more valid because they can take into account individual institutional goals, outcomes, values, and context, something that a "one-size-fits-all" standardized test can never do.

  • recycling problems
  • Posted by Glen S. McGhee , Dir., at Florida Higher Education Accountability Project on January 8, 2009 at 10:25am EST
  • The continuing focus on student learning assessment diverts attention from other proven indicators of educational quality. It is also doubtful that feedback from learning metrics can be utilized by institutions that see themselves as perfect, or as most nearly perfect, and not essentially incomplete or otherwise flawed.

    For example, NCLB provisions dealing with out-of-field teacher assignments have had a direct positive impact on that problem at the elementary and secondary levels. Maybe higher ed needs something similar to address its OOF problems. Instead of focusing solely on student outcomes, higher ed needs to be more honest about its educational quality problems, including the staggering variance from classroom to classroom, from classroom to distance learning, from community college to university.

    In many cases this includes the surrender of dual enrollment oversight to the school districts. It is an open question whether student learning in such programs, which award college credit to high school students, would be subject to these kinds of initiatives.

    It is telling that these proposals were first discussed 5 years ago, well before the Spellings Commission was convened. The fact that these proposals remain the same in the face of the public clamor for accountability and transparency tells me that the lack of institutional reflexivity is the first hurdle that must be overcome before these criticisms can be addressed.

  • Definitions and validity
  • Posted by Lee Griffin on January 8, 2009 at 11:25am EST
  • I believe some fundamental concepts are being overlooked in this discussion. First, most assessment authorities advocate that an assessment process begin by defining what it is we want students to learn. And although each separate test and portfolio assessment may have completed that step successfully for itself, we lack a standard, nationally-accepted definition of what we mean by "critical thinking" and "effective communication." A serious attempt at assessment of higher cognitive skills would have to begin with a national dialog about definitions and constructs.

    Second, merely defining the terms does little to make them useful as the basis for assessment. The definitions have to be operationalized through application to actual samples of student work that illustrate levels of proficiency. Education professionals by and large accomplish this step in the assessments they develop, and they are willing to make the criteria public and transparent. Testing companies, however, never reveal how they operationalize the definitions they use. Without that information, it hard to know what is actually being tested, as opposed to what is claimed in promotional literature.

    Third, the definition of validity jointly adopted by the American Psychological Association (APA), the National Council on Measurement in Education (NCME), and the American Educational Research Associations (AERA) calls for validity to be established in relation to both theoretical constructs and empirical research in the discipline. Could any of the current tests and other assessments be judged valid under this premise of the three largest, most respected educational research associations? Probably not, unless such theoretical constructs and empirical research exist, but are simply underreported.

    Although this post sounds obstructionist in nature, it is not intended as such! I believe assessments of higher cognitive skills that are both valid and reliable can be developed. I even believe they can be developed in a cost-effective, time-efficient manner that would support faculty development and improvement in teaching. The first step, though, is to deal with the problem of exactly what we want students to learn in terms of theory and research. Until that understanding is established on a national basis, assessments will have little meaning beyond local contexts.

  • Assessment - of the right kind
  • Posted by George on January 8, 2009 at 11:55am EST
  • Any sell assessment is the right kind of assessment. I have been pre and post testing my students for the past 10 years using either a test I have generated where there are no national standard tests or a national standard test (the Force Concept Inventory.) I have presented results at the American Association of Physics Teachers meetings that have been well received. Is there any interest among the tenured faculty members of my department? None. Their silence has been complete. Offers to present at the local colloquium have been turned down while more sophmoric efforts of the new tenured faculty have been highlighted. The internal university "Assessment Conference" didn't even have the courtesy of telling me my paper was not accepted until I called them two days before the poster papers were to be given. Given these circumstances, I can only conclude that most assessment projects are just for show and not for improvement. When is the last time someone was given tenure at a major university because of their efforts in improving their teaching?

  • Don't Fear the CLA
  • Posted by Joe on January 8, 2009 at 1:40pm EST
  • The politicians and the public would like to have a fairly simple measure to see if colleges are doing their job. When you start to talk about rubrics and portfolios the eyes of politicians glaze over and they're gone.

    However, the CLA offers a very simple point of entry measurement and then a point of departure measurement. Even a very short attention-span can understand those results. If that is what they want then what do we have to fear with giving it to them? Do any of us work for institutions that DON'T believe we are imparting basic mathematics, problem solving, etc skills?

    So I'd say let's do both. On the one hand we can give them a CLA-esque exam to satisfy the need for a simple to measure outcome AND we can also implement more sophisticated outcomes measures as befits our varied institutional needs.

  • Assessment
  • Posted by Fred Flener , Retired on January 8, 2009 at 7:40pm EST
  • The trouble with replying from the Left coast is that most of the potential readers have left--which of course may be seen as a positive by many who don't read my comments.

    My question is "Why the big hang-up with assessment?" I know we pontificate on the beauty of critical thinking, analysis, and other higher level skills (although if it is a "skill" can it be "higher level thinking?"). Yet we reward contestants on Jeopardy for answering clues correctly and quickly at a higher financial level than we do for Nobel laurates. I even watched the "dumbest" of all shows "Deal or No Deal" (please forgive me for wasting the hour)and a contestant turned down something like $350,000. Huh?

    To be honest, if my students could determine whether a certain differential equation can be used to solve a type of physics problem, I would be happy as a clam. Whether they could do it faster than someone at, say, Temple was on no interest. Somehow, I knew what I wanted my students to know, and I did a reasonable job of seeing if they did. I honestly didn't care if Dick Durbin, my Senator, knew that they knew. (Incidentally, I think Durbin is a very intelligent politician--see, it is not necessarily an oxymoron.) The only way the Durbin's will understand which school produces the best product is to have a national Jeopardy contest, with really, really tricky questions (actually, these are answers ("Was the first Secretary of State under Andrew Jackson." I honestly don't have a clue as to who that was, but I am sure someone smarter than me will know.) How many person-hours have we spent trying to find some way to assess students so that someone who has no idea what it means is happy. Or in words of a well-known president, "Is our children learning?"

  • Accountable to whom
  • Posted by Dale , English instructor at Roanoke-Benson High School on January 9, 2009 at 11:15am EST
  • Fred Flener appropriately and wisely asks, "To whom is the education industry trying to prove its worth?" (Forgive me, Fred, for paraphrasing your discussion.) We can gather all the data we want, but neither the public nor government understand what the data mean. If they did, we wouldn't find ourselves required to produce students who are all average or above--yes, all of them--by 2012, no less. That's what the so-called No Child Left Behind legislation requires. Any government that would propose and pass flawed legislation like NCLB is intellectually incapable of understanding the data we give them.

  • Bologna/Tuning
  • Posted by Robin , Assessment/Effectiveness Coordinator on January 12, 2009 at 9:30pm EST
  • I hope a few readers out there will actually look at Adelman's "Bologna Club." In effect, it takes the AACU work to the next level, showing what happens when institutions decide they actually CAN work together and that there's reason to. It's quite a roadmap.