Search News


Browse Archives

News

Engaged or Confused?

November 9, 2009

Share This Story

FREE Daily News Alerts

Advertisement

With today's release of the National Survey of Student Engagement, hundreds of colleges and universities will be studying their results, and considering whether they should change policies or approaches to better reach students. But a new study released Friday argues that NSSE (pronounced "nessie") is seriously flawed, lacking validity for its conclusions and asking questions of students in ways that are sure to doom the value of the data collected.

NSSE "fails to meet basic standards for validity and reliability," writes Stephen R. Porter, an associate professor in Iowa State University's educational leadership and policy studies department. Porter's study -- presented in Vancouver at the annual meeting of the Association for the Study of Higher Education -- raises questions about most research based on surveys of students, and he stresses that he does not believe the problems are unique to NSSE. He even goes so far as to say that in the past he has done research based on student surveys that he now doubts has validity.

But he uses NSSE as his focus, in part, because it is a student survey that has captured the attention of so many higher education leaders.

"Most academic research" using student surveys "is ignored, but this has a huge impact," he said in an interview. "If I get something wrong in a journal article, maybe two dozen people read it, but this is something they are using to tell colleges what to do. There are high stakes on how these surveys are used."

Alexander C. McCormick, NSSE director and associate professor of education at Indiana University at Bloomington, said in an interview that "some of the issues [Porter] raises have legitimacy," but he also said that Porter is overstating some of the problems and ignoring some evidence that backs NSSE's methodology.

In his paper, Porter gathers evidence from other researchers that suggests NSSE's basic approach to asking questions of students is flawed. One key issue he raises is that students don't necessarily know what it means when they are asked if certain practices or experiences are frequent or rare -- even though such measurements are critical to NSSE questions.

For example, Porter cites a 1982 study in which college students were asked the same question (over the course of a longer survey so it wouldn't be obvious to students that they were answering the same question) in two ways. The question was typical of NSSE questions in asking students how frequently they made an appointment to see a faculty member, and the first round of answer choices were similar to the kinds of answers NSSE uses: occasionally, often and very often. But in the second iteration of the question, students were given more specific answers, ranging from "once or twice a year" to "more than once a week."

As the results indicate, there is considerable variation among students on what they mean when they say "often," which in this study was almost as likely to mean "once a week" as "3 to 6 times a year." And students with the same meaning (when defined precisely) check different categories when given the NSSE-like answers.

Consistency of Student Interpretation of Occasionally, Often and Very Often

Students Who Checked... Occasionally Often Very Often
... and also checked "never" for same question 4% 0% 0%
... "once or twice a year" 37% 6% 2%
... "3 to 6 times a year" 38% 24% 9%
.. "1 or 2 times a month" 18% 45% 33%
... "about once a week" 3% 21% 35%
... "more than once a week" 0% 4% 21%

But it's not just that students don't measure frequency the same way, Porter argues. They also don't know (at least with precision) many other terms used by NSSE.

For instance, there is a question about whether students discuss grades or assignments with instructors, to which Porter points out that some may count teaching assistants and others may not.

Or the question about "serious conversations with students." Of this question, Porter asks: "How does a student distinguish between serious and frivolous conversations? And what is a conversation? A chat in the bathroom? An hour-long bull session in a dorm room?"

And then there is the question about whether students believe that their college helps them learn to think critically and analytically. Porter writes that this question is "a good example of how we let educational jargon creep into our surveys, and then assume that students understand what we mean. Recently, a graduate student interviewed me for a class she was taking about teaching, and asked me how I taught critical thinking in my classes. We then proceeded to have a discussion about what she meant by critical thinking, because I wasn’t clear on what it meant, in terms of what she was asking. If I, as a higher education researcher, have trouble defining the phrase 'critical thinking,' how can we expect the average college student to understand the concept, much less ensure that this understanding is similar across college students?"

In these cases, students may be honestly answering questions, but their lack of knowledge may result in wide variations of what the data mean, Porter says.

In yet other cases, there is research to suggest that students may not be entirely honest. Porter writes of numerous studies suggesting that students engage in a bit of grade inflation when asked about their academic performance, and tend to answer questions in ways that make themselves look like slightly better students than they really are.

Given these and other issues, Porter writes that NSSE cannot be presumed to be valid -- and he questions the idea that there is any evidence that NSSE scores in their current form are a good indicator of student learning.

"The promise of a survey instrument that can quickly and relatively cheaply provide an alternative to actually measuring learning has, not surprisingly, been alluring to many colleges," he writes. "That an instrument that fails to meet basic standards of validity and reliability has been so quickly adopted by numerous institutions indicates the desire of many institutions for a solution to this issue."

What to do? In some cases, Porter argues for additional validity testing to see whether there are notable patterns -- by institutional competitiveness, sector or major, for example -- on how students respond to questions. In other cases, he argues for much more detailed definitions and instructions. And he suggests that other approaches -- such as time diaries, in which students carry a diary and record what they do over a period of time, rather than later remembering what they did -- are far more accurate.

In an interview, Porter stressed that he too believed (from his experience, not from research) that qualities promoted by NSSE, such as close student-faculty interaction, are important.

Porter went to Rice University, where he said he remembers many qualities of the type praised by NSSE in producing engaged students. "I would want my child to go to a college that has those characteristics," he said. "The question is: Can the NSSE help us identify those colleges?" He said that because of the "grand claims" made by NSSE and largely unquestioned by academic leaders, it has become the "gold standard," when it really needs an overhaul. Where is the evidence, he said, that students understand the questions, and that their answers lead not only to engagement but to learning?

The NSSE Response

McCormick, the NSSE director, said that he and his colleagues "are the last persons to say that NSSE is perfect," and that Porter had raised some issues "we need to deal with." But he also said that Porter was leaving out some key context about survey research generally and the realities of student surveys.

As to what Porter got right, McCormick said it was important for those who do survey-based research to regularly consider whether the wording of various questions made them vulnerable to misinterpretation, and he said that, for example, that while he thinks the question on critical thinking isn't typical, it is the question where "one could make the strongest case that it's a jargon-ish question."

But McCormick said that NSSE has done extensive validity studies on its questions -- typically through focus group "cognitive interviews" in which subjects are asked to think aloud about their answers to various questions, and that this process was used to verify that questions were being answered consistently, and to refine wording. Further, he said that earlier versions of NSSE included lengthier instructions and definitions, and that researchers found that students simply ignored the information, leading to the belief that more students will participate, and participate thoughtfully, with shorter introductions to the questions.

Further, McCormick questioned the cost and practicality of giving students time diaries to carry out for some period of time. He noted that students can fill out a NSSE survey in 15 to 20 minutes, making it much easier to gather information from large numbers of students.

"It's certainly true that if we equipped all students with time use diaries and said 'Fill this out every half hour to tell me what you do,' we would get much more precise estimates of how many hours a week they spend. Of if we hired someone to follow them around, we would also get more information," he said, but he wondered how many students would go for this approach.

Porter was setting a false standard for NSSE (and other surveys) to live up to, McCormick said. "A lot of what's written about in the paper are problems that are common in social science research," he said. "Our measurement tools are blunt, and NSSE is certainly no exception to that."

Finally, McCormick said that a lot of the value of NSSE was in "relative comparisons" in how an individual college does from year to year, how various parts of a college do, or how groups of colleges do. So if a college sees relatively low scores in an area, and shifts policies, and sees an improvement on NSSE, that means something worked, and if there is no improvement, more work may be needed. Improvement may be valid, even if the students' answers to questions do not yield "some precise numeric comparison," he said.

While McCormick acknowledged the possibility that NSSE may need to be "more clear" about the value of relative comparisons as opposed to individual figures, he said "that's what most of the schools are doing."

It would be a problem, McCormick said, if research found that students at different kinds of institutions or in different types of academic programs answered questions in notably different ways, but he said he saw no evidence of this, and that this concern was "not a severe threat to the instrument."

Porter said he wasn't impressed with NSSE's response. Porter said that NSSE places too much emphasis on maximizing the number of respondents, as opposed to maximizing the chances that answers are accurate. "There seems to be this attitude that people doing surveys in the field should be held to lower standards. I don't agree," he said.

See all postings »
Advertisement
Advertisement

Matching Jobs

Comments on Engaged or Confused?

  • Scratching the Surface on NSSE Flaws
  • Posted by Keith Johnson on November 9, 2009 at 8:15am EST
  • So "NSSE 'fails to meet basic standards for validity and reliability,' writes Stephen R. Porter..." Actually, that's an understatement, and the points made by Porter are not the basic flaws, in my opinion. The result, NSSE is likely to lack basic validity to be used as it is.

    To be blunt, the type of questions used on this national survey (and others written by psychologists and educators are badly flawed. So here is a response by a sociologist.

    First, these surveys ask questions that have clearly "correct" responses. Second, they are administered in classrooms. The more closely the administration of such surveys mimics classroom exams, the greater the responses will elicit "correct" responses.

    The vague way the survey items are posed is discussed partially by Porter, but he is merely scratching the surface on this one. To measure the characteristics of an individual student we have to ask a general question, such as "How often do you..." But to measure the characteristics of a population, we have to ask a very specific question about behavior that is not "right or wrong," such as:

    "Thinking of what you did at school yesterday, did you skip any classes?" Then the choices would be: "Of my ____ scheduled classes, I attended _____" The behavior being measured may not be representative of that particular student's academic career, but in aggregate, it will demonstrate what is happening on campus.

    Thinking of measuring a population (sociology) works best in these circumstances. Thinking of individual students learning (psychology or education) doesn't work at all, especially combined with vague questions with obviously "correct" answers. The results are dubious at best, and subject to manipulation by changing the survey setting and administration.

  • Rats in the maze
  • Posted by Jeanne Phoenix Laurel , Assoc. Prof. and Chair of English at Niagara University on November 9, 2009 at 9:30am EST
  • Put a bunch of rats in a maze, give their butts a shove, and some will inevitably find the cheese. My students think NSSE is a joke. They don't really think about what their answers mean, or perhaps even what they're marking. They're habituated to making correct round marks in little circles using number two pencils, or the online equivalent. There is no incentive for any particular answer. Why should they be honest? Even with more specific questions, all we can really measure is whether students feel warm and fuzzy enough about the school to put down the answer they know is the desired one.

  • hm
  • Posted by Chad Ahren on November 9, 2009 at 9:45am EST
  • I have to land firmly on the side of NSSE here. NSSE administration doesn't take place in a classroom, emulate classroom dynamics or elicit expected responses. Administration of the survey is painstakingly tracked so as to avoid contaminating factors like those. Ask any campus project manager who liaises with NSSE - you work hard to get the best, most trustworthy results possible.

    Of course one could conceive of finer-grained instruments. But in order to have such a global data set you have to strike a balance in order to actually collect the responses (using stringent admin techniques as above). It's easy to snipe at NSSE's shortcomings but they are working to answer important questions and provide insight we can all use to focus on improving the educational experience. Providing data to schools is only part of the NSSE project's focus - they also want to provide useful research for the higher ed community and contribute to the betterment of the entire landscape.

    Adhering to that vision, of course, requires lending a careful ear to both constructive and frivolous criticism. That's how any responsible researcher or practitioner improves over time.

  • Posted by Raoul Ohio on November 9, 2009 at 11:30am EST
  • Bravo for Porter!

    I have NO expertise in NSSE or any similar tests. But, on the surface they appear to be TOTAL CRAP! They may well have some redeeming features or get something right, but why should I dig for it? The question is, why would anyone take NSSE seriously or use it to set policy?

    The entire matter is an indictment of the standards used in the professional education establishment. If I had time right now, I would say a lot more on that topic.

  • Posted by R , Grad Student on November 9, 2009 at 11:30am EST
  • NSSE certainly could be improved, language could be clarified, etc, but it is useful as one measure. What a practitioner should look for out of NSSE is an indicator of a general feeling about X topic on campus. You look to improve in those areas. The precise measurement isn't nearly as important as improvement in categories. The truth is that it's nearly impossible to accurately measure student development and engagement on large scale; it's difficult enough to do on an individual level.

    As a student that has taken the NSSE, (not in a classroom, by the way, but in the privacy of my dorm room) I can say that just as there was no incentive for me to be honest, there was also no incentive for me to be dishonest.

    Just because something isn't perfect doesn't mean it's not useful as one indicator. If NSSE results are the end all be all at any institution, that's more of an institutional problem than a NSSE problem. It is one valuable tool among an array of tools, instruments, and indicators. If used that way, it has a helpful niche.

  • Engagement and all that
  • Posted by GTKarnezis on November 9, 2009 at 3:15pm EST
  • Here's what we need to be thinking about: is it the case that a relatively "leisured" education -- one which allows for surprises and many opportunities for students to associate with professors and others with whom they can talk about books, ideas, and ultimate matters of concern --- is such an education becoming increasingly rare because the amount of time that students must (for whatever reasons) serve as employees has steadily increased? The real questions are about time and how you spend it, or are able to spend it, in interaction with people not limited to one's peers.

    Years ago, Paul Goodman wrote about the isolation of students from mentors, adults worth hanging out with, and the resulting infantilization of higher education. I am not arguing against working one's way through college: I am only saying that the increasing amount of time that students these days must spend at such "working" (either to satisfy their overstimulated appetites or to meet the ever-rising costs of even (allegedly) public institutions, is a definitive factor in their capacity for engagement rather than merely "being busy."Any "assessment" of engagement MUST take into account students' economic situations and the extent to which they inhibit or encourage whatever we mean by "engagement."

  • Anything Better?
  • Posted by Rhonda on November 9, 2009 at 3:15pm EST
  • I have no particular opinion about NESSIE. But I do have a question for those that are so critical. Is there anything out there that is any better? Educators are being asked to be accountable. This is one such attempt. It may not be perfect but at least someone is trying. And my kudos go to those colleges that are using the information to attempt improvements.

    Perhaps our energies are better focused on improving it rather than simply being critical.

  • Posted by Raoul Ohio on November 9, 2009 at 4:00pm EST
  • Re: "Anything Better, from Rhonda"

    This is like saying "Tarot cards are not any better than palm reading, so let's work to improve palm reading".

  • Perfect <> Good
  • Posted by Mark Freeman , Institutional Research on November 9, 2009 at 5:45pm EST
  • Let's not let the perfect be the enemy of the good. Or for that matter, the "marginally better than middling..." as the alternative is nothing. This kind of criticism is an easy game to play, and it is one all-too-familiar to anyone that has worked with faculty as a whole to move forward on the issue of assessing student learning.

    The real issue here is the relative unwillingness of faculty as a whole (excepting the author, obviously) to approach the issue of assessment of student learning with the same rigor and passion that they conduct their own research. The NSSE and all its flaws are a symptom, not a cause, of this problem. The NSSE has simply evolved in the direction permitted by low-to-nonexistent faculty interest in this issue.

    Any serious "fixes" to the NSSE instrument and its ilk presuppose considerable investment of faculty time and resources in assessing and being able to document the question of whether students are learning what we say they are. Maybe the weaknesses of the NSSE can be used as a foil for that sort of rallying-cry, but the author's criticism, rather, were reserved for short-sighted administrators and tenure-obsessed educational researchers.

  • Most Valid & Useful Perspective
  • Posted by Rod Skinkle , President at Research Consultancy on November 11, 2009 at 12:15pm EST
  • As someone who has coordinated the administration of NSSE in one of Canada's largest institutions and numerous related studies focused on higher education participation, outcomes etc. I feel that the most reasonable (valuable) comments were provided a graduate student (R). I won't repeat these points since R has articulated quite clearly.
    Rather, I would add the observation that Porter's comments have some value to the degree that they challenge us to to improve the research. There is no doubt that there should be some more longitudinal and rigorous studies that concentrate on both reliability and validity (ie. predictive). But in the meantime, R provides a most useful perspective.
    RS

  • Response rates!?
  • Posted by John , Professor at Vancouver Island University on November 16, 2009 at 10:15am EST
  • Nobody seems to be noticing the dismal self-selection effect which is characteristic of the NSSE surveys. Typical of many universities, where only 30-35% of the sample choose to respond, what do the other 65-70% believe about their educational experiences?
    Weighting the responses cannot correct for the nonresponse effect. I think someone should tell the emporer that he has no clothes.