Google Scholar will be 10 years old as of next month. Also coming up fast, early in the new year, is the 10th anniversary of the launching of Inside Higher Ed. Just to be totally clear about it, they were completely unrelated developments, though I do seem to remember each being met with skepticism and reservations, in some quarters. Certain senior academics insisted on calling them both “blogs.” (It was a simpler time.)
Suffice it to say that the beta days soon passed. The world of higher learning has grown ever more dependent on the internet’s intricate system of tubes -- with the academic web now serving as publishing platform and archive, and also as a point of access to restricted or proprietary digital collections. In principle, at least, we ought to be able to determine the dimensions of the scholarly web: how many items it contains (papers, dissertations, conference recordings, etc.), in however many formats, and with whatever depth of indexing and degree of retrievability. But actually taking the measurements is another matter. They belong to the epistemic category Donald Rumsfeld so aptly dubbed “the known unknowns.”
How about posing a narrower question, then? Just how big is Google Scholar? As with the company’s search-engine algorithms, that information remains a trade secret. But a couple of recent studies have tried to sound the depths of Google Scholar – using GS itself.
The earlier of the papers is “The Number of Scholarly Documents on the Public Web” by Madian Khabsa and C. Lee Giles, published in May by PLOS One, the online open access journal. Lee is a professor of information and computer sciences at Pennsylvania State University, University Park, where Khabsa is a Ph.D. candidate in computer science and engineering.
Their study, conducted in early 2013, started with a pool of 10 papers each from 15 categories used by another search engine, Microsoft Academic Search. The fields covered were agriculture science, arts and humanities, biology, chemistry, computer science, economics and business, engineering, environmental sciences, geosciences, material science, mathematics, medicine, physics, and social sciences, plus multidisciplinary. The researchers then requested from both Google Scholar and Microsoft Academic Search a list of the distinct incoming citations to each paper. That is, if paper X cited a target paper seven times, it counted only as one citation.
“Overall,” Khabsa and Lee report, “we obtained 41,778 citations from MAS and 86,870 citations from Google Scholar,” including all the metadata available: “the document's title, list of authors, number of citations, year of publications, and the venue of publication (if available).”
The format and range of metadata are not standardized, but with some effort the results from the two search engines could be compared to determine how many items appeared in both -- as well as the number GS had but MAS didn’t, and vice versa. The researchers also determined the degree of overlap in each of the 15 fields, and the percentage of papers available without payment or subscription.
At the time of the experiment, MAS claimed 48.7 million records. Taking into account the degree of overlap, the researchers “estimated Google Scholar to have 99.3 million documents, which is approximately, 87% of the total number of scholarly documents found on the web,” which they determine to be some 114 million items in English, with about a quarter of them freely available.
The share of open-access material varies considerably between fields, of course. At the low end is agricultural science, with 12 percent. Computer science is in the lead, with half of scholarly publications being freely available. (The percentages are broken down by field in the paper’s second table.)
In July, Enrique Orduña-Malea and three other researchers in Spain published “About the Size of Google Scholar: Playing the Numbers” through the site arXiv. The paper begins with an assessment of the methodology and findings of Khabsa and Lee. In particular, Orduña-Malea et al. stress “the low indexation of institutional repositories on Google Scholar” and the GS policy of not indexing files over 5MB, “a procedure which is especially critical for doctoral theses).” They suggest that the earlier study probably underestimates the size of the academic web – even of its strictly Anglophone component -- while overestimating how much of it Google Scholar indexes.
The authors go on to describe a battery of tests designed to measure GS “from the inside,” so to speak. Generally this involved performing searches for particular kinds of documents or search terms, using different temporal filters. For example: the search could be made for patents and citations issued in a century, then by decade within that century, and the result compared.
An intriguing variant is what the authors call an “absurd query,” in which the search is for a very common word likely to be found in most documents, with variations on the search parameters, including temporal filters.
The Khabsa-Lee study came up with the estimate that Google Scholar indexed 99.3 million documents in English. Extrapolating from that, with English accounting for 65 percent of GS material, Orduña-Malea et al. determine that the Khabsa-Lee method yields a total Google Scholar database of 152.7 million documents. (Despite their critique, the authors call Khabsa and Lee’s work “novel and brilliant.”)
The methods that Orduña-Malea and his colleagues tried were not restricted to English-language scholarship. Their findings varied from a low of 126.3 million documents in Google Scholar to a high of 176.8 million.
For their part, the Spanish researchers conclude that a judicious estimate would be in the neighborhood of 160 million items. “However,” they write, “the fact that all methods show great inconsistencies, limitations and uncertainties, makes us wonder why Google does not simply provide this information to the scientific community if the company really knows this figure.”
It’s a fair point, and one the company should answer. Its 10th anniversary might be a good time for Google Scholar to give its users the gift of transparency.
In his "Civility Manifesto" published here last week Cary Nelson cites my blog post, "Is Incivility the New Communism?," as "particularly hyperbolic" in its critique of efforts by some university administrators to demand that free speech be limited to expression they deem "civil." In his own expression Nelson himself has been known to employ hyperbole -- as well as sarcasm -- even to the point that some might call uncivil, so it's surprising he does not recognize that my exaggeration was intentional. Of course, we are not living through a new Red Scare on campus. But the point is that we could be if the incivility monitors have their way.
Nelson asks, "does unrestrained antagonism make for the best learning environment?" It surely does not. But that isn't the issue. No reasonable person welcomes hatred, harassment, or violence. According to Nelson, "Eloquence in the service of conviction does not require abusive rhetoric or personal accusation." But, like it or not, free speech does not require eloquence. It's one thing to encourage civil conduct and reasoned discourse, quite another to regulate expression in the name of such encouragement. But that is precisely what too many college and university administrators and trustees are threatening to do. The threat to free speech rights is real.
But we needn't go further than Nelson's own campus for the most chilling example of an effort to invoke civility as a criterion of free expression. According to University of Illinois at Urbana-Champaign Chancellor Phyllis Wise, “What we cannot and will not tolerate at the University of Illinois are personal and disrespectful words or actions that demean and abuse either viewpoints themselves or those who express them.” This extraordinary dictate goes well beyond encouraging civil communication. Under Wise's disturbing standard a biologist who "demeans" creationism could be out of a job; indeed, the evolutionary viewpoint itself, which creationists claim demeans humanity, could be banned.
While Dirks later clarified his ill-considered statement, Wise has not. Indeed, the Illinois Board of Trustees quickly declared its "collective and unwavering support of Chancellor Wise and her philosophy of academic freedom and free speech tempered in respect for human rights.” According to the board, UI must be a “university community that values civility as much as scholarship.” We know who should assess scholarship: the scholars. But who will judge what is civil? That is precisely the issue.
Nelson is correct that much of the uproar over civility stems from the Palestinian/Israeli dispute and I share his concern about the corrosive effect of that seemingly intractable conflict on both the campus climate and academic freedom. We differ sharply, however, about who is responsible. According to Nelson, only one side is to blame: "verbal excess, aggression, and ad hominem attacks are part of the standard repertoire of the Boycott, Sanctions, and Divestment movement. That typically stimulates raised tempers and sometimes similar behavior on the pro-Israeli side."
But the situation is more complicated. Elements on both sides invoke the protections of academic freedom and the First Amendment when it benefits them and violate these when it doesn't. What about efforts by some supporters of Israel to effectively blacklist faculty members whose views on Middle East studies do not comport with theirs? This move is truly evocative of the Red Scare and more threatening to academic freedom and the independence of the universities than a few "uncivil" words from pro-Palestinian students.
More important, the two sides are not in equivalent positions. Although support for the Palestinian cause is greater on American campuses than in society in general, those who control the universities -- administrators and trustees as well as powerful donors -- are most likely to support the Israeli cause. The expressive weapons of those in power and those without power almost always differ. It is usually the powerless whose voices must be shrill, who may break rules to be heard, who, in short, may be uncivil. Civility, however, can be a privilege of the powerful, whose control over institutions often leads them to silence opponents instead of engaging them. As Greg Lukianoff of the Foundation for Individual Rights in Education noted, in his experience "campus administrators are most likely to deem as 'uncivil' speech that criticizes them or the university’s sacred cows."
This imbalance is illustrated by events at Ohio University, which Nelson cites as an example of "uncivil" expression escalating to arrests. Nelson ignores, however, what preceded these arrests. A student government leader filmed a video in which she poured fake blood over herself to protest Israeli actions in Gaza. The video was bold and even graphic, but under the law it is protected speech and as a video abused or threatened no one. The student, however, received numerous death threats. How did the university president respond? In a mealy-mouthed statement he disassociated the university from the video and called vaguely for civility, but remained silent about the death threats, effectively coming down on one side of the controversy. Only later were four pro-Israeli students arrested for disrupting a meeting, which suggests that lame public calls for civility are also largely ineffective.
"When administrators urge us to be models of civility they are doing exactly what their job requires," Nelson declares. I agree, but the danger to which I and so many others have been responding is that such urgings show undeniable and dangerous signs of becoming requirements. And such requirements may threaten academic freedom and free speech as much as any loyalty oath.
Henry Reichman is first vice-president of the American Association of University Professors and chair of AAUP's Committee on Academic Freedom and Tenure. He teaches history at California State University at East Bay.
Printed posters carried aloft in a September demonstration at the University of Illinois at Urbana-Champaign read “Civility = Silence. Silence = Death.” In a particularly hyperbolic move, Henry Reichman, chair of the American Association of University Professor’s Committee A on Academic Freedom and Tenure, claimed that charges of incivility are being used to silence faculty members in the same way that accusations of communist sympathies were used to silence them during the McCarthy period of the 1950s. Historical comparisons should be carefully justified. This equation is at best frivolous; at worst it risks fomenting unwarranted feelings of victimhood.
Faculty members and students share with all Americans the right to indulge in uninformed and intemperate speech. Social media provide them with effective means for doing so. But does unrestrained antagonism make for the best learning environment? Does it advance knowledge in the way higher education is pledged to do? Does it train students to evaluate evidence dispassionately? Does it prepare students to participate productively in public life? Does it help students learn that it is possible, indeed preferable, to be zealous in advocating a point of view without vilifying or trying to silence those who differ?
These effects include long-term consequences. The immediate effects of uncontrolled campus hostility can be more dramatic. We have already seen intemperate campus speech escalate toward violence. That happens most often with debates over Israel. Indeed verbal excess, aggression, and ad hominem attacks are part of the standard repertoire of the Boycott, Sanctions, and Divestment movement. That typically stimulates raised tempers and sometimes similar behavior on the pro-Israeli side. Nothing good comes of these confrontations.
We have recently seen campus passions cross the line into a moment of physical violence at Temple University. We saw extreme speech and symbolic action degenerate into threats, accusations, and even arrests at Ohio University. If we add to the list instances when campus groups sought to undermine academic freedom by denying invited speakers the right to speak the list of examples would grow. A silent protest at a lecture is a dignified act of moral and political witness. A brief noisy demonstration that ends after a minute registers passionate discontent but preserves academic freedom. Both of course require group discipline. A vocal demonstration that blocks a lecture abandons civility and undermines the purpose of higher education.
A certain portion of the American left now regards civility as a bland form of corporate speak. Or, worse still, as an Orwellian effort to stifle academic freedom. And the far right thrives on hostility and ad hominem attack. Our divided national political culture can hardly be said to encourage anything different. But should a campus try hard to emulate Washington?
Academic freedom does indeed protect both current faculty members and students from institutional reprisals for deplorable speech. But it was never intended to protect people from criticism for what they write and say. Uncivil students and faculty at a university should not be punished. University presidents who urge civility are not trying to stifle dissent or suppress speech. They are trying to make the campus an oasis of sanity. They are trying to urge faculty and students to showcase productive dialogue. That is part of what higher education owes the country. That is part of the cultural and political difference higher education can make.
Civility does not preclude passionate advocacy. It doesn’t preclude devices like irony and humor. Nor does it mean ideas and arguments cannot be strongly expressed and severely criticized. Civil discourse need not be bland. Civility should lead us to treat people with respect, but it doesn’t mean that all arguments or ideas merit respect. Eloquence in the service of conviction does not require abusive rhetoric or personal accusation. It does not require us to claim we know what is in one another’s hearts and to indict people on that basis. It does not require us to demonize our opponents unless we believe they are beyond hope and fundamentally corrupt or evil, a perspective not likely to apply to campus colleagues. Campus speech that harasses, bullies, or intimidates cheapens our communities and diminishes their value.
When administrators urge us to be models of civility they are doing exactly what their job requires. Civility does not mark the boundaries of free speech protection. But it helps describe how we can most often relate to one another productively. Voluntary civility is the best way to conduct difficult debates, but it is not a limit on permissible speech. Faculty members need to teach by example. They need to take the lead in demonstrating what good citizenship entails. Unfortunately, far too many faculty members are doing precisely the opposite.
The Arab/Israeli conflict gives continuing evidence of how inflammatory rhetoric in the Middle East can lead to actual violence. It is thus both sad and ironic to see our campuses conduct debate on the subject as if campus debate amounted to war by other means.
Cary Nelson served as national president of the American Association of University Professors from 2006 to 2012. He teaches at the University of Illinois at Urbana-Champaign.
On Sunday, October 5, Mumia Abu-Jamal, African-American public intellectual and death row survivor, delivered a commencement address to graduates of Goddard College's low residency bachelor's program. The students chose their speaker and the speech was pre-recorded, given that Abu-Jamal is serving a sentence of life without parole in Pennsylvania. Following announcement of the speaker choice, Goddard endured a barrage of scornful press reports, hate-laced phone messages, and social media backlash. Pennsylvania Republican Senator Pat Toomey pressured the college to rescind its invitation, with police and corrections officials issuing similar calls.
As a Goddard faculty member and longtime social justice activist, I've been much distressed by the high volume of shrill, one-dimensional press coverage. You would never know that "convicted cop killer" Abu-Jamal (in Fox News parlance) was found by Amnesty International to have been deprived of a fair trial, nor that he and an impressive group of supporters here and abroad credibly claim he was framed. Nor could you grasp why Goddard would let its graduates pick their speaker and stand firm as the controversy severely taxed the small Vermont college's resources -- or why so many faculty and staff see upholding our association with Abu-Jamal (who received his own Goddard B.A. in 1996) as not just "the right thing to do," but an affirmation of everything we've long been about.
A transcript of the commencement speech may be found here, and a recording appears at the bottom of this essay.
Not that you would really expect any of this context to be clarified by soundbite journalism and Facebook flame wars. Abu-Jamal represents a tradition of uncompromising progressive activism within grassroots African-American communities, a political lineage relentlessly marginalized in the current political environment. Meanwhile, Goddard's own roots in a radical educational philosophy that values critical dialogue and social engagement don't make sense to a public encouraged to see higher education as job market training, worthwhile only when "learning" can be quantified and monetized.
Yet, in a wonderful irony, the obfuscating public uproar has sparked a rich internal conversation among Goddard's faculty, staff, students, and alumni. Does our penal system deserve the label "prison-industrial complex"? If so, why? Do recent protests in Ferguson, Missouri illuminate historical dynamics between police and low-income communities of color on levels relevant to what happened when Officer Faulkner was killed in Philadelphia in 1981 (the crime for which Abu-Jamal was convicted)? Apart from the specifics of this case, what are the implications of the fact that the name Mumia Abu-Jamal still sparks outrage in people who would never blink at academic honors for men like William Burroughs and Louis Althusser, both of whom killed their wives?
How can we uncover and name the often hidden ways in which race and class assumptions are buried within these reactions? Most challenging of all, how might we as faculty and students in a small, nontraditional liberal arts college begin to address our own participation and complicity in the oppressive aspects of the larger education system?
Goddard alumnus Kevin Price, who works on Abu-Jamal's defense, has written eloquently of how his own enrollment at Goddard was partly inspired by his contact with the man. He concludes that despite many "wonderful symbolic reasons to support Mumia as a commencement speaker, Mumia is not a symbol. He is a man who was wrongfully held in solitary confinement on death row for nearly 30 years and is now being wrongfully held in general population with no legal possibility for parole.... He is a man with a brilliant mind and an unstoppable pen.... With so much at stake it only seems right that we listen."
The example of this student's educational journey bears out the observation of Dr. Herukhuti, Goddard Faculty Council chair, that it is our educational philosophy rather than the political content of our academic program that makes Goddard a radical college: "We have created a space for people, like Mumia and our thousands of students and alumni/ae around the world, who have tremendous obstacles to their educational ambitions to unshackle their dreams and achieve their goals. We have created an incubator for thinkers, artists, healers, activists and writers who have decided not to allow their brilliance to be diminished nor snuffed out behind the walls of any form of prison — real or metaphoric."
How I wish that Goddard could "publish" our internal dialogue, thereby usefully complicating the seductively simplistic mainstream media account. What a teachable moment that would be!
Jan Clausen is a poet whose most recent book is Veiled Spill: A Sequence (GenPop Books, 2014). She teaches in the Goddard College M.F.A. in Writing Program. The opinions expressed in this article are solely those of the author.