Search News


Browse Archives

News

Digital Archives That Disappear

April 22, 2009

Share This Story

FREE Daily News Alerts

Advertisement

As digital archives have become more important and more popular, there are varying schools of thought among scholars about how best to guarantee that they will be around for good. Some think that the best possibility is for the creators of the archives -- people generally with some passion for the topic -- to keep control. Others favor acquisition, thinking that larger entities provide more security and resources for the long run.

The fate of "Paper of Record," a digital archive of early newspapers with a particularly strong collection of Mexican newspapers, may be cited in the years ahead as an example of the dangers of purchase by a large entity. Paper of Record was purchased (secretly) by Google in 2006, and shortly after Google took over management of the site, late last year, the archive disappeared from view. After weeks in which historians have complained to Google and others about the loss of their ability to work, the previous owner of the archive has received permission to bring the archive back for some period of time, and resumption of service could start as early next week.

While the imminent return of the site will please scholars, many are worried about what the incident says about the availability and accessibility of key resources. Writing on the blog of the American Historical Association, Robert B. Townsend quoted the late Roy Rosenzweig, a George Mason University professor who was a pioneer in digital history, on the "fragility of evidence in the digital era."

The current loss of access is "tragic," said Ted Beatty, director of the Kellogg Institute for International Studies and associate professor of history at the University of Notre Dame. Beatty said Paper of Record not only has a great archive, but its search tools "revolutionized" the ease with which scholars could search the newspapers in the collection. Beatty said that he has been using the database for years, and encouraging his undergraduate and graduate students to do so as well -- until access disappeared.

Beatty studies technological change in 19th century Mexico, and while he has done some of his research in Mexican archives and libraries, and will continue to do so, Paper of Record was extremely valuable to him. "There is a lot of historical evidence in newspapers that is typically not accessed because of the time consuming, low-yield nature of page-turning work," but that could be found online, he said.

The situation also shows the power Google now has over the collection. Even if Google puts the works back up, it could set prices for access -- and that could put the collection out of reach to some, Beatty said.

Richard Salvucci, a professor of economics at Trinity University, in Texas, said he has been blocked from pushing ahead on research on a lender that played a role in the Panic of 1837 in Mexico. Noting that Mexico's universities and government paid for some of the digitization of the newspapers from the country, Salvucci said "who the hell is Google to deny the very people who paid the freight the right to see their national patrimony? It stinks and it's an unfriendly and uncollegial gesture."

Word of the situation has also upset scholars in Mexico, where the Mexican Association for Economic History has issued a resolution (available here in Spanish) condemning Google and calling for the material to be made available immediately.

While Google declined to discuss the situation with Inside Higher Ed and has not returned the calls of many scholars, it has spoken with Bob Huggins, the founder of Paper of Record. Huggins said that he has permission to bring the site back up, and plans to do so next week, until Google is prepared to offer access to the archives. Huggins said that he sold Paper of Record to Google in 2006, but that the deal was secret until late last year, when Google exercised its right to have Huggins stop managing the site. At that point, Huggins expected a transition to having the material appear on Google, but the material just disappeared from public view.

"Google wasn't ready to put their version online and they have been taken aback by all the anger," he said.

Next week, Huggins will post rates for institutional subscriptions to the database of $3,600 a year for institutions with full-time enrollment of at least 20,000, and $1,800 for others -- prices that are less than the fees before the Google purchase. Huggins said he didn't know how long Google would let him keep the revived site up. "I don't think particularly that the communication of this was handled very well by Google. I'm very disappointed," he said.

At the same time, Huggins said that he thinks many academics have been too harsh in their criticisms of Google. While the National Endowment for the Humanities and other entities are supporting some digitization projects, "they are going to be 100-year projects" before they are really valuable to scholarship, Huggins said.

Whether scholars like it or not, Huggins said, Google is uniquely able to manage large-scale projects at a reasonable pace, despite the problems with Paper of Record. "There is no other entity on the planet that is Google," he said.

See all postings »
Advertisement
Advertisement

Matching Jobs

Comments on Digital Archives That Disappear

  • A Chilling Thought....
  • Posted by JOhn F. DeFelice , Associate Professor of History at University of Maine at Presque Isle on April 22, 2009 at 7:30am EDT
  • When big business or government controls the sources, they control history. They control the paths to access and perhaps even filter which primary sources are available and which are not. Unlike real world archives, digital sources can be manipulated, altered, edited, re-translated, falsified, adulterated, and made to disappear forever at the touch of a key. You can adulterate paper archives, but only with great difficulty.

    I am not claiming that Google has done such a thing. I am simply waking up to the enormous power this gives to those who manage electronic resources. Already libraries across the country have digitized their newspaper and mircofilms/fiche collections: Media that are static and almost impossible to change. State archives and government document collections are headed in the same direction. When the originals are recycled, what will be left?

    A few generations hence can some power digitize away the Holocaust, the McCarthy years, Stalin's purges or Mao? This seems to enable the ruthless to do the unthinkable. I'm no Luddite, but the potential for misuse is alarming. This past election, I heard the cry for paper ballots. And while I love the ease of electronic resources, keep the paper archives!

  • Disk Space is Cheap
  • Posted by Chris on April 22, 2009 at 8:45am EDT
  • If a digital resource is really vital to your research, download it!  The internet is a temporary and ever changing medium.  What's there today won't necessarily be there tomorrow.

  • Digital Archive Addiction
  • Posted by Chris , Associate Professor at University of Texas Arlington on April 22, 2009 at 9:30am EDT
  • I, like many others working in nineteenth-century Mexican Studies, became addicted to the ease of use of Paper of Record. For a nominal fee, I could do keyword searches and download pdfs of nineteenth-century newspapers with incredible ease. It used to be that only people with sizeable research funds and large blocks of free time could travel to faraway archives to do research. The digital age democratizes access to a certain degree, but then something like this happens, and you realize how fragile access is. Or your institution does not subscribe to a particular portal and you have to track down a friend from graduate school who is at an institution where they do have access, and beg for her library logon and password.

  • Are Not Others to Blame?
  • Posted by Christine A. Hudak , Associate Professor; Director of NurseWeb at Case Western Reserve University on April 22, 2009 at 9:45am EDT
  • I find it interesting that the only entity villified in this debate is Google. What about the original developer of the site? Was he not aware of Google's intentions? And, since the deal was made in 2006, what happened to make the digital archives disappear?

    I think that we need more information about the intentions of Google as well as Bob Huggins. Why is the site suddenly available again, and at cheaper prices? I may be a cynic but I think there is more to the story than simple disappearance of the Archives.

    While it is true that these resources are endangered, it is also true that as academics, we need to make a stronger case for their preservation. We can do that best by supporting efforts of large organizations such as Google to acquire and store the archives, but then we must hold the organization accountable for the preservation and continued access. How that is best done is unclear at this time, but it must be done.

  • 19th Century Mexican Newspapers
  • Posted by Marjorie , Associate Professor at California State University, Stanislaus on April 22, 2009 at 9:45am EDT
  • Did the National Endowment for the Humanities contribute to the restoration of the 19th century Mexican periodicals? If that is the case I believe all can obtain access through the University of Texas, Austin.

  • Rolling Stone Article
  • Posted by Download Research Now on April 22, 2009 at 10:00am EDT
  • I just read an article in Rolling Stone about a guy who is hired by recording artists to police sites where the artists show up on the web. Van Morrison had a couple of sites removed that displayed his lyrics. That's understandable.

    BUT the Artist Currently Known as Prince asked the guy to SCRUB all mentions of his name from the Internet!

    So, follow the advice of the last comment. Download what you need. The stuff is ephemeral.

  • Where these newspapers came from
  • Posted by Chris , Associate Professor at University of Texas Arlington on April 22, 2009 at 11:00am EDT
  • Marjorie wrote: "...I believe all can obtain access through the University of Texas, Austin."

    I don't think that is the case. The only other place where the same Paper of Record materials can be searched in their entirety is in the Hemeroteca Nacional at the Universidad Nacional Autonoma (UNAM). They have a computer room where you can search the SAME papers but through a different interface. The Paper of Record Mexican archive is a copy of what is at the UNAM hemeroteca. When you download materials from Paper of Record you can see their stamp on it. The gaps in coverage in Paper of Record are identical to the gaps in coverage on the ground at UNAM. What is held at the Benson collection at UT Austin is different and not searchable in full text format.

    It seems strange to me that the original owner of this archive in UNAM would cede all proprietary rights to their holdings. I know the content is public domain but the original microfilms and original paper belongs to UNAM. Strange....

  • Digital Preservation
  • Posted by Bill Goffe , Professor / Economics at SUNY Oswego on April 22, 2009 at 11:15am EDT
  • I'm a bit surprised that nobody mentioned LOCKSS (Lots of Copies Keep Stuff Safe), an effort by libraries to keep materials available. Besides copying files, it also ensures that they haven't been altered. See http://www.lockss.org .

  • WHY THE HIGH PRICE FOR THIS SERVICE?
  • Posted by Bob , SABR research dept. Cincinnati,Ohio at Society of American Baseball Research on April 22, 2009 at 11:30am EDT
  • OK, it is established that Bob Higgins will be charging for this service,thye "privilige" of obtaining valuable research papers thru POR.
    How about us private researchers who dont have access to a Library that will have to pay the hefty fee to join up? Why do we have to come up with $1800.00 to have what was free until January 2009.
    Doesn't make any sense to me and that fact alone is why folks should rebel,the cost factor for researching.

  • Who is Huggins?
  • Posted by Nancy , Visiting Professor at University of Kent, Canterbury, UK on April 22, 2009 at 12:00pm EDT
  • I agree with Chris, "it seems strange [...] that the original owner of this archive [...] would cede all proprietary rights to their holdings". Scott Jaschik has not mentioned how was it that Huggins came to be the sole owner of Paper of Record, able to sell it to Google.

    How - and when - did Huggins enter in this imbroglio? Where does he come from? There's more to this history then what is known up to now.

  • Libraries can help with this
  • Posted by interested academic , Associate Dean of University Libraries at Mercer University on April 22, 2009 at 12:45pm EDT
  • Libraries and librarians or archivists have been the keeper of culture(s) for thousands of years. They do a pretty good job of it - digitally or otherwise.

  • The Evil Empire
  • Posted by Bob Huggins , Founder at PaperofRecord.com on April 22, 2009 at 3:15pm EDT
  •  

    When exactly does the cat fight end?  It slays me to see the great American  Us versus Them debate rage on( I comment as a Canadian).  As person who pioneered the digitization of newspapers in the world with our company, Cold North Wind, I fail to see how this acrimony between Academics and Google helps 'joe public' access the public record.  I have stated on numerous occasions that the newspaper represents 'our' only record of daily public life for the past 500 years with a special emphasis on the word "public".

     

     

     

    I have been through the grinding wheels of both Google and many public institutions whose goal it seems is to preserve and present history from Newspapers.  Both have let me down.

     

     

     

    When we began our mission in 1999, there were no standards for digitization. It was a process of establishing a widely adopted TIF format combined  with a well deployed viewing software (Adobe Acrobat).  I think it naive that standards established by the NDNP in 2008 will be anymore long lasting that the 5 1/2 inch floppy disc.

     

     

     

    In simple mathematics, if one looks at the monies available for the NDNP project  from the NEH and divides them amongst all of the newspaper microfilm page images that exist, just in the continental United States, we should have a fine database by 2100.

     

     

     

    It's simply absurd not to call some form of Detante between Google and the public institutions.  It's time to form what would be a tremendous 'Public" record database of digitized newspaper microfilm for this generation and all others to come.

     

     

     

    Otherwise we are doomed to continue this provincial squabble to the detriment of all of the 'public'.

     

     

     

    R.J. (Bob) Huggins

     

    Founder

     

    Cold North Wind Inc.

     

    PaperofRecord.com

     

  • Downloading the database & login issues
  • Posted by E. Bosman , Assoc. Prof. Head of Technical Services at NMSU Library on April 24, 2009 at 12:00pm EDT
  • Chris wrote: "Posted by Chris on April 22, 2009 at 8:45am EDT
    If a digital resource is really vital to your research, download it!"

    You may download the articles you need, but you cannot download an entire database without being in violation of copyright. Normally, this is specified somewhere in the user agreement; even if it isn't specified in the user agreement (unlikely) the content of the database and its organization clearly belongs to someone else.

    Regarding Chris , Assoc.Prof. at U of Texas Arlington on April 22, 2009 at 9:30am EDT who assets if you don't have access to "track down a friend from graduate school who is at an institution where they do have access, and beg for her library logon and password" I understand your frustration, but this is a violation of your friend's access and if the friend is caught they could lose access, as could your friend's library. I can personally attest that database vendors do review a library's access patterns and they do turn off database access to a library when they suspect abuse. The library must track down the problem and resolve it before access is restored. So, please, reconsider putting your friends, yourself and libraries at risk with this practice. If you need access to a particular journal, sign up for table of contents alert services with that publisher or an aggregator (http://www.tictocs.ac.uk/) and request what you need via interlibrary loan.

  • Is it available at academic institutions? Anywhere?
  • Posted by Alison on May 1, 2009 at 2:15pm EDT
  • Any news on this issue?

  • Posted by Richard Salvucci on May 5, 2009 at 10:15pm EDT
  • I have asked this question directly of the former owner of POR. No one seems to know anything.

  • some content now available on Google
  • Posted by Alison with good news on May 27, 2009 at 11:30am EDT
  • The content that I was most interested in is now available on Google news archives. Go to advanced search and input the source you are interested in and any other aspects of your search.
    I'm happy for as long as it lasts!

  • spoke too soon
  • Posted by Alison on June 19, 2009 at 9:30am EDT
  • My content was there, for a few days. Now it is gone and the paper I was interested in is not in any of the 3 lists provided by Google (these lists are: Now there; coming soon; no rights). I am waiting for a response from World Vital Records which will apparently be hosting access to this data. Quite willing to pay their price if I can get at it in the same way that PoR offered it.

    I tried to interest the Canadian Broadcasting Corporation in this story (the tech radio show, Spark) - apparently they didn't find it interesting. I'd love to see someone else try to make this a story.

  • Paper of Record Back Online/ Legal issues
  • Posted by Chris , Associate Professor at University Texas Arlington on November 18, 2009 at 8:15pm EST
  • Paper of record is now a part of the subscription service world vital records.com

    I appreciate E. Bosman's comment about the ethics and legalities of inappropriate use of library resources, as well as the possible consequences. Of course, I know that improper use of databases is wrong. I have never had to do that, thankfully, and I still patronize real live archives in faraway places, despite the cost. But right or wrong, people do share logons and passwords to help each other out.

  • Paper of Record, Again
  • Posted by Richard Salvucci , Professor, Economics at Trinity University on December 17, 2009 at 9:15pm EST
  • The search engine at World Vital Records is practically useless for scholars who were using Mexican papers for historical work. Try it. Enter a historical name and see what you get.
    The situation is no better now than it was last May.