Category: RDF

The Vision of ORE

ORE logoOne form of serious intellectual work that could use much more respect and appreciation within the humanities is the often unglamorous—but occasionally revolutionary—work of creating technical standards. At their best, such standards transcend the code itself to envision new forms of human interaction or knowledge creation that would not be possible without a lingua franca. We need only think of the web; look at what the modest HTML 1.0 spec has wrought.

The Object Reuse and Exchange (ORE) specification that was unveiled today at Johns Hopkins University has, beyond all of the minute technical details, a very clear and powerful vision of scholarly research and communication in a digital age. It is thus worth following the specification as it moves toward a final version in the fall of 2008, and to begin thinking about how we might use it in the humanities (even though it will undoubtedly be adopted faster in the sciences).

The vision put forth by Carl Lagoze, Herbert Van de Sompel, and others in the ORE working group for the first time tries to map the true nature of contemporary scholarship onto the web. The ORE community realized in 2006 that neither basic web pages nor advanced digital repositories truly capture today’s scholarship.

This scholarship cannot be contained by web pages or PDFs put into an institutional repository, but rather consists of what the ORE team has termed “aggregates,” or constellations of digital objects that often span many different web servers and repositories. For instance, a contemporary astronomy article might consist of a final published PDF, its metadata (author, title, publication info, etc.), some internal images, and then—here’s the important part—datasets, telescope imagery, charts, several publicly available drafts, and other matter (often held by third parties) that does not end up in the PDF. Similarly, an article in art history might consist of the historian’s text, paintings that were consulted in a museum, low-resolution copies of those paintings that are available online (perhaps a set of photos on Flickr of the referenced paintings), citations to other works, and perhaps an associated slide show.

How can one reliably reference and take full advantage of such scholarly constellations given the current state of the web? As Herbert Van de Sompel put it, ORE tries to identify in a commonsensical way “identified, bounded aggregations of related objects that form a logical whole.” In other words, ORE attempts to shift the focus from repositories for scholarship to the complex products of scholarship themselves.

By forging semantic links between pieces entailed in a work of scholarship it keeps those links active and dynamic and allows for humans, as well as machines that wish to make connections, to easily find these related objects. It also allows for a much better preservation path for digital scholarship because repositories can use ORE to get the entirety of a work and its associated constellation rather than grabbing just a single published instantiation of the work.

The implementation of ORE is perhaps less commonsensical for those who do not wish to dive into lots of semantic web terms and markup languages, but put simply, the approach the ORE group has taken is to provide a permanent locator (i.e., a URI, like a web address) that links to what they call a “resource map,” which in turn describes an aggregation. Think of a constellation in the night’s sky. We have Orion, which consists of certain stars; a star map specifies which stars comprise Orion and where to find each of them. The creators of ORE have chosen to use widely adopted formats like RDF and Atom to “serialize” (or make available in a machine-readable and easily exchangeable text format) their resource maps. [Geeks can read the full specification in their user guide.]

In the afternoon today several compelling examples of ORE in action were presented. Ray Plante of the NCSA and National Virtual Observatory showed how astronomers could use ORE and a wiki to create aggregates and updates about unusual events like supernovas, as different observatories add links to images and findings about each event (again, think of Van de Sompel’s “logical whole”). Several presenters mentioned our Zotero project as an ideal use case for ORE, since it already downloads associated objects as part of a single parent item (e.g., it stores metadata, a link to the page it got an item from, and perhaps a PDF or web snapshot). Zotero is already ORE Lite, in a way, and it will be good to try out a full Zotero translator for ORE resource maps that would permit Zotero users to grab aggregates for their research and subsequently publish aggregates back onto the web—object reuse and exchange in action.

Obviously it’s still very early and the true impact of ORE remains to be seen. But it would be a shame if humanities scholars fail to participate in the creation of scholarly standards like ORE, or to help envision their uses in research, communication, and collaboration.

There has been much talk recently of the social graph, the network of human connections that sites like Facebook bring to light and take advantage of. If widely adopted, ORE could help create the scholarly graph, the networked relations of scholars, publications, and resources.

Zotero News, Big and Small

So much for a modest, stealthy launch of Zotero. I promised a couple of weeks ago that I would return to my blog soon with a few updates about user feedback, some hints about new features, and perhaps some additional news items. With a modest private beta test and a few pages explaining the software on our new site, I assumed that Zotero would quietly and slowly enter into public consciousness. Little did I know that within two weeks I would get over 400 emails asking to join the beta test, help develop and extend Zotero, make it work better with resources on the web, and evangelize it on campuses and in offices around the globe. (Sorry to those I haven’t responded to yet; I’m still working on my email backlog.) Better yet, we received some fantastic news about support for the project, which is where I’ll begin this update.

The big news is that the Center for History and New Media has received an incredibly generous grant from the Andrew W. Mellon Foundation to help build major new features into the 2.0 release of Zotero (coming in 2007). Included in this substantial upgrade are great capabilities that beta testers are already clamoring for (as I’ll describe below). I’m deeply appreciative to the Mellon Foundation and especially Ira Fuchs and Chris Mackie for their support of the project, and we’re delighted to join the stable of other Mellon-funded, open-source projects that are trying to revolutionize higher education and the scholarly enterprise through the use of innovative information technology. We have a very ambitious set of goals we would like to accomplish in the next two years under Mellon funding, and we’re really excited to get started and push these advances out to an eager audience.

My thanks also to the beta testers who have reported bugs and sent in suggestions. (For a few early reviews and thoughts about Zotero, see posts on the blogs of Bill Turkel, Bruce D’Arcus (1, 2), Adrian Cooke, Jeanne Kramer-Smyth, and Mark Phillipson.) We’re planning on rolling all of the bug fixes and a few of the suggestions that we’ve already implemented into the public beta that will be released shortly. The most requested new features were auto-completion/suggestions for tags, better support for non-Western and institutional authors, full-text searches of articles that are saved into one’s Zotero collection, more import/export options, support for other online collections and resources, and the detection of duplicate records. The developers are working feverishly on all of these fronts, and I think the Beta 2 release (our public beta) will be considerably better because of all of this helpful feedback.

I have intentionally left out perhaps the most wanted feature: tools for collaboration. Some of those who have started to hack the software have noticed what we at the Center for History and New Media have been thinking about from the start—that it seems very easy to add ways to send and receive information to and from Zotero (it does reside in the web browser, after all). What if you could share a folder of references and notes with a colleague across the country? What if you could receive a feed of new resources in your area of interest? What if you could synchronize your Zotero library with a server and access it from anywhere? What if you could send your personal collection to other web services, e.g., a mapping service or text analyzer or translation engine?

I’m glad so many of us are thinking alike. Those are the issues we’ve just started to work on, thanks to the Mellon Foundation. Stay tuned for the Zotero server and additional exciting extensions to the Zotero platform.

And despite my email backlog, please do contact me if you would like to join the Zotero movement.