what if | Spellbound Blog

My New Daydream: A Hosting Service for Digitized Collections

September 20, 2006 3 Comments

In her post Predictions over on hangingtogether.org, Merrilee asked “Where do you predict that universities, libraries, archives, and museums will be irresistibly drawn to pooling their efforts?” after reading this article.

And I say: what if there were an organization that created a free (or inexpensive fee-based) framework for hosting collections of digitized materials? What I am imagining is a large group of institutions conspiring to no longer be in charge of designing, building, installing, upgrading and supporting the websites that are the vehicle for sharing digital historical or scholarly materials. I am coming at this from the archivists perspective (also having just pondered the need for something like this in my recent post: Promise to Put It All Online ) – so I am imagining a central repository that would support the upload of digitized records, customizable metadata and a way to manage privacy and security.

The hurdles I imagine this dream solution removing are those that are roughly the same for all archival digitization projects. Lack of time, expertise and ongoing funding are huge challenges to getting a good website up and keeping it running – and that is even before you consider the effort required to digitize and map metadata to records or collections of records. It seems to me that if a central organization of some sort could build a service that everyone could use to publish their content – then the archivists and librarians and other amazing folks of all different titles could focus on the actual work of handling, digitizing and describing the records.

Being the optimist I am I of course imagine this service as providing easy to use software with the flexibility for building custom DTDs for metadata and security to protect those records that cannot (yet or ever) be available to the public. My background as a software developer drives me to imagine a dream team of talented analysts, designers and programmers building an elegant web based solution that supports everything needed by the archival community. The architecture of deployment and support would be managed by highly skilled technology professionals who would guarantee uptime and redundant storage.

I think the biggest difference between this idea and the wikipedias of the world is that there would be some step required for an institution to ‘join’ such that they could use this service. The service wouldn’t control the content (in fact would need to be super careful about security and the like considering all the issues related to privacy and copyright) – rather it would provide the tools to support the work of others. While I know that some institutions would not be willing to let ‘control’ of their content out of their own IT department and their own hard drives, I think others would heave a huge sigh of relief.

There would still be a place for the Archons and the Archivists’ Toolkits of the world (and any and all other fabulous open-source tools people might be building to support archivists’ interactions with computers), but the manifestation of my dream would be the answer for those who want to digitize their archival collection and provide access easily without being forced to invent a new wheel along the way.

If you read my GIS daydreams post, then you won’t be surprised to know that I would want GIS incorporated from the start so that records could be tied into a single map of the world. The relationships among records related to the same geographic location could be found quickly and easily.

Somehow I feel a connection in these ideas to the work that the Internet Archive is doing with Archive-IT.org. In that case, producers of websites want them archived. They don’t want to figure out how to make that happen. They don’t want to figure out how to make sure that they have enough copies in enough far flung locations with enough bandwidth to support access – they just want it to work. They would rather focus on creating the content they want Archive-It to keep safe and accessible. The first line on Archive-It’s website says it beautifully: “Internet Archive’s new subscription service, Archive-It, allows institutions to build, manage and search their own web archive through a user friendly web application, without requiring any technical expertise.”

So, the tag line for my new dream service would be “DigiCollection’s new subscription service, Digitize-It, allows institutions to upload, manage and search their own digitized collections through a user friendly web application, without requiring any technical expertise.”

GIS, Access, Archives and Daydreams

September 13, 2006 9 Comments

Today in my Information Structure class, our topic was Entity Relationship Modeling. While this is a technique that I have used frequently over the many years I have been designing Oracle databases, it was interesting to see a slightly different spin on the ideas. The second half of class was an exercise to take a stab (as a class) at coming up with a preliminary data model for a mythical genealogical database system.

While deciding if we should model PLACE as an entity, a woman in our class who is a genealogy specialist told us that only one database she has ever worked with tries to do any validation of location – but that it is virtually impossible due to the scale of the problem. Since the borders and names of places on earth have changed so rapidly over time, and often with little remaining documentation, it is hard to correlate place names from archival records with fixed locations on the planet. Anyone who has waded through the fabulous ship records on the Ellis Island website hunting for information about their grandparents or great-grandparents has struggled with trying to understand how the place names on those records relate to the physical world we live in.

So – now to my daydream. Imagine if we could somehow work towards a consolidated GIS database that included place names and boundary information throughout history. Each GIS layer would relate to specific years or eras in time. Imagine if you could connect any set of archival records that contained location data to this GIS database and not only visualize the records via a map – but visualize the records with the ability to change the layers so you could see how the boundaries and place names changed. And view the relationship between records that have different place names on them from different eras – but are actually from the same location.

I poked around to see what people are already doing – and found all of this:

Digital Earth and it’s more recently updated counterpart Geospatial Applications and Interoperability (GAI), a working group of the Federal Geographic Data Committee that seems to now exist within the National Geospatial Program Office of the USGS.
GOS – Geospatial One Stop which led me to the fabulous Lewis and Clark GeoSystems
The National Atlas (also found off GOS) that includes a special History Chapter (that starts to head in the direction I am imagining I think)
GEOnet Names Server (GNS) that provides access to the National Geospatial-Intelligence Agency’s (NGA) and the U.S. Board on Geographic Names‘ (US BGN) database of foreign geographic feature names (take this and add in a history element, and we are getting even warmer)
GIS for the Humanities – funded by a 2003 NEH Focus Grant, this project’s goal is “designed to create, and train faculty in the use of, mapping modules intended to enhance humanities courses”. I included this one because it gives a slice of the kind of teaching my dream GIS database could fuel.
And two clearinghouses for information: the US National Geospatial Data Clearinghouse and the United Nations Environment Programme / Global Resource Information Database (UNEP/GRID) Spatial Data Clearinghouse

I know it is a daydream – but I believe in my heart of hearts that it will exist someday as computing power increases, the price of storing data decreases and more data sources converge. I do forsee another issue related to the challenges presented by different versions of borders and place names from the same time period – but there are ways to address that too. It could happen – believe with me!

Ideas about Zotero and Digitized Archives

September 9, 2006 9 Comments

Dan Cohen posted recently about the soon to be available, open-source, firefox plugin, research support software named Zotero . Looking at the quick start guide, I immediately spotted the icon to “add a new collection folder”. As the “archivist-in-training” that I am, my reaction now to the word “collection” is different than it would have been a year ago. Though I strongly suspect it will not be the case (at least not in the first released version) I immediately was daydreaming of browsing a digitized collection online – clicking the “add a new collection folder” icon – and ending up with a copy of the entire collection of records for examination and comparison later.

Of course this would be most useful for the historian digging through and analyzing archival records if Zotero was able to pull down metadata beyond that of a standard citation and retain any hierarchical information or relationships among the records.

Now on Dead Reckoning‘s post on Zotero RDFa is mentioned. I don’t know anything about RDFa beyond what I have read in the last few hours, so it is not clear to me how complicated the metadata can be – perhaps it can support a full digital object XML record of some kind. So maybe the trick isn’t so much getting Zotero to do things it wasn’t designed to do – but rather the slow migration of sites to using the software packages and standards listed here.

I don’t want anyone to think that I am not excited about Zotero and all the neat things it is likely to do. I suspect I will rapidly become a frequent Zotero user verging on a zealot – but it is fun to daydream. I think it is most fun to daydream now, before I start using it and get lost in all the great stuff it CAN do. I definitely will post more after I get a chance to take it for a spin in early October.

Introduction

July 19, 2006 2 Comments

My name is Jeanne. I am a graduate student in an Archives program pursuing my MLS (aka, Master of Library Science). I have enjoyed all my classes to date (3) and I love the ideas that those classes have generated. Sometimes I leave class with just as many personal ideas scrawled in the margins of my notebook as class notes written on the main page. I am especially intrigued by the ways in which concepts from different fields intersect. How do ideas from my current field of software development and database design illuminate new issues, questions and concepts in the realm of archival studies?

I am particularly interested in topics related to audio and visual archival materials, digitization, description, meta-data, and retention of context in digitized collections.

So, here we are – you reading and I writing. I hope to make you think about things in a way you may not have before. I hope if you have been down the mental road I am taking and you have noticed something that I have missed, you might take a moment to point it out to me.

Please – ask questions and let me know your thoughts.

Category: what if

My New Daydream: A Hosting Service for Digitized Collections

GIS, Access, Archives and Daydreams

Ideas about Zotero and Digitized Archives

Introduction