<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Spellbound Blog &#187; internet archiving</title>
	<atom:link href="http://www.spellboundblog.com/category/internet-archiving/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.spellboundblog.com</link>
	<description>Archives, Digital Humanities, Cultural Heritage, Technology</description>
	<lastBuildDate>Mon, 06 Feb 2012 14:49:35 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Encouraging Participation in the Census</title>
		<link>http://www.spellboundblog.com/2010/03/05/encouraging-census-participation/</link>
		<comments>http://www.spellboundblog.com/2010/03/05/encouraging-census-participation/#comments</comments>
		<pubDate>Fri, 05 Mar 2010 08:42:40 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[future-proofing]]></category>
		<category><![CDATA[historical research]]></category>
		<category><![CDATA[internet archiving]]></category>
		<category><![CDATA[photography]]></category>
		<category><![CDATA[privacy]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=685</guid>
		<description><![CDATA[While smart folks over at NARA are thinking about the preservation strategy for digitized 2010 census forms, I got inspired to take a look at what we have preserved from past censuses. In specific, I wanted to look at posters, photos and videos that give us a glimpse into how we encouraged and documented the [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/03/05/encouraging-census-participation/">Encouraging Participation in the Census</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a title="LOC: 1940 Census Poster" href="http://hdl.loc.gov/loc.pnp/cph.3g01801 "><img class="alignright size-full wp-image-717" title="1940-census-poster" src="http://www.spellboundblog.com/wp-content/uploads/2009/09/1940-census-poster.jpg" alt="1940-census-poster" width="300" height="426" /></a>While smart folks over at NARA are thinking about the <a title="NARAtions: NARA and the 2010 Census" href="http://blogs.archives.gov/online-public-access/?p=1192">preservation strategy for digitized 2010 census forms</a>, I got inspired to take a look at what we have preserved from past censuses. In specific, I wanted to look at posters, photos and videos that give us a glimpse into how we encouraged and documented the activity of participation in the past.</p>
<p>There is a dedicated <a title="Census History" href="http://www.census.gov/history/">Census History</a> area on the <a title="Census Website" href="http://www.census.gov/">Census website</a>, as well as a section of the 2010 website called <a title="The Big Count Archive" href="http://2010.census.gov/mediacenter/the-big-count/index.php?v,n13">The Big Count Archive</a>. While I like the wide range of <a title="2010 Posters" href="http://2010.census.gov/partners/materials/posters-materials.php">2010 Census Posters</a> &#8211; the 1940 census poster shown here (thank you Library of Congress) is just so striking.</p>
<p>I also loved the videos I found, especially when I realized that they were all available on YouTube &#8211; uploaded by a user named <a title="YouTube: JasonGCensus" href="http://www.youtube.com/user/JasonGCensus">JasonGCensus</a>. I am not clear on the relationship between JasonGCensus and the official <a title="YouTube: US Census Beaureau Channel" href="http://www.youtube.com/user/uscensusbureau">U.S. Census Bureau&#8217;s Channel</a> (which seems focused on 2010 Census content), but there are some real gems posted there.</p>
<p>For example, in the <a title="1970 Census PSA" href="http://www.youtube.com/user/JasonGCensus#p/u/19/Fb8s3iDWWxo">1970 Census PSA</a> shown below we learn about the privacy of our census data: &#8220;Our separate identities will be lost in the process which is concerned only with what we say, not who said it&#8221;. We are shown technology details &#8211; complete with old school beeping and blooping computer sounds. (NOTE: this video is also <a title="Census.gov: 1970 PSA Video" href="http://2010.census.gov/mediacenter/the-big-count/1970.php">available on Census.gov</a>, but I saw no way to embed that video here &#8211; hence my cheer at finding the same video on YouTube)</p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="480" height="385" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.youtube.com/v/Fb8s3iDWWxo&amp;hl=en_US&amp;fs=1&amp;" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="480" height="385" src="http://www.youtube.com/v/Fb8s3iDWWxo&amp;hl=en_US&amp;fs=1&amp;" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p>For the 1960 census, a <a title="1960 Census PSA" href="http://www.youtube.com/user/JasonGCensus#p/u/20/HHKAQF4kNK0">PSA</a> explains the new <a title="FOSDIC" href="http://www.census.gov/history/www/innovations/technology/fosdic.html">FOSDIC</a> technology which removed the need for punch-cards. With the tagline &#8216;Operation Rollcall, USA&#8217;, the ad presents our part in &#8220;this enterprise&#8221; as cooperation with the enumerators. In the <a title="1980 Census PSA" href="http://www.youtube.com/user/JasonGCensus#p/u/17/fzmf3XSq-lM">1980 PSA </a>the tag line is &#8216;Answer the Census: We&#8217;re counting on you!&#8217; and stresses that it is kept confidential and is used to provide services to communities. By the time you get to the 1990 and 2000 PSAs we see more stress on the benefits to communities that fill out the census and less stress on how the census is actually recorded.</p>
<p><a title="Women taking census" href="http://hdl.loc.gov/loc.pnp/cph.3b23345 "><img class="alignright size-full wp-image-887" title="LOC: Woman taking census of another woman at door of house" src="http://www.spellboundblog.com/wp-content/uploads/2010/03/3b23345r.jpg" alt="" width="332" height="409" /></a>I also found some lovely <a title="LOC Census Images" href="http://lcweb2.loc.gov/cgi-bin/query/f?fsaall,brum,detr,swann,look,gottscho,pan,horyd,genthe,var,cai,cd,hh,yan,lomax,ils,prok,brhc,nclc,matpc,iucpub,tgmi,lamb,hec,krb:0:./temp/~pp_SURL:">census images</a> in the <a title="LOC Prints and Photographs" href="http://www.loc.gov/rr/print/">Library of Congress Prints and Photographs</a> catalog including the image shown here and:</p>
<ul>
<li><a title="1870 Census Wood Engraving" href="http://hdl.loc.gov/loc.pnp/cph.3b39850 ">an 1870 Wood Engraving</a></li>
<li><a title="1890 Census Cartoon" href="http://hdl.loc.gov/loc.pnp/cph.3c00311 ">an 1890 Cartoon</a></li>
<li><a title="1910 Postcard" href="http://hdl.loc.gov/loc.pnp/cph.3c24436 ">a 1910 Postcard</a></li>
</ul>
<p>Exploring the area of Census.gov dedicated to the <a title="2010 Census" href="http://2010.census.gov/2010census/">2010 census</a> made me wonder what was available online for the 2000 census.</p>
<p>Wayback Machine to the rescue! They have what appears to be a fairly deep crawl of the <a title="Internet Archive: 2000 Census.gov website" href="http://web.archive.org/web/20000303195350/www.census.gov/dmd/www/2khome.htm">2000 Census.gov site</a> dating from March of 2000. For example &#8211; the <a title="2000 Census Posters" href="http://web.archive.org/web/20000306114555/www.census.gov/dmd/www/advposters.html">posters section</a> seems to include all the images and PDFs of the originals. I even found functional Quicktime videos in the <a title="2000 Census Video Zone" href="http://web.archive.org/web/20000303133738/www.census.gov/pubinfo/www/video/index.html">Video Zone</a>, like this one: <a title="Video: How America Knows What America Needs" href="http://web.archive.org/web/20000510161557/www.census.gov/pubinfo/www/video/hakwan.html">How America Knows What America Needs</a>.</p>
<p>The ten year interval makes for a nice way to get a sense of the country from the PR perspective. What did the Census Bureau think was the right way to appeal to the American public? Were we more intrigued by the latest technology or worried about our privacy? Did they need to communicate what the census is used for? Or was it okay to simply express it as an American&#8217;s duty? I appreciate the ease with which I can find and share the resources above. Great fun.</p>
<p>And for those of you in the United States, please consider this my personal encouragement to fill out your census forms!</p>
<p><em>Update:</em> The WashingtonPost has an interesting a<a title="WashingtonPost: 'Snapshot of America': These are Census Bureau ads? Go figure." href="http://www.washingtonpost.com/wp-dyn/content/article/2010/03/05/AR2010030504100.html">rticle about the &#8216;Snapshot of America&#8217; series of promotional videos</a> for the 2010 census. Definitely an interesting contrast to the videos I reviewed for this post.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/03/05/encouraging-census-participation/">Encouraging Participation in the Census</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2010/03/05/encouraging-census-participation/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Leveraging Google Reader&#8217;s Page Change Tracking for Web Page Preservation</title>
		<link>http://www.spellboundblog.com/2010/01/26/leveraging-google-readers-page-change-tracking-for-web-page-preservation/</link>
		<comments>http://www.spellboundblog.com/2010/01/26/leveraging-google-readers-page-change-tracking-for-web-page-preservation/#comments</comments>
		<pubDate>Tue, 26 Jan 2010 06:49:21 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[at risk records]]></category>
		<category><![CDATA[future-proofing]]></category>
		<category><![CDATA[internet archiving]]></category>
		<category><![CDATA[learning technology]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=826</guid>
		<description><![CDATA[The Official Google Reader Blog recently announced a new feature that will let users watch any page for updates. The way this works is that you add individual URLs to your Google Reader account. Just as with regular RSS feeds, when an update is detected &#8211; a new entry is added to that subscription. My [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/01/26/leveraging-google-readers-page-change-tracking-for-web-page-preservation/">Leveraging Google Reader&#8217;s Page Change Tracking for Web Page Preservation</a></p>
]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a title="Cute RSS Icons" href="http://comingupforair.net/2008/01/rss-icons/"><img class="size-full wp-image-827  aligncenter" title="rss-icons" src="http://www.spellboundblog.com/wp-content/uploads/2010/01/rss-icons.jpg" alt="" width="418" height="373" /></a></p>
<p><a title="Cute RSS Icons" href="http://comingupforair.net/2008/01/rss-icons/"></a>The <a title="Official Google Reader Blog" href="http://googlereader.blogspot.com/">Official Google Reader Blog</a> recently announced a new feature that will let users <a title="Official Google Reader Blog: Follow Changes to Any Website" href="http://googlereader.blogspot.com/2010/01/follow-changes-to-any-website.html">watch any page for updates</a>. The way this works is that you add individual URLs to your Google Reader account. Just as with regular RSS feeds, when an update is detected &#8211; a new entry is added to that subscription.</p>
<p>My thinking is that this could be a really useful tool for archivists charged with preserving websites that change gradually over time, especially those fairly static sites that change infrequently with little or no notice of upcoming changes. If a web page was archived and then added to a dedicated Google Reader account, the archivist could scan their list of watch pages daily or weekly. Changes could then trigger the creation of a fresh snapshot of the site.</p>
<p>I will admit that there have been services out there for a while that do something similar to what Google has just rolled out. I personally have used <a title="Dapper" href="http://www.dapper.net">Dapper.net</a> to take a standard web page and generate an RSS feed based on updates to the page (sound familiar?). One Dapper.net feed that I created and follow is for the <a title="IFRC News Archive" href="http://www.ifrc.org/news/archives.asp">news archive page for the International Red Cross</a> and can be <a title="Dapper: IFRC News RSS" href="http://www.dapper.net/services/IFRC_News">found here</a>. What is funny is that now they actually have <a title="IFRC News RSS Feed" href="http://www.ifrc.org/news/rss.asp">an official RSS feed for their news</a> that includes exactly what my Dapper.net feed harvested off their news archive page &#8211; but when I built that Dapper feed there was no other way for me to watch for those news updates.</p>
<p>There are lots of different tools out there that aim to archive websites. <a title="Archive-It.org" href="http://www.archive-it.org/">Archive-It</a> is a subscription based service run by <a title="Internet Archive" href="http://www.archive.org">Internet Archive</a> that targets institutions and will archive sites on demand or on a regular schedule. Internet Archive also has an open source crawler called <a title="Heritrix" href="http://crawler.archive.org/index.html">Heritrix</a> for those who are comfortable dealing with the code. Other institutions are building their own software to tackle this too. <a title="Harvard University" href="http://www.harvard.edu/">Harvard University</a> has their own <a title="Harvard: WAX" href="http://wax.lib.harvard.edu/collections/home.do">Web Archive Collection Service (WAX)</a>. The <a title="LiWA: Living Web Archives" href="http://liwa-project.eu/index.php/about/">LiWA (Living Web Archives) Project</a> is based in Germany and aims to &#8220;extend the current state of the art and develop the next generation of Web content capture, preservation, analysis, and enrichment services to improve fidelity, coherence, and interpretability of web archives.&#8221; One could even use something as simple as <a title="PDF my URL" href="http://pdfmyurl.com/">PDFmyURL.com</a> &#8211; an online service that turns any URL into a PDF (be sure to play with the advanced options to make sure you get a wide enough snapshot). I know there are many more possibilities &#8211; these just scratch the surface.</p>
<p>What I like about my idea is that it isn&#8217;t meant to replace these services but rather work in tandem with them. The Internet Archive does an amazing job crawling and archiving many web pages &#8211; but they can&#8217;t archive everything and their crawl frequency may not match up with real world updates to a website. This approach certainly wouldn&#8217;t scale well for huge websites for which you would need to watch for changes on many pages. I am picturing this technique as being useful for small organizations or individuals who just need to make sure that a county government website makeover or a community organization&#8217;s website update doesn&#8217;t get lost in the shuffle. I like the idea of finding clever ways to leverage free services and tools to support those who want to protect a particular niche of websites from being lost.</p>
<p><em>Image Credit:</em> The <a title="RSS Icons" href="http://comingupforair.net/2008/01/rss-icons/">RSS themed image</a> above is by <a title="Matt Forsythe" href="http://comingupforair.net/about/">Matt Forsythe</a>.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/01/26/leveraging-google-readers-page-change-tracking-for-web-page-preservation/">Leveraging Google Reader&#8217;s Page Change Tracking for Web Page Preservation</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2010/01/26/leveraging-google-readers-page-change-tracking-for-web-page-preservation/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>After The Games Are Over: Olympic Archival Records</title>
		<link>http://www.spellboundblog.com/2008/08/25/olympic-archival-records/</link>
		<comments>http://www.spellboundblog.com/2008/08/25/olympic-archival-records/#comments</comments>
		<pubDate>Mon, 25 Aug 2008 04:24:54 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[appraisal]]></category>
		<category><![CDATA[at risk records]]></category>
		<category><![CDATA[electronic records]]></category>
		<category><![CDATA[historical research]]></category>
		<category><![CDATA[internet archiving]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2008/08/25/olympic-archival-records/</guid>
		<description><![CDATA[What does an archivist ponder after she turns off the Olympics? What happens to all the records of the Olympics after the closing ceremonies? Who decides what to keep? Not knowing any Olympic Archivists personally, I took to the web to see what I could find. Olympics.org uses the tag line &#8220;Official Website of the [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2008/08/25/olympic-archival-records/">After The Games Are Over: Olympic Archival Records</a></p>
]]></description>
			<content:encoded><![CDATA[<p>What does an archivist ponder after she turns off the Olympics? What happens to all the records of the Olympics after the closing ceremonies? Who decides what to keep? Not knowing any Olympic Archivists personally, I took to the web to see what I could find.</p>
<p><a href="http://www.olympic.org" title="Olympics.org">Olympics.org</a> uses the tag line &#8220;Official Website of the Olympic Movement&#8221; and include information about <a href="http://www.olympic.org/uk/passion/studies/archives/index_uk.asp" title="The International Olympic Committee’s Historical Archives ">The International Olympic Committee’s Historical Archives</a>. The even have an <a href="http://www.olympic.org/uk/athletes/results/search_r_uk.asp" title="Olympic.org: Olympic Medals Database">Olympic Medals Database</a> with all the results from all the games.</p>
<p>The most detailed list of Olympics archives that I could find is the <a href="http://olympicstudies.uab.es/directory/main.asp" title=" Olympic Studies International Directory">Olympic Studies International Directory</a> listing of <a href="http://olympicstudies.uab.es/directory/area2_institution.asp?IDCLASSIFICATION=9" title="Archives &amp; Olympic Heritage">Archives &amp; Olympic Heritage</a> sites. It is from this page that I found my way to records from the <a href="http://www.records.nsw.gov.au/archives/sydney_olympic_park_authority_3119.asp" title="Sydney Olympic Park Authority Records">Sydney Olympic Park Authority</a>.</p>
<p>The <a href="http://www.otab.com" title="Olympic Television Archive Bureau">Olympic Television Archive Bureau</a> (OTAB) website explains that this UK based company &#8220;has over 30,000 hours of the most sensational sports footage ever seen, uniquely available in one library&#8221;  and aims to provide &#8220;prompt fulfilment of your Olympic footage requirements&#8221;.</p>
<p>Then I thought to dig into the Internet Archive. What a great treasure trove for all sorts of interesting Olympic bits!</p>
<p>First I found a <a href="http://www.archive.org/details/1964-10-26_Olympics_End" title="Internet Archive: 1964 Olympics">Universal Newsreel from the 1964 Olympics in Tokyo</a> (embedded below).<br />
<embed src="http://www.archive.org/flow/FlowPlayerLight.swf?config=%7Bembedded%3Atrue%2CshowFullScreenButton%3Atrue%2CshowMuteVolumeButton%3Atrue%2CshowMenu%3Atrue%2CautoBuffering%3Atrue%2CautoPlay%3Afalse%2CinitialScale%3A%27fit%27%2CmenuItems%3A%5Bfalse%2Cfalse%2Cfalse%2Cfalse%2Ctrue%2Ctrue%2Cfalse%5D%2CusePlayOverlay%3Afalse%2CshowPlayListButtons%3Atrue%2CplayList%3A%5B%7Burl%3A%271964%2D10%2D26%5FOlympics%5FEnd%2F1964%2D10%2D26%5FOlympics%5FEnd%2Eflv%27%7D%5D%2CcontrolBarGloss%3A%27high%27%2CshowVolumeSlider%3Atrue%2CbaseURL%3A%27http%3A%2F%2Fwww%2Earchive%2Eorg%2Fdownload%2F%27%2Cloop%3Afalse%2CcontrolBarBackgroundColor%3A%270x000000%27%7D" scale="noscale" bgcolor="111111" type="application/x-shockwave-flash" allowfullscreen="true" allowscriptaccess="always" allownetworking="all" pluginspage="http://www.macromedia.com/go/getflashplayer" width="320" height="268"></embed></p>
<p>I also found a 2002 Computer Chronicles episode <a href="http://www.archive.org/details/Olympics2002" title="Internet Archive: Computer Technology and the Olympics">Computer Technology and the Olympics</a> which explores the &#8220;high-tech innovations that ran the 2002 Winter Olympic Games&#8221; (embedded below).</p>
<p><embed src="http://www.archive.org/flow/FlowPlayerLight.swf?config=%7Bembedded%3Atrue%2CshowFullScreenButton%3Atrue%2CshowMuteVolumeButton%3Atrue%2CshowMenu%3Atrue%2CautoBuffering%3Atrue%2CautoPlay%3Afalse%2CinitialScale%3A%27fit%27%2CmenuItems%3A%5Bfalse%2Cfalse%2Cfalse%2Cfalse%2Ctrue%2Ctrue%2Cfalse%5D%2CusePlayOverlay%3Afalse%2CshowPlayListButtons%3Atrue%2CplayList%3A%5B%7Burl%3A%27Olympics2002%2FOlympics2002%2Eflv%27%7D%2C%7Burl%3A%27Olympics2002%2FOlympics2002%5Fedit%2Eflv%27%7D%5D%2CcontrolBarGloss%3A%27high%27%2CshowVolumeSlider%3Atrue%2CbaseURL%3A%27http%3A%2F%2Fwww%2Earchive%2Eorg%2Fdownload%2F%27%2Cloop%3Afalse%2CcontrolBarBackgroundColor%3A%270x000000%27%7D" scale="noscale" bgcolor="111111" type="application/x-shockwave-flash" allowfullscreen="true" allowscriptaccess="always" allownetworking="all" pluginspage="http://www.macromedia.com/go/getflashplayer" width="320" height="268"></embed></p>
<p>Other fun finds included a digitized copy of a book titled <a href="http://www.archive.org/details/olympicgamesstoc00sullrich" title="Internet Archive: Text: The Olympic Games, Stockholm, 1912">The Olympic games, Stockholm, 1912</a> and the <a href="http://web.archive.org/web/20061212201558/http://en.beijing2008.cn/" title="Wayback Machine: Beijing 2008 website">oldest snapshot of the Beijing 2008 website</a> (from December of 2006). Seeing the 2008 Summer Games pages in the archive made me curious. I found the old site of the <a href="http://www.athens2004.com/" title="official Athens summer games from 2004">official Athens summer games from 2004</a> which kindly states: &#8220;The site is no longer available, please visit http://www.olympic.org or http://en.beijing2008.com/&#8221;. The Internet Archive has a bit more than that on the <a href="http://web.archive.org/web/*/http://www.athens2004.com/" title="Wayback Machine: Athens2004.com">athens2004.com archive page</a> &#8211; though some clicking through definitely made it clear that not all of the site was crawled. Lucky for us we can still see the <a href="http://web.archive.org/web/20051105032948/www.athens2004.com/athens2004/page/ecards?lang=en&amp;cid=a828470429149f00VgnVCMServer28130b0aRCRD" title="Wayback Machine: Athens 2004 Olympics E-Cards">Athens 2004 Olympics E-Cards</a> you could send!</p>
<p>Then I turned to explore <a href="http://www.archives.gov/" title="NARA">NARA</a>&#8216;s assorted web resources. I found a few photos on the <a href="http://www.digitalvaults.org" title="Digital Vaults">Digital Vaults</a> website (search on the keyword Olympics).  A search in the <a href="http://www.archives.gov/research/arc/" title="NARA: ARC">Archival Research Catalog</a> (ARC) generates a long list &#8211; including footage of the US National Rifle Team in the 1960 Olympics in Italy.</p>
<p>My favorite items from NARA&#8217;s collections are in the <a href="http://aad.archives.gov/aad/" title="NARA: AAD">Access to Archival Databases</a> (AAD). First I found this telegram from the American Embassy in Ottawa to the Secretary of State in Washington DC (Document ID # 1975OTTAWA02204) sent in June 1975:</p>
<blockquote><p> 1. EMBASSY APPRECIATES DEPARTMENT&#8217;S EFFORTS TO ASSIST CONGEN IN CARING FOR VIPS WHO CERTAINLY WILL ARRIVE FOR 1976 OLYMPIC GAMES WITHOUT TICKETS OR LODGING. HAS DEPARTMENT EXPLORED POSSIBILITY OF OBTAINING 4,000 TICKETS ON CONSIGNMENT BASIS FROM MONTGOMERY WARD, WITH UNDERSTANDING THAT, AS TICKETS ARE SOLD, PROCEEDS WILL BE REMITTED? PERHAPS SUCH AN ARRANGEMENT COULD BE WORKED OUT WITH FURTHER UNDERSTANDING THAT UNSOLD TICKETS BE RETURNED TO MONTGOMERY WARD AT SOME SPECIFIED DATE PRIOR TO BEGINNING OF EVENTS.</p>
<p>2. EMBASSY WILL FURNISH AMOUNT REQUIRED TO RESERVE SIX DOUBLE ROOMS FOR PERIOD OF GAMES. AT PRESENT HOTEL OWNERS AND OLYMPIC OFFICIALS ARE IN DISAGREEMENT AS TO AMOUNTS THAT MAY BE CHARGED FOR ROOMS DURING OLYMPIC PERIOD. NEGOTIATIONS ARE CURRENTLY BEING CARRIED OUT AND AS SOON AS ROOM RATES HAVE BEEN ESTABLISHED, QUEEN ELIZABETH HOTEL MANAGER WILL ADVISE US OF THEIR REQUIREMENTS TO RESERVE THE SIX DOUBLE ROOMS.</p></blockquote>
<p>Immediately beneath that one, I found this telegram from October 1975 (Document Number 1975STATE258427):</p>
<blockquote><p>SUBJECT:INVITATION TO PRESIDENT FORD AND SECRETARY<br />
KISSINGER TO ATTEND OLYMPIC GAMES IN AUSTRIA,<br />
FEBRUARY 4-15, 1976</p>
<p>THE EMBASSY IS REQUESTED TO INFORM THE GOA THAT MUCH TO THE PRESIDENT&#8217;S AND THE SECRETARY&#8217;S REGRET, THE DEMANDS ON THEIR SCHEDULES DURING THAT PERIOD WILL NOT MAKE IT POSSIBLE FOR THEM TO ATTEND THE WINTER GAMES. KISSINGER</p></blockquote>
<p>There are definitely a lot of moving parts to Olympic Archival Records. So many nations participate.  New host countries with the option to handle records however they see fit. I explored this whole question two years ago and came up against the fact that control over the archival records produced by each Olympics was really in the hands of the hosting committee and their country. A quick glance down the list of  <a href="http://olympicstudies.uab.es/directory/area2_institution.asp?IDCLASSIFICATION=9" title="Archives &amp; Olympic Heritage">Archives &amp; Olympic Heritage</a> sites I mentioned above gives you an idea of all the different corners of the world in which one can find Olympic Archival Records in both government and independent repositories. Given that clearly not all Olympic Games are represented in that list, it makes me wonder what we will see on this front from China now that the closing ceremony is complete.</p>
<p>I also suspect that with each Olympic Games we increase the complexity of the electronic records being generated. Would it be worthwhile to create an online collection for each games &#8211; as has been done for the <a href="http://hurricanearchive.org/" title="Hurricane Digital Memory Bank">Hurricane Digital Memory Bank</a> or <a href="http://911digitalarchive.org/" title="The September 11 Digital Archive">The September 11 Digital Archive</a>, but extend it to include access to Olympic electronic records data sets? The shear quantity of information is likely overwhelming &#8211; but I suspect there is a lot of interesting information that people would love to examine.</p>
<p><em><strong>Update:</strong> For those of you (like me) who wondered what Montgomery Ward had to do with Olympic Tickets &#8211; take a look at <a href="http://http://vault.sportsillustrated.cnn.com/vault/article/magazine/MAG1089745/index.htm" title="SI Vault: Tickets For The '76 Olympics Go On Sale Shortly At Montgomery Ward">Tickets For The &#8217;76 Olympics Go On Sale Shortly At Montgomery Ward</a> over in the <a href="http://sportsillustrated.cnn.com/" title="Sports Illustrated">Sports Illustrated</a> online <a href="http://vault.sportsillustrated.cnn.com/" title="SI Vault">SI Vault</a>. Sports Illustrated&#8217;s Vault is definitely another interesting source of information about the Olympic Games. If my post above has made you nostalgic for Olympics gone by &#8211; definitely take a look at the current Summer Games feature on their front page. I couldn&#8217;t figure out a permanent link to this feature, but </em><em> if I ever do </em><em>I will update this post later. </em></p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2008/08/25/olympic-archival-records/">After The Games Are Over: Olympic Archival Records</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2008/08/25/olympic-archival-records/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>New Skills for a Digital Era: Official Proceedings Now Available</title>
		<link>http://www.spellboundblog.com/2008/02/26/new-skills-for-a-digital-era-official-proceedings-now-available/</link>
		<comments>http://www.spellboundblog.com/2008/02/26/new-skills-for-a-digital-era-official-proceedings-now-available/#comments</comments>
		<pubDate>Wed, 27 Feb 2008 03:17:57 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[archival community]]></category>
		<category><![CDATA[born digital records]]></category>
		<category><![CDATA[electronic records]]></category>
		<category><![CDATA[future-proofing]]></category>
		<category><![CDATA[internet archiving]]></category>
		<category><![CDATA[learning technology]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2008/02/26/new-skills-for-a-digital-era-official-proceedings-now-available/</guid>
		<description><![CDATA[From May 31st through June 2nd of 2006, The National Archives, the Arizona State Library and Archives, and the Society of American Archivists hosted a colloquium to consider the question &#8220;What are the practical, technical skills that all library and records professionals must have to work with e-books, electronic records, and other digital materials?&#8221;. The [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2008/02/26/new-skills-for-a-digital-era-official-proceedings-now-available/">New Skills for a Digital Era: Official Proceedings Now Available</a></p>
]]></description>
			<content:encoded><![CDATA[<p><img src="http://www.spellboundblog.com/wp-content/uploads/2008/02/newskills_logo.gif" alt="New Skills for a Digital Era Logo" align="right" />From May 31st through June 2nd of 2006, <a href="http://archives.gov" title="National Archives">The National Archives</a>, the <a href="http://www.lib.az.us/" title="Arizona State Library and Archives">Arizona State Library and Archives</a>, and the <a href="http://archivists.org/" title="Society of American Archivists">Society of American Archivists</a> hosted a colloquium to consider the question &#8220;What are the practical, technical skills that all library and records professionals must have to work with e-books, electronic records, and other digital materials?&#8221;. The website for the <a href="http://rpm.lib.az.us/newskills/index.asp" title="New Skills for a Digital Era">New Skills for a Digital Era</a> colloquium already includes <a href="http://rpm.lib.az.us/newskills/CaseStudies.asp" title="New Skills for a Digital Era: Schedule and Case Study Links">links to the eleven case studies</a> considered over the course of the three days of discussion as well as a <a href="http://rpm.lib.az.us/newskills/readings.asp" title="New Skills for a Digital Era: Readings">list of additional suggested readings</a>. As mentioned over on The Ten Thousand Year Blog, <a href="http://www.davidmattison.ca/wordpress/?p=2118" title="New Skills for a Digital Era colloquium proceedings preprint">the pre-print of the proceedings has been available since August, 2007</a>.</p>
<p>As announced in SAA&#8217;s online newsletter, the <a href="http://www.archivists.org/news/NewSkillsForADigitalEra.pdf" title="Official Proceedings of the New Skills for a Digital Era Colloquium">Official Proceedings of the New Skills for a Digital Era Colloquium</a>, edited by <a href="http://rpm.lib.az.us/" title="Richard Pearce-Moses">Richard Pearce-Moses</a> and Susan E. Davis, is now available for free download. Published under <a href="http://creativecommons.org/licenses/by-nc/3.0/us/" title="Creative Commons Attribution 3.0">Creative Commons Attribution</a>, this document is 143 pages long and includes all the original case studies. I have a lot of reading to do!</p>
<p>The meat of the proceedings consists of a 32 page &#8216;Knowledge and Skills Inventory&#8217; and a page and a half of reflections &#8211; both co-authored by Richard Pearce-Moses and Susan E. Davis. The Keynote Address by <a href="http://www.si.umich.edu/people/faculty-detail.htm?sid=48" title="Margaret Hedstrom">Margaret Hedstrom</a> titled &#8216;Are We Ready for New Skills Yet?&#8217; is also included.</p>
<p>I am very pleased with how much access has been provided to these materials. These topics are clearly of interest to many beyond the 60 individuals who were able to take part in the original gathering. As an archival studies student it has often been a great source of frustration that so few of the archives related conferences publish proceedings of any kind. It is part of what has driven me to attempt to assemble exhaustive session summaries for those sessions I have personally attended at the past two SAA Annual meetings (see <a href="http://www.spellboundblog.com/category/saa2006/" title="SAA2006">SAA2006</a> and <a href="http://www.spellboundblog.com/category/saa2007/" title="SAA2007">SAA2007</a>). I think that the <a href="http://www.ibiblio.org/saa2007/index.php/Main_Page" title="Unofficial SAA2007 Conference Wiki">Unofficial Conference Wiki for SAA2007</a> was also a big step in the right direction and I hope it will continue to evolve and improve for the upcoming SAA2008 annual meeting in San Francisco.</p>
<p>The course I elected to take this term is dedicated to studying <a href="http://en.wikipedia.org/wiki/Community_of_practice" title="Wikipedia: Community of Practice">Communities of Practice</a>. This announcement about the New Skills for a Digital Era&#8217;s proceedings has me thinking about the community of practice that seems to currently be taking form across the library, archives and records management communities. I will share more thoughts on this as I sort through them myself.</p>
<p>Finally, a question for anyone reading this post who attended the colloquium: Are you still discussing the case studies with others from that session two years ago? If not, do you wish you were?</p>
<p><em>Image Credit: The image at the top of this post is from the <a href="http://rpm.lib.az.us/NewSkills/" title="New Skills for a Digital Era">New Skills for a Digital Era</a> website.</em></p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2008/02/26/new-skills-for-a-digital-era-official-proceedings-now-available/">New Skills for a Digital Era: Official Proceedings Now Available</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2008/02/26/new-skills-for-a-digital-era-official-proceedings-now-available/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Blog Action Day: A Look At Earth Day as Archived Online</title>
		<link>http://www.spellboundblog.com/2007/10/15/blog-action-day-a-look-at-earth-day-as-archived-online/</link>
		<comments>http://www.spellboundblog.com/2007/10/15/blog-action-day-a-look-at-earth-day-as-archived-online/#comments</comments>
		<pubDate>Tue, 16 Oct 2007 02:23:13 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[Blog Action Day]]></category>
		<category><![CDATA[internet archiving]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2007/10/15/blog-action-day-a-look-at-earth-day-as-archived-online/</guid>
		<description><![CDATA[In honor of this year&#8217;s Blog Action Day theme of discussing the environment, I decided to see what records the Internet had available about the history of Earth Day. I started by simply Googling Earth Day. In a new browser window I opened the Internet Archive&#8217;s Wayback Machine. These were to be my two main [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2007/10/15/blog-action-day-a-look-at-earth-day-as-archived-online/">Blog Action Day: A Look At Earth Day as Archived Online</a></p>
]]></description>
			<content:encoded><![CDATA[<p>In honor of this year&#8217;s <a href="http://blogactionday.org/" title="Blog Action Day">Blog Action Day</a> theme of discussing the environment, I decided to see what records the Internet had available about the history of Earth Day.</p>
<p>I started by simply <a href="http://www.google.com/search?q=earth+day" title="Google: Earth Day">Googling Earth Day</a>. In a new browser window I opened the <a href="http://www.archive.org/web/web.php" title="Internet Archive: Wayback Machine">Internet Archive&#8217;s Wayback Machine</a>. These were to be my two main avenues for unearthing the way that Earth Day was represented on the internet over the years.</p>
<p>Wikipedia&#8217;s <a href="http://en.wikipedia.org/w/index.php?title=Earth_Day&amp;oldid=507540" title="Wikipedia: first version of an Earth Day page">first version of an Earth Day page</a> was created on December 16th, 2002. This is the <a href="http://en.wikipedia.org/w/index.php?title=Earth_Day&amp;oldid=162955685" title="Wikipedia: Earth Day">current Earth Day page</a> as of the creation of this post &#8211; last updated about a week ago.</p>
<p>The <a href="http://www.earthday.net/" title="Earthday Network">current home page for the Earthday Network</a> appears identical to the most recent <a href="http://web.archive.org/web/20070629112757/http://earthday.net/" title="Wikipedia: Earthday Network page June 29th 2007">version stored in the Wayback Machine</a>, dated June 29, 2007 &#8211; until you notice that the featured headline on the link to <a href="http://www.earthdaynetwork.tv">http://www.earthdaynetwork.tv</a> is different.</p>
<p>The site that claims to be &#8216;The Official Site of International Earth Day&#8217; is <a href="http://www.earthsite.org/" title="EarthSite.Org">EarthSite.org</a>. The oldest version from the Wayback Machine is from <a href="http://web.archive.org/web/19961223074057/http://www.earthsite.org/" title="Earth Site: December 1996">December of 1996</a>. This version shows a web visitor counter perpetually set to 1,671. Earth day ten years ago was scheduled for March 20th, 1997. If you scroll down a bit on the <a href="http://web.archive.org/web/19980415034928/www.earthsite.org/new.htm" title="Earth Site What's New (December 2006)">What&#8217;s New page</a> you can read the 1997 State of the World Message By John McConnell (attributed as the founder of Earth Day).</p>
<p>The U.S. Government portal for Earth Day was first <a href="http://web.archive.org/web/20030406140625/http://www.earthday.gov/" title="Wayback Machine: EarthDay.Gov (April 6, 2003)">archived in the Internet Archive on April 6, 2003</a>. The site, <a href="http://www.earthday.gov/index.htm" title="EarthDay.Gov">EarthDay.gov</a>, hasn&#8217;t changed much in the past 4 years. The EPA has an <a href="http://www.epa.gov/earthday/" title="EPA: Earth Day">Earth Day page of it&#8217;s own</a>, that was <a href="http://web.archive.org/web/19990203160413/http://www.epa.gov/earthday/" title="Wayback Machine: EPA Earth Day (February 3, 1999)">first archived in early 1999</a>. No clear way to know if that actually means that the EPA&#8217;s Earth Day page is older or if it was just found earlier by the <a href="http://www.archive.org/web/web.php" title="Wayback Machine homepage">Internet Archives ambitious web crawlers</a>.</p>
<p><a href="http://www.envirolink.org/index.html" title="Envirolink.Org">Envirolink.org</a>, with the tagline &#8220;The Online Environmental Community&#8221;, was first archived back in 1996. You can see on <a href="http://web.archive.org/web/*/http://www.envirolink.org" title="Wayback Machine page for Environlink.org">the Wayback Machine page for Environlink.org</a>, has a fairly full ten years worth of web page archiving.</p>
<p>Next I wanted to explore what the world of government records might produce on the subject. A quick stop over at <a href="http://www.footnote.com/searchdocuments.php?query=%22Earth+Day%22&amp;collection=-1&amp;submit=Search#9665925" title="Footnote.com: Earth Day">Footnote.com to search for &#8220;Earth Day&#8221;</a> didn&#8217;t yield a terribly promising list of results (no surprise there &#8211; most of their records date to before the time period we are looking for). Next I tried searching in <a href="http://arcweb.archives.gov/arc/basic_search.jsp" title="NARA: Archival Research Catalog">Archival Research Catalog</a> (ARC) over on the U.S. National Archives website. I got 15 hits &#8211; all fairly interesting looking&#8230; but none of them linked to digitized content. A search in Access to Archival Databases (AAD) system found 2 hits &#8211; one to some sort of contract between the EPA and a Fairfax Virginia company named EARTH DAY XXV from 1995 and the other a <a href="http://aad.archives.gov/aad/createpdf?rid=98221&amp;dt=1822&amp;dl=823" title="State Department telegram">State Department telegram</a> including this passage:</p>
<blockquote><p>THIS NATION IS COMMITTED TO STRIVING FOR AN ENVIRONMENT THAT NOT ONLY SUSTAINS LIFE, BUT ALSO ENRICHES THE LIVES OF PEOPLE EVERYWHERE &#8211; - HARMONIZING THE WORKS OF MAN AND NATURE. THIS COMMITMENT HAS RECENTLY BEEN REINFORCED BY MY PROCLAMATION, PURSUANT TO A JOINT RESOLUTION OF THE CONGRESS, DESIGNATING MARCH 21, 1975 AS EARTH DAY, AND ASKING THAT SPECIAL ATTENTION BE GIVEN TO EDUCATIONAL EFFORTS DIRECTED TOWARD PROTECTING AND ENHANCING OUR LIFE-GIVING ENVIRONMENT.</p></blockquote>
<p>I also thought to check the Government Printing Office&#8217;s (GPO) website for the <a href="http://www.gpoaccess.gov/pubpapers/search.html" title="GPO: Public Papers of the Presidents of the United States">Public Papers of the Presidents of the United States</a>. Currently it only permits searching back through 1991 online &#8211; but my <a href="http://frwebgate.access.gpo.gov/cgi-bin/multidb.cgi?WAIStemplate=multidb_results.html&amp;WAISqueryRule=%24WAISqueryString&amp;WAISdbName=2003_public_papers+2003+Public+Papers+of+the+Presidents+of+the+United+States%2C+Books+I+and+II&amp;WAISdbName=2002_public_papers+2002+Public+Papers+of+the+Presidents+of+the+United+States%2C+Book+I+and+II&amp;WAISdbName=2001_public_papers+2001+Public+Papers+of+the+Presidents+of+the+United+States%2C+Books+I+and+II&amp;WAISdbName=2000_public_papers+2000-2001+Public+Papers+of+the+Presidents+of+the+United+States%2C+Books+I%2C+II+and+III&amp;WAISdbName=1999_public_papers+1999+Public+Papers+of+the+Presidents+of+the+United+States%2C+Books+I+and+II&amp;WAISdbName=1998_public_papers+1998+Public+Papers+of+the+Presidents+of+the+United+States%2C+Books+I+and+II&amp;WAISdbName=1997_public_papers+1997+Public+Papers+of+the+Presidents+of+the+United+States%2C+Books+I+and+II&amp;WAISdbName=1996_public_papers+1996+Public+Papers+of+the+Presidents+of+the+United+States%2C+Books+I+and+II&amp;WAISdbName=1995_public_papers+1995+Public+Papers+of+the+Presidents+of+the+United+States%2C+Books+I+and+II&amp;WAISdbName=1994_public_papers+1994+Public+Papers+of+the+Presidents+of+the+United+States%2C+Books+I+and+II&amp;WAISdbName=1993_public_papers+1993+Public+Papers+of+the+Presidents+of+the+United+States%2C+Books+I+and+II&amp;WAISdbName=1992_public_papers+1992+Public+Papers+of+the+Presidents+of+the+United+States%2C+Books+I+and+II&amp;WAISdbName=1991_public_papers+1991+Public+Papers+of+the+Presidents+of+the+United+States%2C+Book+I&amp;WAISqueryString=%22Earth+Day%22&amp;Submit.=Submit&amp;WAISmaxHits=50&amp;WrapperTemplate=pubpapers_wrapper.html" title="Presidential Papers: Earth Day Search">search for &#8220;Earth Day&#8221;</a> did bring back 50 speeches, proclamations and other writings by the various presidents.</p>
<p>Frustrated by the total scattering of documents without any big picture, I headed back to Google &#8211; this time to <a href="http://www.google.com/archivesearch?q=%22Earth+Day%22&amp;scoring=t&amp;sa=N&amp;sugg=d&amp;as_hdate=1989&amp;lnav=dt" title="Google News Archives: Earth Day">search the Google News Archive for articles including &#8220;Earth Day&#8221; published before 1990</a>. The timeline display showed me articles mostly from TIME, the Washington Post and the New York Times &#8211; some of which claimed I would need to pay in order to read.</p>
<p>Back again to do one more regular Google search &#8211; this time for <a href="http://www.google.com/search?q=earth+day+archive" title="Google: earth day archive">earth day archive</a>. This yielded an assortment of hits &#8211; and just <a href="http://en.wikipedia.org/wiki/Above_the_fold" title="Wikipedia: Above the Fold">above the fold</a> I found my favorite snapshot of Earth Day history. <a href="http://www.time.com/time/archive/collections/0,21428,c_earth_day,00.shtml" title="TIME Collection: Earth Day">The TIME Earth Day Archive Collection</a> is a selection of the best covers, quotes and articles about Earth Day &#8211; from February 2, 1970 to the present. This is the gold mine for getting perspective on Earth Day as it has been perceived and celebrated in the United States. The covers are brilliant! If I had started this post early enough, I would have <a href="http://www.time.com/time/reprints/covers/" title="TIME Cover Usage request information">requested permission</a> to include some here.</p>
<p>With the passionate title <a href="http://www.time.com/time/magazine/article/0,9171,878179,00.html?internalid=ACA" title="TIME: Fighting to Save the Earth from Man">Fighting to Save the Earth from Man</a>, the first article in the TIME Earth Day Collection begins by quoting then <a href="http://www.usa-presidents.info/union/nixon-1.html" title="Richard Nixon First State of the Union Address">President Nixon&#8217;s first State of the Union Address</a>:</p>
<blockquote><p>The great question of the seventies is, shall we surrender to our surroundings, or shall we make our peace with nature and begin to make reparations for the damage we have done to our air, to our land, and to our water?</p></blockquote>
<p>Fast forward to the recent awarding of the Nobel Peace Prize for 2007 to the <a href="http://nobelprize.org/nobel_prizes/peace/laureates/2007/index.html" title="Nobel Peace Prize 2007">Intergovernmental Panel on Climate Change (IPCC) and Al Gore</a> and I have to image that the answer to that question of if we were ready to make peace with nature asked so long ago was &#8216;Not Yet&#8217;.</p>
<p>Overall, this was an interesting experiment. The hunt for &#8216;old&#8217; (such as it is in the fast moving world of the Internet) data about a topic online is a strange and frustrating experience. Even with the Wayback Machine, I often found myself with only part of the picture. Often the pages I tried to view were missing images or other key elements. Sometimes I found a link to something tantalizing, only to realize that the target page was not archived (or is so broken as to be of no use). The search through government records and old newspaper stories did produce some interesting results &#8211; but again seemed to fail to produce any sense of the big picture of Earth Day over the years.</p>
<p>The TIME Collection about Earth Day was assembled by humans and arranged nicely for examination by those interested in the subject. It is properly named a &#8216;collection&#8217; (in the archival sense) because it is not the pure output of activities surrounding Earth Day, but rather a selected snapshot of related articles and images that share a common topic. That said, it is my fervent hope that websites such as these appear more and more. I suspect that the lure of attracting more readers to their websites with existing content will only encourage more content creators with a long history to join in the fun. If other do it as well as TIME has seemed to in this case, it will be a win/win situation for everyone.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2007/10/15/blog-action-day-a-look-at-earth-day-as-archived-online/">Blog Action Day: A Look At Earth Day as Archived Online</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2007/10/15/blog-action-day-a-look-at-earth-day-as-archived-online/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Thoughts on Digital Preservation, Validation and Community</title>
		<link>http://www.spellboundblog.com/2007/07/06/thoughts-on-digital-preservation-validation-and-community/</link>
		<comments>http://www.spellboundblog.com/2007/07/06/thoughts-on-digital-preservation-validation-and-community/#comments</comments>
		<pubDate>Fri, 06 Jul 2007 04:21:19 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[archival community]]></category>
		<category><![CDATA[born digital records]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[electronic records]]></category>
		<category><![CDATA[future-proofing]]></category>
		<category><![CDATA[internet archiving]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2007/07/06/thoughts-on-digital-preservation-validation-and-community/</guid>
		<description><![CDATA[The preservation of digital records is on the mind of the average person more with each passing day. Consider the video below from the recent BBC article Warning of data ticking time bomb. Microsoft UK Managing Director Gordon Frazer running Windows 3.1 on a Vista PC (Watch video in the BBC News Player) The video [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2007/07/06/thoughts-on-digital-preservation-validation-and-community/">Thoughts on Digital Preservation, Validation and Community</a></p>
]]></description>
			<content:encoded><![CDATA[<p>The preservation of digital records is on the mind of the average person more with each passing day. Consider the video below from the recent BBC article <a href="http://news.bbc.co.uk/2/hi/technology/6265976.stm" title="BBC: Warning of data ticking time bomb">Warning of data ticking time bomb</a>.</p>
<p><embed src='http://news.bbc.co.uk/flash/app/embedded_video_trial/r_0_3/player.swf' flashVars='config=/nol/shared/bsp/hi/embedded_video_trial/frazer_vista/config/video.xml&#038;e=1' scale='noscale' quality='high' width='400' height='246' name='player' allowScriptAccess='always' type='application/x-shockwave-flash' pluginspage='http://www.macromedia.com/go/getflashplayer'></embed><br />
Microsoft UK Managing Director Gordon Frazer running Windows 3.1 on a Vista PC<br />
(<em><a href="http://www.bbc.co.uk/mediaselector/check/player/nol/newsid_6260000/newsid_6266800?redirect=6266892.stm&amp;news=1&amp;nbram=1&amp;bbwm=1&amp;nbwm=1&amp;bbram=1" title="BBC Video: Warning of data ticking time bomb" target="_blank">Watch video in the BBC News Player</a></em>)</p>
<p>The video discusses Microsoft&#8217;s <a href="http://www.microsoft.com/windows/products/winfamily/virtualpc/overview.mspx?wt_svl=20323a&amp;mg_id=20323b" title="Microsoft Virtual PC">Virtual PC</a> program that permits you to run multiple operating systems via a Virtual Console. This is an example of the emulation approach to ensuring access to old digital objects &#8211; and it seems to be done in a way that the average user can get their head around. Since a big part of digital preservation is ensuring you can do something beyond reading the 1s and 0s &#8211; it is promising step.  It also pleased me that they specifically mention the UK National Archives and how important it is to them that they can view documents as they originally appeared &#8211; not &#8216;converted&#8217; in any way.</p>
<p>Dorthea Salo of <a href="http://cavlec.yarinareth.net/" title="Caveat Lector">Caveat Lector</a> recently posted <a href="http://cavlec.yarinareth.net/archives/2007/06/28/hello-is-it-me-youre-looking-for/" title="Caveat Lector: Hello? Is it me you’re looking for?">Hello? Is it me you’re looking for?</a>. She has  a lot to say about <a href="http://en.wikipedia.org/wiki/Digital_curation" title="Wikipedia: Digital Curation">digital curation</a> , IR (which I took to stand for Information Repositories rather than Information Retrieval) and librarianship. Coming, as I do, from the software development and database corners of the world I was pleased to find someone else who sees a gap between the standard assumed roles of librarians and archivists and the reality of how well suited librarians&#8217; and archivists&#8217; skills are to &#8220;long-term preservation of information for use&#8221; &#8211; be it digital or analog.</p>
<p>I skimmed through the 65 page <a href="http://www.jisc.ac.uk/" title="JISC">Joint Information Systems Committee</a> (JISC) report Dorthea mentioned (<a href="http://www.jisc.ac.uk/media/documents/programmes/digital_repositories/dealing_with_data_report-final.pdf" title="JISC: Dealing with data: Roles, rights, responsibilities and relationships">Dealing with data: Roles, rights, responsibilities and relationships</a>). A search on the term &#8216;archives&#8217; took me to this passage on page 22:</p>
<blockquote><p>There is a view that so-called “dark archives” (archives that are either completely inaccessible to users or have very limited user access), are not ideal because <em><strong>if data are corrupted over time, this is not realised until point of use</strong></em>. <em>(emphasis added)</em></p></blockquote>
<p>For those acquainted with software development, the term <a href="http://en.wikipedia.org/wiki/Regression_testing" title="Wikipedia: Regression Testing">regression testing</a> should be familiar. It involves the creation of automated suites of test programs that ensure that as new features are added to software, the features you believe are complete keep on working. This was the first idea that came to my mind when reading the passage above. How do you do regression testing on a dark archive? And thinking about regression testing, digital preservation and dark archives fueled a fresh curiosity about what existing projects are doing to automate the validation of digital preservation.</p>
<p>A bit of Googling found me the UK National Archives requirements document for <a href="http://www.nationalarchives.gov.uk/electronicrecords/seamless_flow/preservation.htm" title="UK National Archives: Seamless Flow Preservation and Maintenance">The Seamless Flow Preservation and Maintenance Project</a>. They list regression testing as a &#8216;desirable&#8217; requirement in the <a href="http://www.nationalarchives.gov.uk/documents/preservation-maintenance-requirements-cat.pdf" title="Statement of Requirements for Preservation and Maintenance Project Digital Object Store">Statement of Requirements for Preservation and Maintenance Project Digital Object Store</a>  (defined as &#8220;those that should be included, but possibly as part of a later phase of development&#8221;). Of course it is very hard to tell if this regression testing is for the software tools they are building or for access to the data itself. I would bet the former.</p>
<p>Next I found my way to the website for <a href="http://www.lockss.org/lockss/Home" title="LOCKSS">LOCKSS</a> (Lots of Copies Keep Stuff Safe). While their goals relate to the preservation of electronically published scholarly assets&#8217; on the web, their approach to ensuring the validity of their data over time should be interesting to anyone thinking about long term digital preservation.</p>
<p>In the paper <a href="http://www.eecs.harvard.edu/~mema/publications/SOSP2003.pdf" title="LOCKSS: Preserving Peer Replicas By Rate­Limited Sampled Voting">Preserving Peer Replicas By Rate­Limited Sampled Voting</a> they share details of how they manage validation and repair of the data they store in their peer-to-peer architecture. I was bemused by the categories and subject descriptors assigned to the paper itself: <em>H.3.7 [Information Storage and Retrieval]: Digital Libraries; D.4.5 [Operating Systems]: Reliability</em> . Nothing about preservation or archives.</p>
<p>It is also interesting to note that you can view most of the original presentation at the <a href="http://www.cs.rochester.edu/meetings/sosp2003/home.shtml" title="SOSP 2003">19th ACM Symposium on Operating Systems Principles</a> (SOSP 2003) from a video archive of webcasts of the conference. The presentation of the LOCKSS paper begins about halfway through the <a href="http://ramp.ucsd.edu/voelker/sosp03/Day1.2.mov" title="SOSP 2003: Day 1 Session 2: File and Storage Systems">2nd video</a> on the <a href="http://www.cs.rochester.edu/meetings/sosp2003/videoarchive.shtml" title="SOSP 2003: Video Archive">video archive page</a> .</p>
<p>The start of the section on design principles explains:</p>
<blockquote><p>Digital preservation systems have some unusual features. First, such systems must be very cheap to build and maintain, which precludes high-performance hardware such as RAID, or complicated administration. Second, they need not operate quickly. Their purpose is to prevent rather than expedite change to data. Third, they must function properly for decades, without central control and despite possible interference from attackers or catastrophic failures of storage media such as fire or theft.</p></blockquote>
<p>Later they declare the core of their approach as &#8220;..replicate all persistent storage across peers, audit replicas regularly and repair any damage they find.&#8221; The paper itself has lots of details about HOW they do this &#8211; but for the purpose of this post I was more interested in their general philosophy on how to maintain the information in their care.</p>
<p><a href="http://daitss.fcla.edu/" title="DAITSS">DAITSS</a> (Dark Archive in the Sunshine State) was built by the <a href="http://www.fcla.edu/" title="Florida Center for Library Automation">Florida Center for Library Automation</a> (FCLA) to support their own needs when creating the <a href="http://www.fcla.edu/digitalArchive/" title="Florida Digital Archive">Florida Center for Library Automation Digital Archive</a> (Florida Digital Archive or FDA). In mid May of 2007, FCLA <a href="http://blogs.fcla.edu/index.php/digitalarchive/2007/05/14/daitss_open_source_release_announced" title="FCLA announced the release of DAITSS as open source software">announced the release of DAITSS as open source software</a> under the GPL license.</p>
<p>In the document <a href="http://www.fcla.edu/digitalArchive/pdfs/IJDL_article.pdf" title="The Florida Digital Archive and DAITSS: A Working Preservation Repository Based on Format Migration">The Florida Digital Archive and DAITSS: A Working Preservation Repository Based on Format Migration</a> I found:</p>
<blockquote><p>&#8230; the [Florida Digital Archive] is configured to write three copies of each file in the [Archival Information Package] to tape. Two copies are written locally to a robotic tape unit, and one copy is written in real time over the Internet to a similar tape unit in Tallahassee, about 130 miles away. The software is written in such a way that all three writes must complete before processing can continue.</p></blockquote>
<p>Similar to LOCKSS, DAITSS relies on what they term &#8216;multiple masters&#8217;. There is no concept of a single master. Since all three are written virtually simultaneously they are all equal in authority. I think it is very interesting that they rely on writing to tapes. There was a mention that it is cheaper &#8211; yet due to many issues they might still switch to hard drives.</p>
<p>With regard to formats and ensuring accessibility, the same document quoted above states on page 2:</p>
<blockquote><p>Since most content was expected to be documentary (image, text, audio and video) as opposed to executable (software, games, learning modules), FCLA decided to implement preservation strategies based on reformatting rather than emulation&#8230;.Full preservation treatment is available for twelve different file formats: AIFF, AVI, JPEG, JP2, JPX, PDF, plain text, QuickTime, TIFF, WAVE, XML and XML DTD.</p></blockquote>
<p>The design of DAITSS was based on the <a href="http://www.rlg.org/en/page.php?Page_ID=3201" title="RLG: More information about OAIS">Reference Model for an Open Archival Information System</a> (OAIS). I love this paragraph from page 10 of <a href="http://public.ccsds.org/publications/archive/650x0b1.pdf" title="OAIS Formal Specifications">the formal specifications for OAIS</a> adopted as ISO 14721:2002.</p>
<blockquote><p>The information being maintained has been deemed to need Long Term Preservation, even if the OAIS itself is not permanent. Long Term is long enough to be concerned with the impacts of changing technologies, including support for new media and data formats, or with a changing user community. <em><strong>Long Term may extend indefinitely.</strong></em> <em>(emphasis added)</em></p></blockquote>
<p>Another project implementing the OAIS reference model is <a href="http://www.casparpreserves.eu/" title="CASPAR - Cultural, Artistic and Scientific knowledge for Preservation, Access and Retrieval">CASPAR &#8211; Cultural, Artistic and Scientific knowledge for Preservation, Access and Retrieval</a>. This project appears much greater in scale than DAITSS. It started a bit more than 1 year ago (April 1, 2006) with a projected duration of 42 months, 17 partners and a projected budget of 16 million Euros (roughly 22 million US Dollars at the time of writing). Their <a href="http://www.casparpreserves.eu/publications" title="CASPER: Publications">publications section</a> looks like it could sidetrack me for weeks! On page 25 of the <a href="http://www.casparpreserves.eu/Members/metaware/ReferenceDocuments/caspar-description-of-work/at_download/file" title="CASPAR Description of Work">CASPAR Description of Work</a>, in a section labeled Validation, a distinction is made between &#8220;here and now validation&#8221; and &#8220;the more fundamental validation techniques on behalf of the &#8216;not yet born&#8217;&#8221;. What eloquent turns of phrase!</p>
<p>Page 7 found me another great tidbit in a list of digital preservation metrics that are expected:</p>
<blockquote><p>2) Provide a practical demonstration by means of what may be regarded as “accelerated lifetime” tests. These should involve demonstrating the ability of the Framework and digital information to survive:<br />
<em>a. environment (including software, hardware) changes:</em> Demonstration to the External Review Committee of usability of a variety of digitally encoded information despite changes in hardware and software of user systems, and such processes as format migration for, for example, digital science data, documents and music<br />
<em>b. changes in the Designated Communities and their Knowledge Bases:</em> Demonstration to the External Review Committee of usability of a variety of digitally encoded information by users of different disciplines</p></blockquote>
<p>Here we have thought not only about the technicalities of how users may access the objects in the future, but consideration of users who might not have the frame of reference or understanding of the original community responsible for creating the object. I haven&#8217;t seen any explicit discussion of this notion before &#8211; at least not beyond the basic idea of needing good documentation and contextual background to support understanding of data sets in the future. I love the phrase &#8216;accelerated lifetime&#8217; but I wonder how good a job we can do at creating tests for technology that does not yet exist (consider the <a href="http://www.yorktownhistory.org/homepages/1900_predictions.htm" title="The Ladies Home Journal  of December 1900">Ladies Home Journal predictions for the year 2000 published in 1900</a>).</p>
<p>What I love about LOCKSS, DAITSS and CASPAR (and no, it isn&#8217;t their fabulous acronyms) is the very diverse groups of enthusiastic people trying to do the right thing. I see many technical and research oriented organizations listed as members of the <a href="http://www.casparpreserves.eu/caspar-project/the-caspar-consortium" title="CASPAR Consortium">CASPAR Consortium</a> &#8211; but I also see the <a href="http://www.uniurb.it/it/portale/index.php?mist_id=0&amp;lang=IT&amp;tipo=IST&amp;page=0" title="Università degli studi di Urbino">Università degli studi di Urbino</a> (noted as &#8220;created in 1998 to co-ordinate all the research and educational activities within the University of Urbino in the area of archival and library heritage, with specific reference to the creation, access, and preservation of the documentary heritage&#8221;) and the <a href="http://www.hatii.arts.gla.ac.uk/" title="Humanities Advanced Technology and Information Institute, University of Glasgow">Humanities Advanced Technology and Information Institute, University of Glasgow</a> (noted as having &#8220;developed a cutting edge research programme in humanities computing, digitisation, digital curation and preservation, and archives and records management&#8221;). LOCKSS and DAITSS have both evolved in library settings.</p>
<p>Questions relating to digital archives, preservation and validation are hard ones. New problems and new tools (like Microsoft&#8217;s Virtual PC shown in the video above) are appearing all the time. Developing best practices to support real world solutions will require the <strong><em>combined</em></strong> attention of those with the skills of librarians, archivists, technologists, subject matter specialists and others whose help we haven&#8217;t yet realized we need. The challenge will be to find those who have experience in multiple areas and pull them into the mix. Rather than assuming that one group or another is the best choice to solve digital preservation problems, we need to remember there are scores of problems &#8211; most of which we haven&#8217;t even confronted yet.  I vote for cross pollination of knowledge and ideas rather than territorialism. I vote for doing your best to solve the problems you find in your corner of the world. There are more than enough hard questions to answer to keep everyone who has the slightest inclination to work on these issues busy for years. I would hate to think that any of those who want to contribute might have to spend energy to convince people that they have the &#8216;right&#8217; skills. Worse still &#8211; many who have unique viewpoints might not be asked to share their perspectives because of general assumptions about the &#8216;kind&#8217; of people needed to solve these problems. Projects like CASPAR give me hope that there are more examples of great teamwork than there are of people being left out of the action.</p>
<p>There is so much more to read, process and understand. Know of a digital preservation project with a unique approach to validation that I missed? Please <a href="http://www.spellboundblog.com/contact/">contact me</a> or post a comment below.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2007/07/06/thoughts-on-digital-preservation-validation-and-community/">Thoughts on Digital Preservation, Validation and Community</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2007/07/06/thoughts-on-digital-preservation-validation-and-community/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
<enclosure url="http://ramp.ucsd.edu/voelker/sosp03/Day1.2.mov" length="128824262" type="video/quicktime" />
		</item>
		<item>
		<title>Digital Archiving Articles &#8211; netConnect Spring 2007</title>
		<link>http://www.spellboundblog.com/2007/04/19/digital-archiving-articles-netconnect-spring-2007/</link>
		<comments>http://www.spellboundblog.com/2007/04/19/digital-archiving-articles-netconnect-spring-2007/#comments</comments>
		<pubDate>Thu, 19 Apr 2007 06:04:03 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[born digital records]]></category>
		<category><![CDATA[electronic records]]></category>
		<category><![CDATA[funding]]></category>
		<category><![CDATA[internet archiving]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2007/04/19/digital-archiving-articles-netconnect-spring-2007/</guid>
		<description><![CDATA[Thanks to Jessamyn West&#8217;s blog post, I found my way to a series of articles in the Spring 2007 edition of netConnect: Funding the Past &#8211; and Future by Fancine Fialkoff Saving Digital History by Jessamyn West LC Needs Digital Support by Norman Oder &#8220;Saving Digital History&#8221; is the longest of the three and is [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2007/04/19/digital-archiving-articles-netconnect-spring-2007/">Digital Archiving Articles &#8211; netConnect Spring 2007</a></p>
]]></description>
			<content:encoded><![CDATA[<p>Thanks to Jessamyn West&#8217;s <a href="http://www.librarian.net/stax/2021/saving-digital-history-me-in-library-journal/" title="Librarian.net">blog post</a>, I found my way to a series of articles in the Spring 2007 edition of <a href="http://www.libraryjournal.com/index.asp?layout=netConnect" title="netConnect">netConnect</a>:</p>
<ul>
<li><a href="http://www.libraryjournal.com/article/CA6431790.html" title="Library Journal: Funding the Past - and Future">Funding the Past &#8211; and Future</a> by Fancine Fialkoff</li>
<li><a href="http://www.libraryjournal.com/article/CA6430408.html" title="Library Journal: Saving Digital History">Saving Digital History</a> by <a href="http://librarian.net/" title="Librarian.net">Jessamyn West</a></li>
<li><a href="http://www.libraryjournal.com/article/CA6431790.html" title="Library Journal: LC Needs Digital Support">LC Needs Digital Support</a> by Norman Oder</li>
</ul>
<p>&#8220;Saving Digital History&#8221; is the longest of the three and is a nice survey of many of the issues found at the interseciton of archiving, born digital records and the wild world of the web. I especially love the extensive Link List at the end of the articles &#8212; there are lots of interesting related resources. This is the sort of list of links I wish were available with ALL articles online!</p>
<p>I can see the evolution of some of the ideas she and her co-speakers touched on in their session at SAA 2006: <a href="http://www.archivists.org/conference/dc2006/dc2006prog-Session.asp?event=1775" title="SAA 2006: Everyone’s Doing It: What Blogs Mean for Archivists in the 21st Century">Everyone’s Doing It: What Blogs Mean for Archivists in the 21st Century</a>. I hope we continue to see more of these sorts of panels and articles. There is a lot to think about related to these issues &#8211; and there are no easy answers to the many hard questions.</p>
<p><strong><em>Update:</em></strong> Here is a link to Jessamyn&#8217;s presentation from the SAA session mentioned above: <a href="http://www.librarian.net/talks/saa/" title="Capturing Collaborative Information News, Blogs, Librarians, and You">Capturing Collaborative Information News, Blogs, Librarians, and You</a>.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2007/04/19/digital-archiving-articles-netconnect-spring-2007/">Digital Archiving Articles &#8211; netConnect Spring 2007</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2007/04/19/digital-archiving-articles-netconnect-spring-2007/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Google, Privacy, Records Managment and Archives</title>
		<link>http://www.spellboundblog.com/2007/03/16/google-privacy-records-managment-and-archives/</link>
		<comments>http://www.spellboundblog.com/2007/03/16/google-privacy-records-managment-and-archives/#comments</comments>
		<pubDate>Fri, 16 Mar 2007 21:21:04 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[born digital records]]></category>
		<category><![CDATA[electronic records]]></category>
		<category><![CDATA[historical research]]></category>
		<category><![CDATA[internet archiving]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[privacy]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2007/03/16/google-privacy-records-managment-and-archives/</guid>
		<description><![CDATA[BoingBoing.net posted on March 14 and March 15 about Google&#8217;s announcement of a plan to change their log retention policy . Their new plan is to strip parts of IP data from records in order to protect privacy. Read more in the AP article covering the announcement. For those who are not familiar with them [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2007/03/16/google-privacy-records-managment-and-archives/">Google, Privacy, Records Managment and Archives</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a href="http://boingboing.net/" title="BoingBoing">BoingBoing.net</a> posted on <a href="http://www.boingboing.net/2007/03/14/google_will_purge_so.html" title="Google will purge some user data to better protect privacy">March 14</a> and <a href="http://www.boingboing.net/2007/03/15/npr_xeni_tech_google.html" title="NPR Xeni Tech on Google">March 15</a> about <a href="http://googleblog.blogspot.com/2007/03/taking-steps-to-further-improve-our.html" title="Google Announcment about privacy">Google&#8217;s announcement</a> of a plan to <a href="http://216.239.57.110/blog_resources/google_log_retention_policy_faq.pdf" title="Log Retention Policy FAQ">change their log retention policy</a> . Their new plan is to strip parts of IP data from records in order to protect privacy. Read more in the <a href="http://news.yahoo.com/s/ap/20070315/ap_on_hi_te/google_privacy" title="Google Privacy (AP News)">AP article</a> covering the announcement.</p>
<p>For those who are not familiar with them &#8211; IP addresses are made up of sets of numbers and look something like 192.39.288.3. To see how good a job they can do figuring out the location you are in right now &#8211; go to <a href="http://www.ip-adress.com/" title="IP Address">IP Address</a> or <a href="http://www.ipaddressguide.com/">IP Address Guide</a> (click on &#8216;Find City&#8217;).</p>
<p>Google currently keeps IP addresses and their corresponding search requests in their log files (more on this in the <a href="http://www.google.com/privacy_faq.html#personalinfo" title="Google Privacy Policy: Personal Info">personal info section</a> of their <a href="http://www.google.com/privacypolicy.html" title="Google Privacy Policy">Privacy Policy</a>). Their new plan is that after 18-24 months they will permanently erase part of the IP address, so that the address no longer can point to a single computer &#8211; rather it would point to a set of 256 computers (according to the AP article linked above).</p>
<p>Their choice to permanently redact these records after a set amount of time is interesting. They don&#8217;t want to get rid of the records &#8211; just remove the IP addresses to reduce the chance that those records could be traced back to specific individuals. This policy will be retroactive &#8211; so all log records more than 18-24 months old will be modified.</p>
<p>I am not going to talk about how good an idea this is.. or if it doesn&#8217;t go far enough (plenty of others are doing that, see articles at <a href="http://www.eff.org/deeplinks/archives/005162.php" title="EFF on Google Plan">EFF</a> and <a href="http://blog.wired.com/27bstroke6/2007/03/google_to_anony.html" title="Ryan Singel (WIRED)">Wired: 27B Stroke 6</a> ). I want to explore the impact of choices like these on the records we will have the opportunity to preserve in archives in the future.</p>
<p>With my &#8216;archives&#8217; hat on &#8211; the bigger question here is how much the information that Google captures in the process of doing their business could be worth to the historians of the future. I wonder if we will one day regret the fact that the only way to protect the privacy of those who have done Google searches is to erase part of the electronic trail. One of the archivist tenants is to never do anything to the record you cannot undo.  In order for Google to succeed at their goal (making the records useless to government investigators) &#8211; it will HAVE to be done such that it cannot be undone.</p>
<p>In my information visualization course yesterday, our professor spoke about how great maps are at tying information down. We understand maps and they make a fabulous stable framework upon which we can organize large volumes of information. It sounds like the new modified log records would still permit a general connection to the physical geographic world &#8211; so that is a good thing. I do wonder if the &#8216;edited&#8217; versions of the log records will still permit the grouping of search requests such that they can be identified as having been performed by the same person (or at least from the same computer)? Without the context of other searches by the same person/computer, would this data still be useful to a historian? Would being able to examine the searches of a &#8216;community&#8217; of 256 computers be useful (if that is what the IP updates mean).</p>
<p>What if Google could lock up the unmodified version of those stats in a box for 100 years (and we could still read the media it is recorded on and we had documentation telling us what the values meant and we had software that could read the records)? What could a researcher discover about the interests of those of us who used Google in 2007? Would we loose a lot by if we didn&#8217;t know what each individual user searched for? Would it be enough to know <strong>what</strong> a gillion groups of 256 people/computers from around the world were searching for &#8211; or would loosing that tie to an individual turn the data into noise?</p>
<p>Privacy has been such a major issue with the records of many businesses in the past. Health records and school records spring to mind. I also find myself thinking of Arthur Anderson who would not have gotten into trouble for shredding their records if they had done so according to their own records disposition schedules and policies. Googling <a href="http://www.google.com/search?hl=en&amp;q=electronic+document+retention+policy&amp;btnG=Search" title="Google: Electronic Document Retention Policy">Electronic Document Retention Policy</a> got me over a million hits. Lots of people (lawyers in particular) have posted articles all over the web talking about the importance of a well implemented Electronic Document Retention Policy. I was intrigued by the final line of a USAToday article from January 2006 about <a href="http://www.usatoday.com/tech/news/internetprivacy/2006-01-19-google-search-records_x.htm" title="Google Battles Government over Porn Investigation">Google and their battle with the government over a pornography investigation</a>:</p>
<blockquote><p>Google has no stated guidelines on how long it keeps data, leading critics to warn that retention could be for years because of inexpensive data-storage costs.</p></blockquote>
<p>That isn&#8217;t true any longer.</p>
<p>For me, this choice by Google has illuminated a previously hidden perfect storm. That the US government often request of this sort of log data is clear, though Google will not say how often. The intersection of concerns about privacy, government investigations, document retention and tremendous volumes of private sector business data seem destined to cause more major choices such as the one Google has just announced. I just wonder what the researchers of the future will think of what we leave in our wake.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2007/03/16/google-privacy-records-managment-and-archives/">Google, Privacy, Records Managment and Archives</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2007/03/16/google-privacy-records-managment-and-archives/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>The Archives and Archivists Listserv: hoping for a stay of execution</title>
		<link>http://www.spellboundblog.com/2007/03/14/the-archives-and-archivists-listserv-hoping-for-a-stay-of-execution/</link>
		<comments>http://www.spellboundblog.com/2007/03/14/the-archives-and-archivists-listserv-hoping-for-a-stay-of-execution/#comments</comments>
		<pubDate>Wed, 14 Mar 2007 14:34:58 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[born digital records]]></category>
		<category><![CDATA[electronic records]]></category>
		<category><![CDATA[future-proofing]]></category>
		<category><![CDATA[historical research]]></category>
		<category><![CDATA[internet archiving]]></category>
		<category><![CDATA[preservation]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2007/03/14/the-archives-and-archivists-listserv-hoping-for-a-stay-of-execution/</guid>
		<description><![CDATA[There has been a lot of discussion (both on the Archives &#38; Archivists (A&#38;A) Listserv and in blog posts) about the SAA&#8216;s recent decision to not preserve the A&#38;A listserv posts from 1996 through 2006 when they are removed from the listserv&#8217;s old hosting location at Miami University of Ohio. Most of the outcry against [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2007/03/14/the-archives-and-archivists-listserv-hoping-for-a-stay-of-execution/">The Archives and Archivists Listserv: hoping for a stay of execution</a></p>
]]></description>
			<content:encoded><![CDATA[<p>There has been a lot of discussion (both on the <a href="http://forums.archivists.org/read/?forum=archives" title="A&amp;A Listserv">Archives &amp; Archivists (A&amp;A) Listserv</a> and in <a href="http://www.technorati.com/search/www.boingboing.net%2F2007%2F03%2F13%2Fsociety_of_american_.html?sub=mtcosmos" title="A&amp;A Listserv Blog Posts">blog posts</a>) about the <a href="http://archivists.org/" title="SAA">SAA</a>&#8216;s recent <a href="http://forums.archivists.org/read/messages?id=3392" title="A&amp;A Post Announcing SAA Decision">decision to not preserve the A&amp;A listserv posts from 1996 through 2006</a> when they are removed from the <a href="http://listserv.muohio.edu/archives/archives.html" title="A&amp;A Old Listserv Posts">listserv&#8217;s old hosting location</a> at <a href="http://www.miami.muohio.edu/" title="Miami University of Ohio">Miami University of Ohio</a>.</p>
<p>Most of the outcry against this decision has fallen into two camps:</p>
<ul>
<li>Those who don&#8217;t understand how the SAA task force assigned to appraise the listserv archives could decide it does not have informational value &#8211; lots of discussion about how the listserv reflects the move of archivists into the digital age as well as it&#8217;s usefulness for students</li>
<li>Those who just wish it wouldn&#8217;t go away because they still use it to find old posts. Some mentioned that there are scholarly papers that reference posts in the listserv archives as their primary sources.</li>
</ul>
<p>I added this suggestion on the listserv:</p>
<blockquote><p>I would have thought that the Archives Listserv would be the ideal test case for developing a set of best practices for archiving an organization&#8217;s web based listserv or bboard.</p>
<p>Perhaps a graduate student looking for something to work on as an independent project could take this on? Even if they only got permission for working with posts from 2001 onward <em>[post 2001 those who posted had to agree to 'terms of participation' that reduce issues with copyright and ownership]</em> &#8211; I suspect it would still be worthwhile.</p></blockquote>
<p>I have always found that you can&#8217;t understand all the issues related to a technical project (like the preservation of a listserv) until you have a real life case to work on. Even if SAA doesn&#8217;t think we need to keep the data forever &#8211; here is the perfect set of data for archivists to experiment with. Any final set of best practices would be meant for archivists to use in the future &#8211; and would be all the easier to comprehend if they dealt with a listserv that many of them are already familiar with.</p>
<p>Another question: couldn&#8217;t the listserv posts still be considered &#8216;active records&#8217;? Many current listserv posters claim they still access the old list&#8217;s archives on a regular basis. I would be curious what the traffic for the site is. That is one nice side effect of this being on a website &#8211; it makes the usage of records quantifiable.</p>
<p>There are similar issues in the analog world when records people still want to use loose their physical home and are disposed of but, as others have also pointed out, digital media is getting cheaper and smaller by the day. We are not talking about paying rent on a huge wharehouse or a space that needs serious temperature and humidity control.</p>
<p>I was glad to see Rick Prelinger&#8217;s response on the current listerv that simply reads:</p>
<blockquote><p>The Internet Archive is looking into this issue.</p></blockquote>
<p>I had already checked when I posted my response to the listerv yesterday &#8211; having found my way to <a href="http://web.archive.org/web/*/http://listserv.muohio.edu/archives/archives.html" title="A&amp;A Old Listserv in Wayback Machine">the A&amp;A old listserv page</a> in the <a href="http://www.archive.org/web/web.php" title="Wayback Machine">Wayback Machine</a>. For now all that is there is the list of links to each week&#8217;s worth of postings &#8211; nothing beyond that has been pulled in.</p>
<p>I have my fingers crossed that enough of the right people have become aware of the situation to pull the listserv back from the brink of the digital abyss.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2007/03/14/the-archives-and-archivists-listserv-hoping-for-a-stay-of-execution/">The Archives and Archivists Listserv: hoping for a stay of execution</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2007/03/14/the-archives-and-archivists-listserv-hoping-for-a-stay-of-execution/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Book Review: Digital Preservation</title>
		<link>http://www.spellboundblog.com/2007/01/18/book-review-digital-preservation/</link>
		<comments>http://www.spellboundblog.com/2007/01/18/book-review-digital-preservation/#comments</comments>
		<pubDate>Fri, 19 Jan 2007 02:02:59 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[book review]]></category>
		<category><![CDATA[born digital records]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[electronic records]]></category>
		<category><![CDATA[future-proofing]]></category>
		<category><![CDATA[internet archiving]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[preservation]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2007/01/18/book-review-digital-preservation/</guid>
		<description><![CDATA[In my quest for information about archiving geospatial data last term, I got my hands on a copy of Marilyn Deegan and Simon Tanner&#8217;s Digital Preservation (part of the Digital Futures Series). This excellent volume consists of nine chapters each written by different authors who are leaders in their respective fields (shown in order of [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2007/01/18/book-review-digital-preservation/">Book Review: Digital Preservation</a></p>
]]></description>
			<content:encoded><![CDATA[<p>In my quest for information about archiving geospatial data last term, I got my hands on a copy of Marilyn Deegan and Simon Tanner&#8217;s <a href="http://www.amazon.com/gp/product/1856044858?ie=UTF8&amp;tag=spellboundblog-20&amp;linkCode=as2&amp;camp=1789&amp;creative=9325&amp;creativeASIN=1856044858">Digital Preservation</a> (part of the <a href="http://www.facetshop.co.uk/mm5/merchant.mvc?Screen=CTGY&amp;Store_Code=1&amp;Category_Code=837">Digital Futures Series</a>). This excellent volume consists of nine chapters each written by different authors who are leaders in their respective fields (shown in order of their respective chapters):</p>
<ul>
<li><a title="David Holdsworth" href="http://web.archive.org/web/20060429211840/http://www.personal.leeds.ac.uk/~ecldh/">David Holdsworth</a>: known for his work on the <a title="CEDARS" href="http://www.leeds.ac.uk/cedars/">CEDARS</a> and <a title="CAMiLEON" href="http://www.si.umich.edu/CAMILEON/">CAMiLEON</a> projects</li>
<li><a title="Robin Wendler" href="http://hul.harvard.edu/ldi/html/robin_wendler.html">Robin Wendler</a>: metadata analyst at the <a title="Harvard Libraries" href="http://lib.harvard.edu/">Harvard University Library</a> Office for Information Studies</li>
<li>Julien Masanès: co-founder of the <a title="European Archive" href="http://www.europarchive.org">European Archive</a></li>
<li>Elisa Mason: maintains the <a title="Forced Migration Current Awareness Blog" href="http://fm-cab.blogspot.com/">Forced Migration Current Awareness Blog</a></li>
<li><a title="Brian Lavoie" href="http://www.oclc.org/research/staff/lavoie.htm">Brian F. Lavoie</a>: a research scientist at <a title="OCLC" href="http://www.oclc.org">OCLC</a></li>
<li><a title="Stephen Chapman" href="http://hul.harvard.edu/ldi/html/steve_chapman.html">Stephen Chapman</a>: Preservation Librarian for Digital Initiatives in the <a title="Library Preservation at Harvard" href="http://preserve.harvard.edu/">Weissman Preservation Center, Harvard University Library</a></li>
<li>Peter McKinney: research officer for the <a title="espida" href="http://www.gla.ac.uk/espida/">espida project</a> at the <a title="University of Glasgow" href="http://www.gla.ac.uk/">University of Glasgow</a></li>
<li>Jasmine Kelly: a former research assistant at the <a title="Centre for Computing in the Humanities" href="http://www.kcl.ac.uk/schools/humanities/cch">Centre for Computing in the Humanities, King&#8217;s College London</a></li>
</ul>
<p>This fabulous band of writers and researchers were led by Marilyn Deegan and <a title="Simon Tanner" href="http://www.ulcc.ac.uk/dptp/dptp-presenters/simon-tanner.html">Simon Tanner</a>, both based out of the <a title="King's College London" href="http://www.kcl.ac.uk/">King&#8217;s College London</a>.</p>
<p>Published in 2006, this is one of the most comprehensive and up to date books I found on the subject. The book starts out with two chapters addressing the basic issues related to digital preservation. Subsequent chapters present information about all kinds of metadata, web archiving, the costs of digital preservation and an overview of European approaches. The final chapter presents an extensive series of case studies &#8211; complete with URLs to give you plently of information online to explore.</p>
<p>This book gave me a great foundation from which to explore the details of various geospatial data and GIS archiving efforts. For those faced with the challenge of planning for digital preservation, the two chapters on costs should be very useful.  So many articles talk about how it will be so expensive to ensure proper digital preservation, but don&#8217;t give people in the field any practical advice in planning for the costs &#8211; this book is different. The exploration of existing approaches being used at major institutions throughout Europe give a good sense of evolving standards and best practices.</p>
<p>If you are looking for a way to get a handle on the issues involved in digital preservation &#8211; this a great starting point. The final chapter on case studies alone could keep you busy for a month as you explore all the websites of projects from around the world. While the book has a decidedly European focus, the concepts are applicable the world over. If you are responsible for ensuring that digital records (either digitized or born digital) are protected and preserved &#8211; this book explains the basics and explores various strategies. They don&#8217;t oversimplify things &#8211; but take the time to explain things well. They are honest about those questions that aren&#8217;t answered yet&#8230; and they point to as many resources, standards and examples as they can. While  <a href="http://www.amazon.com/gp/product/1856044858?ie=UTF8&amp;tag=spellboundblog-20&amp;linkCode=as2&amp;camp=1789&amp;creative=9325&amp;creativeASIN=1856044858">Digital Preservation</a> cannot provide a formula for everyone to follow, it can help you start asking the right questions and begin to understand the possibilities.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2007/01/18/book-review-digital-preservation/">Book Review: Digital Preservation</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2007/01/18/book-review-digital-preservation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

