<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Spellbound Blog &#187; learning technology</title>
	<atom:link href="http://www.spellboundblog.com/category/learning-technology/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.spellboundblog.com</link>
	<description>Archives, Digital Humanities, Cultural Heritage, Technology</description>
	<lastBuildDate>Mon, 06 Feb 2012 14:49:35 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Digitization Program Site Visit: Archives of American Art</title>
		<link>http://www.spellboundblog.com/2012/02/03/digitization-program-site-visit-archives-of-american-art/</link>
		<comments>http://www.spellboundblog.com/2012/02/03/digitization-program-site-visit-archives-of-american-art/#comments</comments>
		<pubDate>Fri, 03 Feb 2012 05:43:15 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[learning technology]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=1196</guid>
		<description><![CDATA[The image of Alexander Calder above shows him in his studio, circa 1950. It is from a folder titled Photographs: Calder at Work, 1927-1956, undated, part of Alexander Calder&#8217;s Papers held by the Smithsonian Archives of American Art and available online through the efforts of their digitization project. I love that this image capture him in [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2012/02/03/digitization-program-site-visit-archives-of-american-art/">Digitization Program Site Visit: Archives of American Art</a></p>
]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><strong><a href="http://www.aaa.si.edu/collections/images/detail/alexander-calder-his-studio-10308"><img class="wp-image-1242 aligncenter" title="Alexander Calder in his studio, ca. 1950 / unidentified photographer." src="http://www.spellboundblog.com/wp-content/uploads/2012/02/AAA_caldalex_26738.jpg.jpeg" alt="" width="471" height="479" /></a></strong></p>
<p>The image of Alexander Calder above shows him in his studio, circa 1950. It is from a folder titled <a title="Photographs: Calder at Work" href="http://www.aaa.si.edu/collections/container/viewer/Calder-at-Work--189357">Photographs: Calder at Work, 1927-1956, undated</a>, part of <a title="Alexander Calder's Papers" href="http://www.aaa.si.edu/collections/alexander-calder-papers-7294/more">Alexander Calder&#8217;s Papers</a> held by the <a title="Smithsonian Archives of American Art" href="http://www.aaa.si.edu/">Smithsonian Archives of American Art</a> and available online through the efforts of their digitization project. I love that this image capture him in his creative space &#8211; you get to see the happy chaos from which Calder drew his often sleek and sparse sculptures.</p>
<p>Back in October, I had the opportunity to visit with staff of the digitization program for the Smithsonian Archives of American Art along with a group of my colleagues from the World Bank. This is a report on that site visit. It is my hope that these details can help others planning digitization projects – much as it is informing our own internal planning.</p>
<p><strong>Date of Visit:</strong> October 18, 2011</p>
<p><strong>Destination:</strong> <a title="Smithsonian Archives of American Art" href="http://www.aaa.si.edu/">Smithsonian Archives of American Art</a></p>
<p><strong>Smithsonian Archives of American Art Hosts:</strong></p>
<ul>
<li><a title="LinkedIn: Karen Weiss" href="http://www.linkedin.com/pub/karen-b-weiss/11/aa2/251">Karen Weiss</a></li>
<li><a title="LinkedIn: Barbara Aikens" href="http://www.linkedin.com/pub/barbara-aikens/3b/5ab/6a7">Barbara Aikens </a></li>
<li>Many additional staff members</li>
</ul>
<p><strong>Summary: </strong> This visit was two hours in length and consisted of a combination of presentation, discussion and site tour to meet staff and examine equipment.</p>
<p><strong>Background: </strong>The Smithsonian’s Archives of American Art (AAA) program was first funded by a grant from the <a title="Terra Foundation of American Art" href="http://www.terraamericanart.org/">Terra Foundation of American Art</a> in 2005, recently extended through 2016. This funding supports both staff and research.</p>
<p>Their digitization project replaced their existing microfilm program and focuses on digitizing complete collections. Digitization focused on in-house collections (in contrast with collections captured on microfilm from other institutions across the USA as part of their microfilm program).</p>
<p>Over the course of the past 6 years, they have scanned over 110 collections &#8211; a total of 1,000 linear feet – out of an available total of 13,000 linear feet from 4,500 collections. They keep a prioritized list of what they want digitized.</p>
<p>The Smithsonian DAM (digital asset management system) had to be adjusted to handle the hierarchy of EAD and the digitized assets. Master files are stored in the Smithsonian DAM. Files stored in intermediate storage areas are only for processing and evaluation and are disposed of after they have been ingested into the DAM.</p>
<p>Current staffing is two and a half archivists and two digital imaging specialists. One digital imaging specialist focuses on scanning full collections, while the other focuses on on-demand single items.</p>
<p>The website is built in <a title="ColdFusion" href="http://en.wikipedia.org/wiki/ColdFusion">ColdFusion</a> and pulls content from a SQL database. Currently they have no way to post media files (audio, oral histories, video) on the external web interface.</p>
<p>They do not delineate separate items within folders. When feedback comes in from end users about individual items, this information is usually incorporated into the scope note for the collection, or the folder title of the folder containing the item. Full size images in both the image gallery and the full collections are watermarked.</p>
<p>They track the processing stats and status of their projects.</p>
<p><strong>Standard Procedures:</strong></p>
<p><span style="text-decoration: underline;">Full Collection Digitization:</span></p>
<ul>
<li>Their current digitization workflow is based on their microfilm process. The workflow is managed via an internal web-based management system. Every task required for the process is listed, then crossed off and annotated with the staff and date the action was performed.</li>
<li>Collections earmarked for digitization are thoroughly described by a processing archivist.</li>
<li>Finding aids are encoded in <a title="EAD" href="http://www.loc.gov/ead/">EAD</a> and created in XML using <a title="NoteTab" href="http://www.notetab.com/">NoteTab Pro</a> software.</li>
<li><a title="MARC" href="http://en.wikipedia.org/wiki/MARC_standards">MARC</a> records are created when the finding aid is complete. The summary information from the MARC record is used to create the summary of the collection published on the website.</li>
<li>Box numbers and folder numbers are assigned and associated with a finding aid. The number of the box and folder are all a scanning technician needs.</li>
<li>A ‘scanning information worksheet’ provides room for notes from the archivist to the scanning technician.  It provides the opportunity to indicate which documents should not be scanned. Possible reasons for this are duplicate documents or those containing personal identifying information (PIP).</li>
<li>A directory structure is generated by a script based on the finding aid, creating a directory folder for each physical folder which exists for the collection. Images are saved directly into this directory structure. The disk space to hold these images is centrally managed by the Smithsonian and automatically backed up.</li>
<li>All scanning is done in 600dpi color, according to their internal  guidelines. They frequently have internal projects which demand high resolution images for use in publication.</li>
<li>After scanning is complete, the processing archivist does the post scanning review before the images are pushed into the DAM for web publication.</li>
<li>Their policy is to post everything from a digitized collection, but they do support a take-down policy.</li>
<li>A recent improvement was made in January, 2010. At that time they relaunched the site to include all of their collections co-located on the same list, both digitized and non-digitized.</li>
</ul>
<p><span style="text-decoration: underline;">On Demand Digitization:</span></p>
<ul>
<li>Patrons may request the digitization of individual items.</li>
<li>These requests are evaluated by archivists to determine if it is appropriate to digitize the entire folder (or even box) to which the item belongs.</li>
<li>Requests are logged in a paper log.</li>
<li>Item level scanning ties back to an item level record with an item ID. There is an ‘Online Removal Notice’ to create item level stub.</li>
<li>An item level cataloger describes the content after it is scanned.</li>
<li>Unless there is an explicit copyright or donor restriction, the items is put online in the <a title="Archives of American Art Image Gallery" href="http://www.aaa.si.edu/collections/images">Image Gallery</a> (which currently has 12,000 documents).</li>
<li>Access to images is provided by keyword searching.</li>
<li>Individual images are linked back to the archival description for the collection from which they came.</li>
</ul>
<p><strong>Improvements/Changes they wish for: </strong></p>
<ul>
<li>They currently have no flexibility to make changes in the database nimbly. It is a tedious process to change the display and each change requires a programmer.</li>
<li>They would like to consider a move to open source software or to use a central repository – though they have concerns about what other sacrifices this would require.</li>
<li>Show related collections, list connected names (currently the only options for discovery are an A-Z list of creators or keyword search).</li>
<li>Ability to connect to guides and other exhibits.</li>
</ul>
<p><strong>References:</strong></p>
<ul>
<li><a title="Archives of American Art Image Gallery" href="http://www.aaa.si.edu/collections/images">Image Gallery</a></li>
<li><a title="Archives of American Art" href="http://www.aaa.si.edu/">Main Website</a></li>
<li><a title="Digitization Project" href="http://www.aaa.si.edu/collections/projects/terra">Digitization Project</a></li>
<li><a title="Technical Documentation" href="http://www.aaa.si.edu/collections/documentation">Technical Documentation</a> &#8211; shares internal procedures and guidelines</li>
<li><a title="OCLC rapid capture paper" href="http://www.oclc.org/research/publications/library/2011/2011-04r.htm">OCLC rapid capture paper</a></li>
<li><a title="Scanning equipment" href="http://www.digitaltransitions.com/page/divison-of-cultural-heritage-products">Scanning equipment</a></li>
</ul>
<p><em>Image Credit:</em> Alexander Calder papers, Archives of American Art, Smithsonian Institution.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2012/02/03/digitization-program-site-visit-archives-of-american-art/">Digitization Program Site Visit: Archives of American Art</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2012/02/03/digitization-program-site-visit-archives-of-american-art/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Day of Digital Archives</title>
		<link>http://www.spellboundblog.com/2011/10/06/day-of-digital-archives/</link>
		<comments>http://www.spellboundblog.com/2011/10/06/day-of-digital-archives/#comments</comments>
		<pubDate>Thu, 06 Oct 2011 18:26:00 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[born digital records]]></category>
		<category><![CDATA[digital humanities]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[electronic records]]></category>
		<category><![CDATA[future-proofing]]></category>
		<category><![CDATA[learning technology]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=1179</guid>
		<description><![CDATA[To be honest, today was a half day of digital archives, due to personal plans taking me away from computers this afternoon. In light of that, my post is more accurately my &#8216;week of digital archives&#8217;. The highlight of my digital archives week was the discovery of the Digital Curation Exchange. I promptly joined and [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2011/10/06/day-of-digital-archives/">Day of Digital Archives</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a title="Day of Digital Archives" href="http://dayofdigitalarchives.blogspot.com/"><img class="alignright size-medium wp-image-1180" title="Day of Digital Archives" src="http://www.spellboundblog.com/wp-content/uploads/2011/10/sticker1-copy.jpg-300x300.jpg" alt="" width="300" height="300" /></a>To be honest, today was a half day of digital archives, due to personal plans taking me away from computers this afternoon. In light of that, my post is more accurately my &#8216;week of digital archives&#8217;.</p>
<p>The highlight of my digital archives week was the discovery of the <a title="Digital Curation Exchange" href="http://digitalcurationexchange.org/">Digital Curation Exchange</a>. I promptly joined and began to explore their &#8216;space for all things &#8216;digital curation&#8217; &#8216;. This led me to a fabulous list of <a title="Digital Curation Resources" href="http://digitalcurationexchange.org/resources">resources</a>, including a set of <a title="Digital Curation Syllabi" href="http://digitalcurationexchange.org/resources?field_resource_type_value[]=376&amp;keys=">syllabi for courses related to digital curation</a>. Each link brought me to an extensive reading list, some with full slide decks related to weekly in classroom presentations. My &#8216;to read&#8217; list has gotten much longer &#8211; but in a good way!</p>
<p>On other days recently I have found myself involved in all of the following:</p>
<ul>
<li>review of metadata standards for digital objects</li>
<li>creation of internal guidelines and requirements documents</li>
<li>networking with those at other institutions to help coordinate site visits of other digitization projects</li>
<li>records management planning and reviews</li>
<li>learning about the OCR software available to our organization</li>
<li>contemplation of the web archiving efforts of organizations and governments around the world</li>
<li>reviewing my organization&#8217;s social media policies</li>
<li>listening to the audio of online training available from <a title="PLANETS training" href="http://www.planets-project.eu/training-materials/">PLANETS</a> (Preservation and Long-term Access through NETworked Services)</li>
<li>contemplation of the new <a title="Journal of Digital Media Management" href="http://www.henrystewart.com/jdmm/about.aspx"><em>Journal of Digital Media Management</em></a> and their recent <a title="call for articles - JDMM" href="http://www.henrystewart.com/jdmm/callforarticles.aspx">call for articles</a></li>
</ul>
<p>My new favorite quote related to digital preservation comes from <a title="What we reckon about keeping digital archives: High level principles guiding State Records’ approach" href="http://futureproof.records.nsw.gov.au/what-we-reckon-about-keeping-digital-archives-high-level-principles-guiding-state-records%E2%80%99-approach/">What we reckon about keeping digital archives: High level principles guiding State Records’ approach</a> from the State Records folks in New South Wales Australia, which reads:</p>
<blockquote><p>We will keep the Robert De Niro principle in mind when adopting any software or hardware solutions: “You want to be makin moves on the street, have no attachments, allow nothing to be in your life that you cannot walk out on in 30 seconds flat if you spot the heat around the corner” (Heat, 1995)</p>
<p>In other words, our digital archives technology will be designed to be sustainable given our limited resources so it will be flexible and scalable to allow us to utilise the most appropriate tools at a given time to carry out actions such as creation of preservation or access copies or monitoring of repository contents, but replace these tools with new ones easily and with minimal cost and with minimal impact.</p></blockquote>
<p>I like that this speaks to the fact that no plan can perfectly accommodate the changes in technology coming down the line. Being nimble and assuming that change will be the only constant are key to ensuring access to our digital assets in the future.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2011/10/06/day-of-digital-archives/">Day of Digital Archives</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2011/10/06/day-of-digital-archives/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Rescuing 5.25&#8243; Floppy Disks from Oblivion</title>
		<link>http://www.spellboundblog.com/2011/07/25/rescuing-5-25-floppy-disks-from-oblivion/</link>
		<comments>http://www.spellboundblog.com/2011/07/25/rescuing-5-25-floppy-disks-from-oblivion/#comments</comments>
		<pubDate>Tue, 26 Jul 2011 02:42:03 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[at risk records]]></category>
		<category><![CDATA[electronic records]]></category>
		<category><![CDATA[future-proofing]]></category>
		<category><![CDATA[learning technology]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=1006</guid>
		<description><![CDATA[Step-by-step instructions for saving files from 5 1/4" floppy disks.<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2011/07/25/rescuing-5-25-floppy-disks-from-oblivion/">Rescuing 5.25&#8243; Floppy Disks from Oblivion</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.spellboundblog.com/wp-content/uploads/2011/07/IMG_4121.jpg"><img class="alignright size-medium wp-image-1158" title="My 5 1/4&quot; Floppy Disks from the 1980s" src="http://www.spellboundblog.com/wp-content/uploads/2011/07/IMG_4121-300x225.jpg" alt="" width="300" height="225" /></a>This post is a careful log of how I rescued data trapped on 5 1/4&#8243; floppy disks, some dating back to 1984 (including those pictured here). While I have tried to make this detailed enough to help anyone who needs to try this, you will likely have more success if you are comfortable installing and configuring hardware and software.</p>
<p>I will break this down into a number of phases:</p>
<ul>
<li>Phase 1: Hardware</li>
<li>Phase 2: Pull the data off the disk</li>
<li>Phase 3: Extract the files from the disk image</li>
<li>Phase 4: Migrate or Emulate</li>
</ul>
<p><strong>Phase 1: Hardware</strong></p>
<p>Before you do anything else, you actually need a 5.25&#8243; floppy drive of some kind connected to your computer.  I was lucky &#8211; a friend had a floppy drive for us to work with. If you aren&#8217;t that lucky, you can generally find them on eBay for around $25 (sometimes less). A friend had been helping me by trying to connect the drive to my existing PC &#8211; but we could never get the communications working properly. Finally I found Device Side Data&#8217;s <a title="5.25&quot; Floppy Drive Controller" href="http://www.deviceside.com/fc5025.html">5.25&#8243; Floppy Drive Controller</a> which they <a title="Buy 5.25&quot; Floppy Drive Controller" href="http://shop.deviceside.com/prod/FC5025">sell online</a> for $55. What you are purchasing will connect your 5.25 Floppy Drive to a USB 2.0 or USB 1.1 port. It comes with drivers for connection to Windows, Mac and Linux systems.</p>
<p>If you don&#8217;t want to mess around with installing the disk drive into our computer, you can also purchase an <a title="Disk drive external enclosure and power supply" href="http://shop.deviceside.com/prod/CASE1">external drive enclosure and a tabletop power supply</a>. Remember, you still need the USB controller too.</p>
<p><em>Update:</em> I just found a <a title="Device Side's Drive Controller operation instructions" href="http://mith.umd.edu/vintage-computers/fc5025-operation-instructions">fantastic step-by-step guide to the hardware installation of Device Side&#8217;s drive controller</a> from the Maryland Institute for Technology in the Humanities (MITH), including tons of photographs, which should help you get the hardware install portion done right.</p>
<p><strong>Phase 2: Pull the data off the disk</strong></p>
<p>The next step, once you have everything installed, is to extract the bits (all those ones and zeroes) off those floppies. I found that creating a new folder for each disk I was extracting made things easier. In each folder I store the disk image, a copy of the extracted original files and a folder named &#8216;converted&#8217; in which to store migrated versions of the files.</p>
<p>Device Side provides software they call &#8216;Disk Image and Browse&#8217;. You can see an assortment of <a title="Disc Image &amp; Browse Screenshots" href="http://www.deviceside.com/screenshots.html">screenshots</a> of this software on their website, but this is what I see after putting a floppy in my drive and launching USB Floppy -&gt; Disk Image and Browse:</p>
<p><a href="http://www.spellboundblog.com/wp-content/uploads/2011/07/disk-image-and-browse.jpg"><img class="aligncenter size-full wp-image-1146" title="Disk Image and Browse" src="http://www.spellboundblog.com/wp-content/uploads/2011/07/disk-image-and-browse.jpg" alt="" width="385" height="337" /></a></p>
<p>You will need to select the &#8216;Disk Type&#8217; and indicate the destination in which to create your disk image. Make sure you create the destination directory <em>before</em> you click on the &#8216;Capture Disk File Image&#8217; button. This is what it may look like in progress:</p>
<p><a href="http://www.spellboundblog.com/wp-content/uploads/2011/07/disk-capture-in-progress.jpg"><img class="aligncenter size-full wp-image-1147" title="Disk Capture in Progress" src="http://www.spellboundblog.com/wp-content/uploads/2011/07/disk-capture-in-progress.jpg" alt="" width="289" height="160" /></a></p>
<p>Fair warning that this won&#8217;t always work. At least the developers of the software that comes with Device Side Data&#8217;s controller had a sense of humor. This is what I saw when one of my disk reads didn&#8217;t work 100%:</p>
<p><a href="http://www.spellboundblog.com/wp-content/uploads/2010/07/capture-disk-image-bummer.jpg"><img class="size-full wp-image-1007 aligncenter" title="Capturing Disk Image File... Bummer!" src="http://www.spellboundblog.com/wp-content/uploads/2010/07/capture-disk-image-bummer.jpg" alt="" width="289" height="159" /></a></p>
<p>If you are pressed for time and have many disks to work your way through, you can stop here and repeat this step for all the disks you have on hand.</p>
<p><strong>Phase 3: Extract the files from the disk image</strong></p>
<p>Now that you have a disk image of your floppy, how do you interact with it? For this step I used a free tool called <a title="Virtual Floppy Drive" href="http://vfd.sourceforge.net/">Virtual Floppy Drive</a>. After I got this installed properly, when my disk image appeared, it was tied to this program. Double clicking on the Floppy Image icon opens the floppy in a view like the one shown below:</p>
<p style="text-align: center;"><a href="http://www.spellboundblog.com/wp-content/uploads/2011/07/vfd-display.jpg"><img class="alignnone size-full wp-image-1143" title="Virtual Floppy Disk Display" src="http://www.spellboundblog.com/wp-content/uploads/2011/07/vfd-display.jpg" alt="" width="501" height="394" /></a></p>
<p style="text-align: left;">It looks like any other removable disk drive. Now you can copy any or all of the files to anywhere you like.</p>
<p><strong>Phase 4: Migrate or Emulate<br />
</strong></p>
<p>The last step is finding a way to open your files. Your choice for this phase will depend on the file formats of the files you have rescued. My files were almost all <a title="WordStar" href="http://www.wordstar.org/">WordStar</a> word processing documents. I found a <a title="tools for converting wordstar files" href="http://www.wordstar.org/index.php/wordstar-file-conversion/wordstar-for-dos">list of tools for converting WordStar files to other formats</a>.</p>
<p>The best one I found was <a title="HABit Version 3" href="http://www.hotkey.net.au/%7Ehambar/habit/wsc-ver3.htm">HABit version 3</a>.</p>
<p>It converts Wordstar files into text or html and even keeps the spacing reasonably well if you choose that option. If you are interested in the content more than the layout, then not retaining spacing will be the better choice because it will not put artificial spaces in the middle of sentences to preserve indentation. In a perfect world I think I would capture it both with layout and without.</p>
<p><strong>Summary</strong></p>
<p>So my rhythm of working with the floppies after I had all the hardware and software installed was as follows:</p>
<ul>
<li>create a new folder for each disk, with an empty &#8216;converted&#8217; folder within it</li>
<li>insert floppy into the drive</li>
<li>run DeviceSide&#8217;s Disk Image and Browse software (found on my PC running Windows under Start -&gt; Programs -&gt; USB Flopy)</li>
<li>paste the full path of the destination folder</li>
<li>name the disk image</li>
<li>click &#8216;Capture Disk Image&#8217;</li>
<li>double click on the disk image and view the files via vfd (virtual floppy drive)</li>
<li>copy all files into the folder for that disk</li>
<li>convert files to a stable format (I was going from WordStar to ASCII text) and save the files in the &#8216;converted&#8217; folder</li>
</ul>
<p>These are the detailed instructions I tried to find when I started my own data rescue project. I hope this helps you rescue files currently trapped on 5 1/4&#8243; floppies. Please let me know if you have any questions about what I have posted here.</p>
<p><em>Update:</em> Another great source of information is Archive Team&#8217;s wiki page on <a title="Archive Team: Rescuing Floppy Disks" href="http://archiveteam.org/index.php?title=Rescuing_Floppy_Disks">Rescuing Floppy Disks</a>.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2011/07/25/rescuing-5-25-floppy-disks-from-oblivion/">Rescuing 5.25&#8243; Floppy Disks from Oblivion</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2011/07/25/rescuing-5-25-floppy-disks-from-oblivion/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Heading to Austin for SXSW Interactive</title>
		<link>http://www.spellboundblog.com/2011/03/09/heading-to-austin-for-sxsw-interactive/</link>
		<comments>http://www.spellboundblog.com/2011/03/09/heading-to-austin-for-sxsw-interactive/#comments</comments>
		<pubDate>Wed, 09 Mar 2011 05:05:10 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[archival community]]></category>
		<category><![CDATA[digital humanities]]></category>
		<category><![CDATA[interface design]]></category>
		<category><![CDATA[learning technology]]></category>
		<category><![CDATA[virtual collaboration]]></category>
		<category><![CDATA[web 2.0]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=1087</guid>
		<description><![CDATA[Anyone out there going to be at SXSWi? I would love to find like-minded DH (digital humanities) and GLAM (Galleries, Libraries, Archives &#38; Museums) folks in Austin. If you can&#8217;t go, what do you wish I would attend and blog about after the fact? No promises on thoroughness of my blogging of course. I never [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2011/03/09/heading-to-austin-for-sxsw-interactive/">Heading to Austin for SXSW Interactive</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a href="http://sxsw.com/interactive"><img class="alignnone size-full wp-image-1088" title="SXSW Interactive 2011" src="http://www.spellboundblog.com/wp-content/uploads/2011/03/sxsw-interactive-e1299646562529.jpg" alt="" width="504" height="112" /></a></p>
<p>Anyone out there going to be at <a title="SXSW Interactive" href="http://sxsw.com/interactive">SXSWi</a>? I would love to find like-minded DH (digital humanities) and GLAM (Galleries, Libraries, Archives &amp; Museums) folks in Austin. If you can&#8217;t go, what do you wish I would <a title="SXSW Interactive Schedule" href="http://schedule.sxsw.com/?conference=interactive#">attend</a> and blog about after the fact?</p>
<p>No promises on thoroughness of my blogging of course. I never have mastered the &#8216;live blogging&#8217; approach, but I do enjoy taking notes and if the past is any guide to the future I usually manage at least 2 really detailed posts on sessions from any one conference. The rest end up being notes to myself that I always mean to somehow go back to and post later. Maybe I need to spend a month just cleaning up and posting old session summaries (or at least those that still seem interesting and relevant!).</p>
<p>Drop me a comment below or <a title="Contact Jeanne" href="http://www.spellboundblog.com/contact/">contact me</a> directly and let me know if you will be in Austin between March 10 and 15. Hope to see some of you there!</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2011/03/09/heading-to-austin-for-sxsw-interactive/">Heading to Austin for SXSW Interactive</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2011/03/09/heading-to-austin-for-sxsw-interactive/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ArchivesZ Needs You!</title>
		<link>http://www.spellboundblog.com/2010/07/07/archivesz-needs-you/</link>
		<comments>http://www.spellboundblog.com/2010/07/07/archivesz-needs-you/#comments</comments>
		<pubDate>Wed, 07 Jul 2010 04:48:24 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[archival community]]></category>
		<category><![CDATA[ArchivesZ]]></category>
		<category><![CDATA[learning technology]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[virtual collaboration]]></category>
		<category><![CDATA[what if]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=996</guid>
		<description><![CDATA[I got a kind email today asking &#8220;Whither ArchivesZ?&#8221;. My reply was: &#8220;it is sleeping&#8221; (projects do need their rest) and &#8220;I just started a new job&#8221; (I am now a Metadata and Taxonomy Consultant at The World Bank) and &#8220;I need to find enthusiastic people to help me&#8221;. That final point brings me to [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/07/07/archivesz-needs-you/">ArchivesZ Needs You!</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.spellboundblog.com/wp-content/uploads/2010/07/Unclesamwantyou2.jpg"><img class="alignright size-full wp-image-997" title="I Want You!" src="http://www.spellboundblog.com/wp-content/uploads/2010/07/Unclesamwantyou2.jpg" alt="" width="288" height="320" /></a>I got a kind email today asking &#8220;Whither ArchivesZ?&#8221;. My reply was: &#8220;it is sleeping&#8221; (projects do need their rest) and &#8220;I just started a new job&#8221; (I am now a Metadata and Taxonomy Consultant at The World Bank) and &#8220;I need to find enthusiastic people to help me&#8221;. That final point brings me to this post.</p>
<p>I find myself in the odd position of having finished my Master&#8217;s Degree and not wanting to sign on for the long haul of a PhD. So I have a big project that was born in academia, initially as a joint class project and more recently as independent research with a grant-funded programmer, but I am no longer in academia.</p>
<p>What happens to projects like ArchivesZ? Is there an evolutionary path towards it being a collaborative project among dispersed enthusiastic individuals? Or am I more likely to succeed by recruiting current graduate students at my former (and still nearby) institution? I have discussed this one-on-one with a number of individuals, but I haven&#8217;t thrown open the gates for those who follow me here online.</p>
<p>For those of you who have been waiting patiently, the <a title="ArchivesZ" href="http://zaphod.mindlab.umd.edu/ArchivesZ/Main.html">ArchivesZ  version 2 prototype</a> is avaiable online. I can&#8217;t promise it will stay  online for long &#8211; it is definitely brittle for reasons I haven&#8217;t  totally identified. A few things to be aware of:</p>
<ul>
<li>when you  load the main page, you should see tags listed at the bottom &#8211; if you  don&#8217;t at all, then drop me an email via my contact form and I will try  and get Tomcat and Solr back up. If you have a small screen &#8211; you may need to  view your browser full screen to get to all the parts of the UI.</li>
<li>I know there are lots of bugs of various sizes. Some paths through  the app work &#8211; some don&#8217;t. Some screens are just placeholders. Feel free  to poke around and try things &#8211; you can&#8217;t break it for anyone else!</li>
</ul>
<p>I think there are a few key challenges to building what I would think of as the first &#8216;full&#8217; version of ArchivesZ &#8211; listed here in no particular order:</p>
<ul>
<li>In the process of creating version 2, I was too ambitious. The current version of ArchivesZ has lots of issues, some usability &#8211; some bugs (see prototype above!)</li>
<li>Wherever a collaborative workspace of ArchivesZ were going to live, it would need large data sets. I did a lot of work on data from eleven institutions in the spring of 2009, so there is a lot of data available &#8211; but it is still a challenge.</li>
<li>A lot of my future ideas for ArchivesZ are trapped in my head. The good news is that I am honestly open to others&#8217; ideas for where to take it in the future.</li>
<li>How do we build a community around the creation of ArchivesZ?</li>
</ul>
<p>I still feel that there is a lot to be gained by building a centralized visualization tool/service through which researchers and archivists could explore and discover archival materials. I even think there is promise to a freestanding tool that supports exploration of materials within a single institution. I can&#8217;t build it alone. This is a good thing &#8211; it will be a much better in the end with the input, energy and knowledge of others. I am good at ideas and good at playing the devil&#8217;s advocate. I have lots of strength on the data side of things and visualization has been a passion of mine for years. I need smart people with new ideas, strong tech skills (or a desire to learn) and people who can figure out how to organize the herd of cats I hope to recruit.</p>
<p>So &#8211; what can you do to help ArchivesZ? Do you have mad Action Script 3 skills? Do you want to dig into the scary little ruby script that populates the database? Maybe you prefer to organize and coordinate? You have always wanted to figure out how a project like this could group from a happy (or awkward?) prototype into a real service that people depend on?</p>
<p>Do you have a vision for how to tackle this as a project? Open source? Grant funded? Something else clever?</p>
<p>Know any graduate students looking for good research topics? There are juicy bits here for those interested in data, classification, visualization and cross-repository search.</p>
<p>I will be at SAA in DC in August chairing a panel on search engine optimization of archival websites. If there is even just one of you out there who is interested, I would cheerfully organize an ArchivesZ summit of some sort in which I could show folks the good, bad and ugly of the prototype as it stands. Let me know in the comments below.</p>
<p>Won&#8217;t be at SAA but want to help? Chime in here too. I am happy to set up some shared desktop tours of whatever you would like to see.</p>
<p>PS: Yes, I do have all the version 2 code &#8211; and what is online at the <a title="Google Code: ArchivesZ" href="http://code.google.com/p/archivesz/">Google Code ArchivesZ page</a> is not up to date. Updating the <a title="ArchivesZ" href="http://www.archivesz.org">ArchivesZ website</a> and uploading the current code is on my to do list!</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/07/07/archivesz-needs-you/">ArchivesZ Needs You!</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2010/07/07/archivesz-needs-you/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Gridworks: Super Data Cleanup and Exploration Tool</title>
		<link>http://www.spellboundblog.com/2010/05/29/gridworks-data-cleanup-exploration-tool/</link>
		<comments>http://www.spellboundblog.com/2010/05/29/gridworks-data-cleanup-exploration-tool/#comments</comments>
		<pubDate>Sat, 29 May 2010 06:26:31 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[electronic records]]></category>
		<category><![CDATA[information visualization]]></category>
		<category><![CDATA[learning technology]]></category>
		<category><![CDATA[MARAC]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=987</guid>
		<description><![CDATA[In my presentation at the Spring 2010 Mid-Atlantic Regional Archives Conference (MARAC), Whirlwind Tour of Visualization-Land,  I showed some screenshots of a tool called Gridworks. At the time, Gridworks was not available to the general public. The good news is that earlier this month Gridworks 1.0 was officially released and you can get Gridworks right [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/05/29/gridworks-data-cleanup-exploration-tool/">Gridworks: Super Data Cleanup and Exploration Tool</a></p>
]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a href="http://code.google.com/p/freebase-gridworks/"><img class="size-full wp-image-988  aligncenter" title="ridworks" src="http://www.spellboundblog.com/wp-content/uploads/2010/05/gridworks.jpg" alt="" width="400" height="100" /></a></p>
<p>In my presentation at the Spring 2010 <a title="MARAC" href="http://www.marac.info">Mid-Atlantic Regional Archives Conference</a> (MARAC), <a title="Whirlwind Tour of Visualization-Land" href="http://www.slideshare.net/JKramerSmyth/marac-2010-visualization">Whirlwind Tour of  Visualization-Land</a>,  I showed some screenshots of a tool called Gridworks. At the time, Gridworks was not available to the general public. The good news is that earlier this month <a title="Gridworks 1.0 Announcment" href="http://blog.freebase.com/2010/05/10/announcing-the-release-of-freebase-gridworks-1-0/">Gridworks 1.0 was officially released</a> and you can <a title="Gridworks on Google Code" href="http://code.google.com/p/freebase-gridworks/">get Gridworks right now</a>.</p>
<p>For those of you who didn&#8217;t see my presentation, Gridworks is tool you run locally on your computer via a web browser. It permits you to load &#8216;grid-shaped data&#8217; for examination, filtering and data cleanup. That makes is sound so much less exciting than it is. The best way to get a sense of what you can do is to watch the <a title="Gridworks Videos" href="http://vimeo.com/groups/gridworks/videos">Gridworks Videos</a>.</p>
<p>What sort of data do I think there is in archives to be pumped  into Gridworks? How about collection descriptive data and electronic  record datasets? Since all the data is kept locally, you don&#8217;t need to worry about uploading your data to some anonymous server in order to work with it. It all stays safely on your local computer the whole time.</p>
<p>A quick list of things that Gridworks can do:</p>
<ul>
<li>Cluster data to find values that are almost the same so you can normalize your data (for example &#8211; NYC vs N.Y.C.)</li>
<li>Create instant facetted browsing based on any column in your data</li>
<li>Provide scatterplots of the values from any two numeric columns as well as a way to spot the most interesting combinations across many possible columns</li>
<li>Reconcilliation and validation of values based on data from within <a title="Freebase.com" href="http://www.freebase.com/">Freebase.com</a></li>
<li>Pull data from Freebase.com based on a matched column &#8211; such as the population of a country, if you have a column in your dataset with country specified</li>
<li>Splitting data within a cell based on a specified delimiter</li>
<li>Application of <a title="Wikipedia: Regular Expressions" href="http://en.wikipedia.org/wiki/Regular_expression">regular expressions</a> and other simple code to data to create new columns</li>
</ul>
<p>This list just scratches the surface, but it should give you a decent idea of the power of Gridworks. Even if the only feature you ever use is the one which lets you cluster and update your data to remove the &#8216;almost the same&#8217; values, Gridworks can save you hours of painstaking data cleanup.</p>
<p>Why is data cleanup exciting? Because once you have nice clean data with all the attributes that are usefull to have for your data set &#8211; then you can start playing with the data in visualization tools! So go watch some <a title="Gridworks Videos" href="http://vimeo.com/groups/gridworks/videos">Gridworks Videos</a>, <a title="Gridworks on Google Code" href="http://code.google.com/p/freebase-gridworks/">get Gridworks for yourself</a> and start playing with data. It is free and it makes working with data fun!</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/05/29/gridworks-data-cleanup-exploration-tool/">Gridworks: Super Data Cleanup and Exploration Tool</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2010/05/29/gridworks-data-cleanup-exploration-tool/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>MARAC Spring 2010: Hurray for Archival Metadata (Session S2)</title>
		<link>http://www.spellboundblog.com/2010/05/07/marac-spring-2010-hurray-for-archival-metadata-session-s2/</link>
		<comments>http://www.spellboundblog.com/2010/05/07/marac-spring-2010-hurray-for-archival-metadata-session-s2/#comments</comments>
		<pubDate>Fri, 07 May 2010 05:16:30 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[learning technology]]></category>
		<category><![CDATA[MARAC]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[virtual collaboration]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2010/05/07/marac-spring-2010-hurray-for-archival-metadata-session-s2/</guid>
		<description><![CDATA[The official title for this session is &#8220;Discovery Tools for Archival Collections: Getting the Most Out of Your Metadata&#8221; and was divided into two presentations with introduction and question moderation by Jaime L. Margalotti, senior assistant librarian in Special Collections at the University of Delaware. Introduction to Metadata Standards Michael Bolam, metadata librarian for digital [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/05/07/marac-spring-2010-hurray-for-archival-metadata-session-s2/">MARAC Spring 2010: Hurray for Archival Metadata (Session S2)</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a title="Flickr Commons: South Colonnade, arches and statues by Henry Hering" href="http://www.flickr.com/photos/field_museum_library/3333082739/"><img class="alignright" title="Statue of Research" src="http://www.spellboundblog.com/wp-content/uploads/2010/05/research-statue.jpg" alt="research-statue.jpg" width="185" height="272" /></a>The official title for this session is &#8220;Discovery Tools for Archival Collections: Getting the Most Out of Your Metadata&#8221; and was divided into two presentations with introduction and question moderation by <a title="LinkedIn: Jaime Margalotti" href="http://www.linkedin.com/pub/jaime-margalotti/11/259/406">Jaime L. Margalotti</a>, senior assistant librarian in Special Collections at the <a title="University of Delaware" href="http://www.udel.edu/">University of Delaware</a>.</p>
<p><strong>Introduction to Metadata Standards</strong></p>
<p><a title="LinkedIn: Michael Bolam" href="http://www.linkedin.com/pub/michael-bolam/4/a41/978">Michael Bolam</a>, metadata librarian for digital production, is in charge of all the metadata for all the collections at the <a title="University of Pittsburgh Digital Research Library" href="http://www.library.pitt.edu/libraries/drl/">Digital Research Library</a> at the <a title="University of Pittsburgh" href="http://www.pitt.edu/">University of Pittsburgh</a>. He is not an archivist &#8211; but does know where the archives is at Pitt! He has put lots of archival material online through digitization and assignment of metadata.</p>
<p>The best definition he has found of metadata, good for all audiences: &#8220;Metadata consists of statements we make about resources to help us find, identify, use, manage, evaluate and preserve them&#8221; Marty Kurth &#8211; Head of Metadata Services, Cornell University Libraries</p>
<p>Reviewed examples of metadata for images, text documents and archival collections. There is also data related to the business of scanning and making content available &#8211; administrative/behind the scene. Standards let you take your data and use it for other purposes.</p>
<p>Overview of alphabet soup of metadata standards:</p>
<ul>
<li><a title="MARC" href="http://www.loc.gov/marc/">MARC</a>: bibliographic information in machine-readable form (a <span class="pcolor"><strong>MA</strong></span>chine-<span class="pcolor"><strong>R</strong></span>eadable <span class="pcolor"><strong>C</strong></span>ataloging record).</li>
<li><a title="Dublin Core" href="http://dublincore.org/documents/dces/">Dublin Core</a>: the goal of Dublin Core was to create a core set of metadata fields that could be used across platforms, across various disciplines.</li>
<li><a title="MARCXML" href="http://www.loc.gov/standards/marcxml/">MARCXML</a>: schema for representing MARC in XML. Makes it easy to convert to and from MARC without loosing any data. May have more data than you need. MARCXML is not very &#8216;human readable&#8217;. You need to recall all the code numbers for the different data elements. Can be exported from Archivist Toolkit.</li>
<li><a title="MODS" href="http://www.loc.gov/standards/mods/">MODS</a>: <strong>M</strong>etadata <strong>O</strong>bject <strong>D</strong>escription <strong>S</strong>chema &#8211; sort of a &#8216;MARCXML light&#8217;. Tries to be a step between MARCXML (robust &amp; complicated) and Dublin Core (really simple). May result in compacting multiple MARCXML fields into single MODS fields. May loose some of the granularity of the data. The tags ARE human readable. The tag is the word &#8216;author&#8217; &#8211; not a number. Also can be exported in Archivists Toolkit.</li>
<li><a title="ONYX" href="http://www.editeur.org/8/ONIX">ONIX</a>: <strong>ON</strong>line <strong>I</strong>nformation e<strong>X</strong>change &#8211; standard used by the book publishing industry. XML-based standard for making available intellectual property in published form, both physical &amp; digital. Data created by the publisher. They use different ways of representing authors, keywords..etc in comparison to LOC and library cataloging.</li>
<li><a title="METS" href="http://www.loc.gov/standards/mets">METS</a>: <strong>M</strong>etadata <strong>E</strong>ncoding &amp; <strong>T</strong>ransmission <strong>S</strong>tandard. XML standard wrapper for describing divergent types of content within a digital library. The metadata for books, images, collections etc keep this data in different formats &#8211; METS lets you bring them together.</li>
<li><a title="OAI-PMH" href="http://www.loc.gov/standards/marcxml/">OAI-PMH</a>: Not a metadata standard &#8211; but rather a protocol for sharing metadata. Gives us a way to pull baseline information about a digital object out of a database and put it out somewhere where it can be harvested and used.</li>
</ul>
<p>Examples of projects built on shared metadata:</p>
<ul>
<li><a title="Worldcat.org" href="http://www.worldcat.org">Worldcat.org</a>: Has everything that is shared with OCLC. They do expose their records to google and yahoo harvesting.</li>
<li><a title="OAIster" href="http://oaister.worldcat.org">OAIster</a>: Searches a harvested data set &#8211; it is not going live out on the web. The OAIster records are also available in Worldcat. Example: search for Pittsburgh City Photographer (that is a provider of data). Most digitization software will generate an OAIster harvestable version. In his example we see that address and location get compressed into Notes. This is because there is not always a place in Dublin Core that maps to the level of detail you collect at your local institution. http://www.oclc.org/us/en/oaister/default.htm &#8211; has the info about contributing your content for crawling.</li>
<li><a href="http://www.archivegrid.org">Archive Grid</a>: The goal is to pull in finding aids from many sources. It is a service &#8211; requires some sort of subscription and payment to see the data. Uses Lucene for searching. The content in Archive Grid is now available in Worldcat. To participate &#8211; see http://www.oclc.org/us/en/archivegrid/default.htm</li>
</ul>
<p>Google and Yahoo do index OAIster and WorldCat, so that is one path to being found in search engines.</p>
<p><strong>MARC Records for Archival Materials in WorldCat Local</strong></p>
<p><a title="LinkedIn: Jennifer MacDonald" href="http://www.linkedin.com/pub/jennifer-macdonald/4/663/609">Jennifer MacDonald</a> from the <a title="University of Delaware" href="http://www.udel.edu/">University of Delaware</a> presented a cataloger&#8217;s perspective of a WorldCat Local environment. She is a &#8220;concerned enthusiast&#8221; with regard to metadata. The University of Delaware was the first institution to buy <a title="WorldCat Local" href="http://www.oclc.org/worldcatlocal/default.htm">WorldCat Local</a>. She ended up on the WorldCat Local Special collections and Archives Task Force. The task force made their <a title="WorldCat Local Special Collections and Archives Task Force Report 2008" href="http://www.rbms.info/committees/bibliographic_standards/committee-docs/FinalReportWCLSpecCollTaskForce.pdf">final report in 2008</a> and got a <a title="OCLC Feedback to Task Force 2009" href="http://www.rbms.info/committees/bibliographic_standards/committee-docs/OCLCResponseWCLTaskForce.pdf">response from OCLC in 2009</a>. They did get some immediate changes based on their feedback &#8211; like moving the 520 &#8220;summary&#8221; data element higher in the display. For some problems the task force identified, such as Archival Materials that were not being identified properly (Internet Resource is the type for all OAI records), it is hard to tell if the issue has been fixed.</p>
<p>She showed some screenshots from WorldCat local to show what data elements are there and how they are organized. In the FirstSearch screenshot (only available at the school), Notes and General Info holds a mishmash of content from various data elements consolidated into single fields. The task force asked for the &#8220;Browse&#8221; feature but apparently this feature is dead. They got no response from OCLC to this request in their report.</p>
<p>If you use the <a title="University of Delaware WorldCat Local" href="http://udel.worldcat.org">University of Delaware instance of WorldCat Local</a> to search for <a title="UDel Worldcat Local: walter penn shipley" href="http://udel.worldcat.org/search?q=walter+penn+shipley&amp;qt=results_page&amp;scope=0&amp;oldscope=1">walter penn shipley</a> and drill down to the detail record display for the <a title="Walter Penn Shipley Papers" href="http://udel.worldcat.org/title/walter-penn-shipley-papers-1879-1951/oclc/502285399&amp;referer=brief_results">Walter Penn Shipley Papers</a> you will see what was shown during the session. This display is customizable at the institution level in WorldCat Local. Some data is shown. You see lots of Web 2.0 options to add your own data, but the display is missing some of the data from the original MARC record. The full MARC record is indexed for keyword search, but since some of it is not displayed, users may not be able to determine why a record was returned.</p>
<p>Fields missing from the WorldCat Local display:</p>
<ul>
<li>351 &#8211; Organization and Arrangement of Materials</li>
<li>545 &#8211; biographical note</li>
<li>506 &#8211; restrictions on access</li>
<li>540 &#8211; Use of materials &#8211; with link to an askspec page: http://www.lib.udel.edu/cgi-bin/askspec.cgi</li>
<li>525 &#8211; preferred citation form &#8211; and this is where the manuscript number is</li>
<li>655 &#8211; some of the parts of the genre terms are missing</li>
<li>656 &#8211; occupation</li>
</ul>
<p>OCLC says that they have not included all this because people don&#8217;t want this displayed. Given that local organization is already deciding what to show, the task force would prefer the option to displayable all data elements. Due to this missing data, Jennifer prefers the FirstSearch interface &#8211; but this option is not always available at all institutions. You should take advantage of the Web 2.0 features. Archivist can create an account on WorldCat Local and add data elements.</p>
<p><strong>Questions and Answers</strong></p>
<p><strong>QUESTION:</strong> You talk about having the metadta in a format that is accessible to harvesting. What I have is a bunch of CDs with images on them that have a folder and descriptor structure. Is there a metadata harvester that can go in and pull that metadata out? New York Stock Exchange photographer sent these.</p>
<p><strong>ANSWER (Michael):</strong> So the metadata you are looking to extract is the filename and descriptors? You could have someone write a little script and extract what you need. I would hand it to the guy I work with because he writes perl. If then you made that available via your website &#8211; then people could find it. To get it into a database &#8211; it is just a small script.</p>
<p><strong>QUESTION:</strong> Are there any specifically useful webinars/seminars for becoming familiar with these formats for skillbuilding?</p>
<p><strong>ANSWER (Michael):</strong> Tons on the web. The LoC websites are very useful. You may have heard the term &#8216;crosswalking&#8217; &#8211; that is where you take one format and turn it into another. Looking at the crosswalks can make it much easier to understand how a format you understand maps to one you are trying to learn about. Shareable Metadata &#8211; metadata for you and me. Not online yet &#8211; but someone in the audience said the plan is to post the materials. There have been a couple of books and ALA publications. Most of the ones I know of are about 10 years old. <strong>Jaime</strong>: SAA has a good workshop series.</p>
<p><strong>QUESTION:</strong> One of the first things you said was to take data out of EAD and you didn&#8217;t go into detail in that. Were you talking about DAO tagged items?</p>
<p><strong>ANSWER (Michael):</strong> I was just talking about reusing data in a new environment. For example, we just started digitizing manuscripts and each item is becoming an individual digital object. The only metadata we have is in the EAD finding aid &#8211; so we are using that data to make descriptive data about the digital objects. We are going to create a MODS or METS record for every digital object. <strong>Jaime:</strong> We use EAD to make MODS records. She has been manually extracting EAD data as Dublin Core data for ContentDM.</p>
<p><strong>My QUESTION:</strong> What format does OAIster want?</p>
<p><strong>ANSWER (Michael):</strong> OAIster is just harvesting Dublin Core. You can share MODS and other metadata types and you may find other aggregators that are expecting their users to work in a more detailed environment. You may publish more data elements for other harvesters as well &#8211; but OAIster will only pull the Dublin Core data elements.</p>
<p><strong>QUESTION:</strong> We are working on a digitization project to digitize local historical societies, museums and libraries. Might the catalogers be able to deal with MODS or will the loss of granularity be a problem?</p>
<p><strong>ANSWER (Michael):</strong> I am not a MODS expert. MARC is very granular. Maybe look at the MARCXML &#8211; MODS crosswalk?</p>
<p><strong>QUESTION:</strong> At the University of Delaware, do you have any other systems?</p>
<p><strong>ANSWER (Jennifer):</strong> When we first got WorldCat Local you had to know the URL to get to the library. That changed fast! The patrons couldn&#8217;t find anything. <strong>Jaime:</strong> In WorldCat Local you cannot scope the search to specific sub-collections.</p>
<p><strong>QUESTION:</strong> Thank you Jennifer for your remarks. Is there a problem with catalogers trying to &#8216;sneak&#8217; data elements into other places &#8211; are standards in danger?</p>
<p><strong>ANSWER (Jennifer):</strong> I would hope we wouldn&#8217;t move 524 data into a 500 field just to get it displayed. There is some danger of loosing the granularity by pushing everything to Dublin Core. I don&#8217;t know how real that danger is at this point.</p>
<p><strong>QUESTION:</strong> A political question for Jennifer: Who has the clout to push for changes with OCLC?</p>
<p><strong>ANSWER (Jennifer):</strong> I think leaning encouraging users to give feedback is important. We were told that users don&#8217;t want that &#8220;we have proven that users don&#8217;t want that&#8221;. Users need to make comments about their challenges in dealing with the interface. <strong>FROM AUDIENCE:</strong> The strongest is to say that you are looking at Sky River. <strong>FROM AUDIENCE:</strong> Make your data more discoverable outside the catalog world &#8211; internal websites and Google. <strong>Jaime:</strong> We are working hard to make MARC records to push access to our collections. The push is to make the data available in as many locations as possible.</p>
<p><strong>QUESTION:</strong> Are these all different levels of subscriptions? Are they trying to push people to buy more subscriptions?</p>
<p><strong>ANSWER (Jennifer):</strong> There is a sense that WorldCat Local is pushed at local public libraries. Yes &#8211; WorldCat Local is something they have to pay for. <strong>Michael:</strong> With Archive Grid you are going a step further &#8211; EVERYTHING in the finding aid is indexed. Every search I did in there returned thousands of records. Then I filtered by institution &#8211; and it never loaded. <strong>FROM AUDIENCE:</strong> I think they are revamping Archive Grid &#8211; but I don&#8217;t know how far they are in the process. <strong>Michael:</strong> I love the detail &#8211; you don&#8217;t have to dig through other data to find something useful. Depending on the institution &#8211; and how they are allowing their data to be harvested &#8211; you may see less information. <strong>Jaime:</strong> You have to actively work with OCLC to get Archive Grid to pick up your data.</p>
<p><strong>QUESTION:</strong> We are tinkering with users adding tags &#8211; are you having any success with people adding tags?</p>
<p><strong>ANSWER (Jaime):</strong> No &#8211; it isn&#8217;t something we have dealt with. WorldCat Local does let you add stuff like that.</p>
<p><strong>QUESTION:</strong> Will OCLC provide that UGC (user generated content) back to the institution?</p>
<p><strong>ANSWER:</strong> We wouldn&#8217;t know.</p>
<p><strong>QUESTION:</strong> Have they provided access to the user studies?</p>
<p><strong>ANSWER:</strong> Yes &#8211; but it is based on watching individuals use the tools.</p>
<p><em>Image Credit:</em> Statue representing Research by Henry Hering from <a title="Flickr Commons: South Colonnade, arches and statues by Henry Hering" href="http://www.flickr.com/photos/field_museum_library/3333082739/">image of the interior of the Field Museum of Natural History interior</a>.</p>
<p><em>As is the case with all my session summaries from MARAC, please accept my apologies in advance for any cases in which I misquote, overly simplify or miss points altogether in the post above. These sessions move fast and my main goal is to capture the core of the ideas presented and exchanged. Feel free to contact me about corrections to my summary either via comments on this post or via my <a title="Contact Jeanne" href="http://www.spellboundblog.com/contact/">contact form</a>.</em></p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/05/07/marac-spring-2010-hurray-for-archival-metadata-session-s2/">MARAC Spring 2010: Hurray for Archival Metadata (Session S2)</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2010/05/07/marac-spring-2010-hurray-for-archival-metadata-session-s2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>National Archives Transitions to Flickr Commons Membership</title>
		<link>http://www.spellboundblog.com/2010/02/16/national-archives-transitions-flickr-commons/</link>
		<comments>http://www.spellboundblog.com/2010/02/16/national-archives-transitions-flickr-commons/#comments</comments>
		<pubDate>Tue, 16 Feb 2010 06:57:51 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[GIS]]></category>
		<category><![CDATA[learning technology]]></category>
		<category><![CDATA[outreach]]></category>
		<category><![CDATA[photography]]></category>
		<category><![CDATA[virtual collaboration]]></category>
		<category><![CDATA[web 2.0]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=824</guid>
		<description><![CDATA[Even with the recent announcement that the Flickr Commons is not currently accepting new applications, there are clearly still applications being processed. NARA has been on Flickr since February of 2009 and loaded 49 sets of images. As announced in a recent press release, on the first of February 2010 Flickr flipped the switch and all the images in the The U.S. National Archives' photostream was shifted over into the Commons.<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/02/16/national-archives-transitions-flickr-commons/">National Archives Transitions to Flickr Commons Membership</a></p>
]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a href="http://www.flickr.com/photos/usnationalarchives/3660777810/"><img class="size-full wp-image-852 aligncenter" title="Flickr Commons: NARA: Ladies in Gas Masks" src="http://www.spellboundblog.com/wp-content/uploads/2010/02/3660777810_2049a5718f.jpg" alt="Ladies in Gas Masks" width="428" height="424" /></a></p>
<p style="text-align: left;">Even with the recent announcement that the <a title="Flickr Commons not accepting new applications" href="http://www.archivesnext.com/?p=932">Flickr Commons is not currently accepting new applications</a>, there are clearly still applications being processed. NARA has been on Flickr since February of 2009 and loaded 49 sets of images. As announced in a recent <a title="Press Release: NARA joins Flickr Commons" href="http://www.archives.gov/press/press-releases/2010/nr10-57.html">press release</a>, on the first of February 2010 Flickr flipped the switch and all the images in the <a title="Flickr: The U.S. National Archives' photostream" href="http://www.flickr.com/photos/usnationalarchives/">The U.S. National Archives&#8217; photostream</a> was shifted over into the <a title="Flickr Commons" href="http://www.flickr.com/commons">Commons</a>.</p>
<p>The <a title="NARA Flickr Commons Sets" href="http://www.flickr.com/photos/usnationalarchives/sets/">49 sets</a> are sorted into <a title="NARA Flickr Commons Collections" href="http://www.flickr.com/photos/usnationalarchives/collections/">4 collections</a>:</p>
<ul>
<li><a title="Historical Photographs and Documents" href="http://www.flickr.com/photos/usnationalarchives/collections/72157620415261553/">Historical Photographs and Documents</a> (19 sets) &#8211; including NARA favorites like <a title="Flickr Commons: We Can Do It!" href="http://www.flickr.com/photos/usnationalarchives/3678696585/in/set-72157620680828773/">Rosie the Riveter</a> and <a title="Flickr Commons: Nixon and Elvis" href="http://www.flickr.com/photos/usnationalarchives/3679494978/in/set-72157620680828773/">Nixon and Elvis</a> and documents from regional archives across the country.</li>
<li><a title="DOCUMERICA Project by the Environmental Protection Agency " href="http://www.flickr.com/photos/usnationalarchives/collections/72157620729903309/">DOCUMERICA Project by the Environmental Protection Agency</a> (27 sets) &#8211; one set dedicated to top picks and the rest organized by photographer. Interestingly, NARA&#8217;s website has indexed the 15,000+ images from this project by <a title="Documerica by subject" href="http://www.archives.gov/research/arc/topics/environment/documerica-topics.html">subject</a> and by <a title="Documerica by location" href="http://www.archives.gov/research/arc/topics/environment/documerica-geographic.html">location</a>. I wonder how the picked which image from DOCUMERICA to port over to Flickr?</li>
<li><a title="Mathew Brady Civil War Photographs" href="http://www.flickr.com/photos/usnationalarchives/collections/72157622495226723/">Mathew Brady Civil War Photographs</a> <a href="http://www.flickr.com/photos/usnationalarchives/3995277577/in/set-72157622549882756"><img class="alignright size-full wp-image-850" title="Poplar Grove Church" src="http://www.spellboundblog.com/wp-content/uploads/2010/02/3995277577_2c4b28e495_m.jpg" alt="" width="240" height="193" /></a>(2 sets) &#8211; currently 473 out of the 6,066 digitized Mathew Brady images are uploaded into the Commons. The images posted in the Commons are available in a much higher resolution than they are within ARC. A great example from this collection is the image of the <a title="Flickr Commons: Poplar Church" href="http://www.flickr.com/photos/usnationalarchives/3995277577/in/set-72157622549882756">Poplar Church</a> (image shown to right) available as a 600 x 483 GIF on ARC and as a 3000 x 2416 JPG on Flickr. This image also has gotten a nice set of comments and tags.</li>
<li><a title="Flickr: Development and Public Works" href="http://www.flickr.com/photos/usnationalarchives/collections/72157622960946660/">Development and Public Works</a> (1 set) &#8211; the only set in this collection consists of images taken to support the Flathead Irrigation Project. &#8220;The Project was initiated to determine rights and distribute water originating on the Flathead Indian Agency in Montana to both tribal and non-tribal land.&#8221; These images seem to be the same resolution on both archives.gov and Flickr.</li>
</ul>
<p>In honor of this transition, NARA posted a new set of <a title="Ansel Adams Photographs" href="http://www.flickr.com/photos/usnationalarchives/sets/72157623296214442/">220 Ansel Adams photographs</a>. One of the first comments on the set was &#8220;low-res scans? Pretty big letdown.&#8221; Fine question. As noted above, other images from NARA in the Commons much larger than the 600 x 522 that seems to be available for the Ansel Adams images. It would be great to have a clear explanation about available resolutions published along with each new set of images.</p>
<p>NARA has published this simple <a title="NARA Flickr Commons Rights Statement" href="http://www.archives.gov/social-media/flickr-faqs.html#9">rights statement for all NARA images in the Commons</a>:</p>
<blockquote><p>All of the U.S. National Archives&#8217; images that are part of <a href="http://www.archives.gov/global-pages/exit.html?link=%20http://www.flickr.com/commons/">The Flickr Commons</a> are marked &#8220;no known copyright restrictions.&#8221; This means the U.S. National Archives is unaware of any copyright restrictions on the publication, distribution, or re-use of those particular photos. Their use restriction status in our online catalog is &#8220;unrestricted.&#8221; Therefore, no written permission is required to use them.</p></blockquote>
<p>NARA has also posted an official <a title="NARA Flickr Comment Policy" href="http://www.archives.gov/social-media/photo-comment-policy.html">Photo Comment and Posting Policy</a> and a fairly extensive <a title="NARA Flickr FAQ" href="http://www.archives.gov/social-media/flickr-faqs.html">FAQ</a> about the images they have post on Flickr. I do wish that there was a simpler way to request reprints of images from the Commons. Most of the NARA images have this standard sentence &#8211; but for someone not familiar with NARA and more accustom to one click ordering, the instructions seem very complex:</p>
<blockquote><p>For information about ordering reproductions of photographs held by the Still Picture Unit, visit: <a rel="nofollow" href="http://www.archives.gov/research/order/still-pictures.html">www.archives.gov/research/order/still-pictures.html</a></p></blockquote>
<p><a href="http://www.flickr.com/photos/35740357@N03/3952613735"><img class="alignright size-full wp-image-861" title="World Trade Center" src="http://www.spellboundblog.com/wp-content/uploads/2010/02/3952613735_73a0f8767b_m.jpg" alt="" width="163" height="240" /></a>I also wish that more of the images had location information assigned &#8211; only 113 of the images show up on the fun to explore <a title="NARA Flickr Map View" href="http://www.flickr.com/photos/usnationalarchives/map/">map view</a>. At first glance it looks as if this information is populated only for images taken near airports. There are many images that include a location based subject in the image description posted on Flickr, yet do not include geographic metadata that would permit the image to be shown on a map. <a href="http://www.flickr.com/photos/35740357@N03/3952613735"><img class="alignleft size-full wp-image-866" title="World Trade Center Tags" src="http://www.spellboundblog.com/wp-content/uploads/2010/02/world-trade-center-tags.jpg" alt="" width="123" height="144" /></a>The one image I did find that was not at an airport but did include geographic metadata is this <a title="Flickr World Trade Center" href="http://www.flickr.com/photos/35740357@N03/3952613735">image of the World Trade Center</a> assigned to the <a title="Flickr: NYC Financial District" href="http://www.flickr.com/places/United%20States/New%20York/New%20York/Financial%20District/">NYC Financial District Flickr Location</a>. While I could add a location related tag to NARA&#8217;s images, there does not appear any way for the general public to suggest location metadata.</p>
<p>One odd note about this and other World Trade Center images &#8211; the auto-generated tags have broken up the building name very oddly as shown in my screen clip on the left.</p>
<p>Another fun way way to explore the NARA Flickr images is to visit <a title="NARA Flickr Archives Page" href="http://www.flickr.com/photos/usnationalarchives/archives/">the &#8216;Archives&#8217; page</a> (slightly hilariously titled &#8220;U.S. National Archives&#8217; Archives&#8221;). Here we can browse photos based on when they were uploaded to Flickr or when they were taken. Those images that include a specific date can be viewed on a calendar (such as these <a title="NARA Flickr Images from 1918" href="http://www.flickr.com/photos/usnationalarchives/archives/date-taken/1918/calendar/">images from 1918</a>) or in a list view (those same <a title="NARA Flickr Images from 1918 - List View" href="http://www.flickr.com/photos/usnationalarchives/archives/date-taken/1918/">images from 1918 as a list</a>), while those taken &#8216;circa&#8217; a year can be viewed in a list with all other images from sometime that year (such as these <a title="NARA Flickr Images Circa 1824" href="http://www.flickr.com/photos/usnationalarchives/archives/circa/1824/">images from circa 1824</a>).</p>
<p>Beyond all the additional tags and content collected via comments on these images, I think that being able to find NARA images based on a map, calendar or tag is the real magic of the commons. The increased opportunities for access to these images cannot be overstated.</p>
<p><a href="http://www.spellboundblog.com/wp-content/uploads/2010/02/3953239497_35477bd7b7_m.jpg"><img class="alignleft size-full wp-image-865" title="Sunflower" src="http://www.spellboundblog.com/wp-content/uploads/2010/02/3953239497_35477bd7b7_m.jpg" alt="" width="240" height="163" /></a>Take this image of a <a title="NARA Flickr Sunflower" href="http://www.flickr.com/photos/usnationalarchives/3953239497/in/set-72157622453808416/">sunflower</a>. If you <a title="NARA page for the sunflower image" href="http://arcweb.archives.gov/arc/action/ExternalIdSearch?id=544590">visit this image on archives.gov</a>, you can certainly find the image and view it &#8211; but good luck finding all the images of flowers as quickly as this <a title="NARA Flickr Tag Flower" href="http://www.flickr.com/photos/usnationalarchives/tags/flower/">Flickr tag page for NARA images of flowers</a> can. Even looking at the special <a title="Documerica By Topic" href="http://www.archives.gov/research/arc/topics/environment/documerica-topics.html">Documerica by Topic</a> page doesn&#8217;t get me much closer to finding an image of a flower.</p>
<p>It will be fun to watch what else NARA chooses to upload to the Commons. I vote for more images that are assigned metadata such that they show up on the map and calendar. I will also put your mind at ease by telling you that the lovely ladies at the top of this post are their because their image is one of the most popular uploaded by NARA to date (based on it having been marked a favorite by 88 individuals). The only image I could find with more fans was the classic <a title="Flickr: Nixon and Elvis" href="http://www.flickr.com/photos/usnationalarchives/3679494978/in/set-72157620680828773/">image of Nixon and Elvis</a> with 250 fans at the time of this posting.</p>
<p>What is your favorite NARA Commons image? Please post a link in the comments and if I get enough I will set up a gallery of Spellbound Fan Favorites!</p>
<p><em>Image Credits:</em> All images within this blog post are pulled from NARA&#8217;s images on the Flickr Commons. Please click on the images to see their specific details.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/02/16/national-archives-transitions-flickr-commons/">National Archives Transitions to Flickr Commons Membership</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2010/02/16/national-archives-transitions-flickr-commons/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Leveraging Google Reader&#8217;s Page Change Tracking for Web Page Preservation</title>
		<link>http://www.spellboundblog.com/2010/01/26/leveraging-google-readers-page-change-tracking-for-web-page-preservation/</link>
		<comments>http://www.spellboundblog.com/2010/01/26/leveraging-google-readers-page-change-tracking-for-web-page-preservation/#comments</comments>
		<pubDate>Tue, 26 Jan 2010 06:49:21 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[at risk records]]></category>
		<category><![CDATA[future-proofing]]></category>
		<category><![CDATA[internet archiving]]></category>
		<category><![CDATA[learning technology]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=826</guid>
		<description><![CDATA[The Official Google Reader Blog recently announced a new feature that will let users watch any page for updates. The way this works is that you add individual URLs to your Google Reader account. Just as with regular RSS feeds, when an update is detected &#8211; a new entry is added to that subscription. My [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/01/26/leveraging-google-readers-page-change-tracking-for-web-page-preservation/">Leveraging Google Reader&#8217;s Page Change Tracking for Web Page Preservation</a></p>
]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a title="Cute RSS Icons" href="http://comingupforair.net/2008/01/rss-icons/"><img class="size-full wp-image-827  aligncenter" title="rss-icons" src="http://www.spellboundblog.com/wp-content/uploads/2010/01/rss-icons.jpg" alt="" width="418" height="373" /></a></p>
<p><a title="Cute RSS Icons" href="http://comingupforair.net/2008/01/rss-icons/"></a>The <a title="Official Google Reader Blog" href="http://googlereader.blogspot.com/">Official Google Reader Blog</a> recently announced a new feature that will let users <a title="Official Google Reader Blog: Follow Changes to Any Website" href="http://googlereader.blogspot.com/2010/01/follow-changes-to-any-website.html">watch any page for updates</a>. The way this works is that you add individual URLs to your Google Reader account. Just as with regular RSS feeds, when an update is detected &#8211; a new entry is added to that subscription.</p>
<p>My thinking is that this could be a really useful tool for archivists charged with preserving websites that change gradually over time, especially those fairly static sites that change infrequently with little or no notice of upcoming changes. If a web page was archived and then added to a dedicated Google Reader account, the archivist could scan their list of watch pages daily or weekly. Changes could then trigger the creation of a fresh snapshot of the site.</p>
<p>I will admit that there have been services out there for a while that do something similar to what Google has just rolled out. I personally have used <a title="Dapper" href="http://www.dapper.net">Dapper.net</a> to take a standard web page and generate an RSS feed based on updates to the page (sound familiar?). One Dapper.net feed that I created and follow is for the <a title="IFRC News Archive" href="http://www.ifrc.org/news/archives.asp">news archive page for the International Red Cross</a> and can be <a title="Dapper: IFRC News RSS" href="http://www.dapper.net/services/IFRC_News">found here</a>. What is funny is that now they actually have <a title="IFRC News RSS Feed" href="http://www.ifrc.org/news/rss.asp">an official RSS feed for their news</a> that includes exactly what my Dapper.net feed harvested off their news archive page &#8211; but when I built that Dapper feed there was no other way for me to watch for those news updates.</p>
<p>There are lots of different tools out there that aim to archive websites. <a title="Archive-It.org" href="http://www.archive-it.org/">Archive-It</a> is a subscription based service run by <a title="Internet Archive" href="http://www.archive.org">Internet Archive</a> that targets institutions and will archive sites on demand or on a regular schedule. Internet Archive also has an open source crawler called <a title="Heritrix" href="http://crawler.archive.org/index.html">Heritrix</a> for those who are comfortable dealing with the code. Other institutions are building their own software to tackle this too. <a title="Harvard University" href="http://www.harvard.edu/">Harvard University</a> has their own <a title="Harvard: WAX" href="http://wax.lib.harvard.edu/collections/home.do">Web Archive Collection Service (WAX)</a>. The <a title="LiWA: Living Web Archives" href="http://liwa-project.eu/index.php/about/">LiWA (Living Web Archives) Project</a> is based in Germany and aims to &#8220;extend the current state of the art and develop the next generation of Web content capture, preservation, analysis, and enrichment services to improve fidelity, coherence, and interpretability of web archives.&#8221; One could even use something as simple as <a title="PDF my URL" href="http://pdfmyurl.com/">PDFmyURL.com</a> &#8211; an online service that turns any URL into a PDF (be sure to play with the advanced options to make sure you get a wide enough snapshot). I know there are many more possibilities &#8211; these just scratch the surface.</p>
<p>What I like about my idea is that it isn&#8217;t meant to replace these services but rather work in tandem with them. The Internet Archive does an amazing job crawling and archiving many web pages &#8211; but they can&#8217;t archive everything and their crawl frequency may not match up with real world updates to a website. This approach certainly wouldn&#8217;t scale well for huge websites for which you would need to watch for changes on many pages. I am picturing this technique as being useful for small organizations or individuals who just need to make sure that a county government website makeover or a community organization&#8217;s website update doesn&#8217;t get lost in the shuffle. I like the idea of finding clever ways to leverage free services and tools to support those who want to protect a particular niche of websites from being lost.</p>
<p><em>Image Credit:</em> The <a title="RSS Icons" href="http://comingupforair.net/2008/01/rss-icons/">RSS themed image</a> above is by <a title="Matt Forsythe" href="http://comingupforair.net/about/">Matt Forsythe</a>.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/01/26/leveraging-google-readers-page-change-tracking-for-web-page-preservation/">Leveraging Google Reader&#8217;s Page Change Tracking for Web Page Preservation</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2010/01/26/leveraging-google-readers-page-change-tracking-for-web-page-preservation/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Concertina History Online Features Virtual Collaboration and Digitization</title>
		<link>http://www.spellboundblog.com/2010/01/10/concertinas-virtual-collaboration-digitization/</link>
		<comments>http://www.spellboundblog.com/2010/01/10/concertinas-virtual-collaboration-digitization/#comments</comments>
		<pubDate>Sun, 10 Jan 2010 04:56:01 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[audio]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[historical research]]></category>
		<category><![CDATA[learning technology]]></category>
		<category><![CDATA[original order]]></category>
		<category><![CDATA[virtual collaboration]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=792</guid>
		<description><![CDATA[In the early 1960s, my father bought a Wheatstone concertina in London. He tells how he visited the factory where it was made to pick one out and recalls the ledger book in which details about the concertinas were recorded. After a recent retelling of this family classic, I was inspired to see what might [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/01/10/concertinas-virtual-collaboration-digitization/">Concertina History Online Features Virtual Collaboration and Digitization</a></p>
]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a title="Flickr: Concertinas by user rocketlass" href="http://www.flickr.com/photos/rocketlass/470547134/"><img class="size-full wp-image-796  aligncenter" title="concertinas" src="http://www.spellboundblog.com/wp-content/uploads/2010/01/concertinas.jpg" alt="" width="508" height="380" /></a></p>
<p>In the early 1960s, my father bought a Wheatstone concertina in London. He tells how he visited the factory where it was made to pick one out and recalls the ledger book in which details about the concertinas were recorded. After a recent retelling of this family classic, I was inspired to see what might be online related to concertinas. I was amazed!</p>
<p>First I found the <a title="Concertina.com" href="http://www.concertina.com">Concertina Library</a> which presents itself as a &#8216;Digital Reference Collection for Concertinas&#8217;. With <a title="contributing authors to the concertina library" href="http://www.concertina.com/contributors/index.htm">fourteen contributing authors</a>, the site includes in depth articles on concertina <a title="Concertina History" href="http://www.concertina.com/history/index.htm">history</a>, <a title="Concertina Technology" href="http://www.concertina.com/technology/index.htm">technology</a>, <a title="Concertina Music" href="http://www.concertina.com/music/index.htm">music</a>, <a title="Concertina Research" href="http://www.concertina.com/research/index.htm">research</a> and a wide range of <a title="Concertina Systems" href="http://www.concertina.com/concertina-systems/index.htm">concertina systems</a>.</p>
<p>I particularly appreciate the reasons that Robert Gaskins, site creator, lists for the creation of the site on the <a title="About the Concertina Library" href="http://www.concertina.com/about/index.htm">about page</a>:</p>
<blockquote><p>(1) Almost all of the historical material about concertinas has been held in research libraries where access is limited, or in private collections where access may be non-existent.  The reason for this is not that the material is so valuable, but that in the past there was no way to make material of limited interest available to  everyone, so it stayed safely in archives.  The web has provided a way to make this material widely available—partly by the libraries themselves, and partly in collections such as this.</p>
<p>(2) There seems to be a growing number of people working again on the history of concertinas, perhaps in part because research materials are becoming available on the web.  These people are widely scattered, so they don&#8217;t get to meet and discuss their work in person.  But again the web has provided an answer, allowing people to work collaboratively and exchange information across miles and timezones,  and for the resulting articles the web offers worldwide publication at almost no cost.</p></blockquote>
<p>What an eloquent testimonial for the power of the internet to both provide access to once-inaccessible materials and support virtual collaboration within a geographically dispersed community.</p>
<p>Next, I found the <a title="Wheatstone Concertina Ledgers" href="http://horniman.info">Wheatstone Concertina Ledgers</a>. This site features business records (in the form of ledgers) of the C. Wheatstone &amp; Co. stretching from 1830 through 1974 (with some gaps). The originals are held at the Library of the <a title="Horniman Museum" href="http://www.horniman.ac.uk/">Horniman Museum</a> in London. It is a great reference website with a nice interface for paging through the ledgers. Armed with the serial number from my father&#8217;s concertina (36461) I found my way to <a title="Page 88: featuring my father's concertina" href="http://horniman.info/DKNSARC/SD03/PAGES/D3P0880S.HTM">page 88 of a Wheatstone Production Journal</a> from the Dickinson Archives. If I am reading that line properly, his concertina is a 3E model and was made (or maybe sold?) April 25, 1960. I wish that there was documentation online to explain how to read the ledgers. For example, I would love to know what &#8216;Bulletin 3052&#8242; means.</p>
<p>I liked the way that they retained the sense of turning pages in a ledger. Every page of each ledger is included, including front and back end pages and blank pages. I have total confidence that I am seeing the pages in the same order as I would in person.</p>
<p>You can read the <a title="Introduction to the Wheatstone Ledger Digitization Project" href="http://horniman.info/DOCUMNTS/INTRO.HTM">overview and introduction to the project</a>, but what intrigued me more was the very detailed narrative of how this digitization effort was accomplished. In <a title="How the Wheatstone Concertina Ledgers were Digitized" href="http://horniman.info/DOCUMNTS/HOWTO.HTM">How The Wheatstone Concertina Ledgers Were Digitized</a>, we find Robert Gaskins of  the <a title="Concertina.com" href="http://www.concertina.com/">Concertina Library</a> explaining how, with an older model IBM ThinkPad, a consumer grade scanner, and his existing software (Microsoft Office and Macromedia Fireworks), he created a website with 4,500 images and clean, simple navigation. From where I sit, this is a great success story &#8211; a single person&#8217;s dedication can yield fantastic results. You don&#8217;t need the latest and greatest technology to run a successful digitization project. One individual can go a long way through sheer determination and the clever leveraging of what they have on hand.</p>
<p>Back on the <a title="Concertina.com" href="http://www.concertina.com/">Concertina Library</a>&#8216;s about page we find &#8220;There is still a lot of material relevant to the study of concertinas and their history which should be digitized and placed on the web, but has not been so far. Ideas for additional contributors, items, and collections are very welcome.&#8221; If I am following the dates correctly, the Concertina Library has articles dating back to February of 2001, shortly before Mr. Gaskins started planning the ledger digitization project. At the same time as he was collaborating with other concertina enthusiasts to build the Concertina Library,  he was scanning ledgers and creating the <a title="Wheatstone Concertina Ledgers" href="http://horniman.info/">Wheatstone Concertina Ledgers</a> website. Three cheers to Mr. Gaskins for his obvious personal enthusiasm and dedication to virtual collaboration, digitization and well-built websites! Another three cheers for all those who joined the cause and collaborated to create great online resources to support ongoing concertina research from anywhere in the world.</p>
<p>All this started because my father owns a beautiful old concertina. I love it when an innocent web search leads me to find a wealth of online archival materials. Do you have a favorite online archival resource that you stumbled across while doing similar research for family or friends? Please share them in the comments below!</p>
<p><em>Image Credit: </em><a rel="cc:attributionURL" href="http://www.flickr.com/photos/rocketlass/">http://www.flickr.com/photos/rocketlass/</a> / <a rel="license" href="http://creativecommons.org/licenses/by-nc-sa/2.0/">CC BY-NC-SA 2.0</a></p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/01/10/concertinas-virtual-collaboration-digitization/">Concertina History Online Features Virtual Collaboration and Digitization</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2010/01/10/concertinas-virtual-collaboration-digitization/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

