<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Spellbound Blog &#187; access</title>
	<atom:link href="http://www.spellboundblog.com/category/access/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.spellboundblog.com</link>
	<description>Archives, Digital Humanities, Cultural Heritage, Technology</description>
	<lastBuildDate>Fri, 03 Feb 2012 05:43:15 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Digitization Program Site Visit: Archives of American Art</title>
		<link>http://www.spellboundblog.com/2012/02/03/digitization-program-site-visit-archives-of-american-art/</link>
		<comments>http://www.spellboundblog.com/2012/02/03/digitization-program-site-visit-archives-of-american-art/#comments</comments>
		<pubDate>Fri, 03 Feb 2012 05:43:15 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[learning technology]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=1196</guid>
		<description><![CDATA[The image of Alexander Calder above shows him in his studio, circa 1950. It is from a folder titled Photographs: Calder at Work, 1927-1956, undated, part of Alexander Calder&#8217;s Papers held by the Smithsonian Archives of American Art and available online through the efforts of their digitization project. I love that this image capture him in [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2012/02/03/digitization-program-site-visit-archives-of-american-art/">Digitization Program Site Visit: Archives of American Art</a></p>
]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><strong><a href="http://www.aaa.si.edu/collections/images/detail/alexander-calder-his-studio-10308"><img class="wp-image-1242 aligncenter" title="Alexander Calder in his studio, ca. 1950 / unidentified photographer." src="http://www.spellboundblog.com/wp-content/uploads/2012/02/AAA_caldalex_26738.jpg.jpeg" alt="" width="471" height="479" /></a></strong></p>
<p>The image of Alexander Calder above shows him in his studio, circa 1950. It is from a folder titled <a title="Photographs: Calder at Work" href="http://www.aaa.si.edu/collections/container/viewer/Calder-at-Work--189357">Photographs: Calder at Work, 1927-1956, undated</a>, part of <a title="Alexander Calder's Papers" href="http://www.aaa.si.edu/collections/alexander-calder-papers-7294/more">Alexander Calder&#8217;s Papers</a> held by the <a title="Smithsonian Archives of American Art" href="http://www.aaa.si.edu/">Smithsonian Archives of American Art</a> and available online through the efforts of their digitization project. I love that this image capture him in his creative space &#8211; you get to see the happy chaos from which Calder drew his often sleek and sparse sculptures.</p>
<p>Back in October, I had the opportunity to visit with staff of the digitization program for the Smithsonian Archives of American Art along with a group of my colleagues from the World Bank. This is a report on that site visit. It is my hope that these details can help others planning digitization projects – much as it is informing our own internal planning.</p>
<p><strong>Date of Visit:</strong> October 18, 2011</p>
<p><strong>Destination:</strong> <a title="Smithsonian Archives of American Art" href="http://www.aaa.si.edu/">Smithsonian Archives of American Art</a></p>
<p><strong>Smithsonian Archives of American Art Hosts:</strong></p>
<ul>
<li><a title="Karen Weiss" href="http://www.linkedin.com/pub/karen-b-weiss/11/aa2/251">Karen Weiss</a></li>
<li>Many additional staff members</li>
</ul>
<p><strong>Summary: </strong> This visit was two hours in length and consisted of a combination of presentation, discussion and site tour to meet staff and examine equipment.</p>
<p><strong>Background: </strong>The Smithsonian’s Archives of American Art (AAA) program was first funded by a grant from the <a title="Terra Foundation of American Art" href="http://www.terraamericanart.org/">Terra Foundation of American Art</a> in 2005, recently extended through 2016. This funding supports both staff and research.</p>
<p>Their digitization project replaced their existing microfilm program and focuses on digitizing complete collections. Digitization focused on in-house collections (in contrast with collections captured on microfilm from other institutions across the USA as part of their microfilm program).</p>
<p>Over the course of the past 6 years, they have scanned over 110 collections &#8211; a total of 1,000 linear feet – out of an available total of 13,000 linear feet from 4,500 collections. They keep a prioritized list of what they want digitized.</p>
<p>The Smithsonian DAM (digital asset management system) had to be adjusted to handle the hierarchy of EAD and the digitized assets. Master files are stored in the Smithsonian DAM. Files stored in intermediate storage areas are only for processing and evaluation and are disposed of after they have been ingested into the DAM.</p>
<p>Current staffing is two and a half archivists and two digital imaging specialists. One digital imaging specialist focuses on scanning full collections, while the other focuses on on-demand single items.</p>
<p>The website is built in <a title="ColdFusion" href="http://en.wikipedia.org/wiki/ColdFusion">ColdFusion</a> and pulls content from a SQL database. Currently they have no way to post media files (audio, oral histories, video) on the external web interface.</p>
<p>They do not delineate separate items within folders. When feedback comes in from end users about individual items, this information is usually incorporated into the scope note for the collection, or the folder title of the folder containing the item. Full size images in both the image gallery and the full collections are watermarked.</p>
<p>They track the processing stats and status of their projects.</p>
<p><strong>Standard Procedures:</strong></p>
<p><span style="text-decoration: underline;">Full Collection Digitization:</span></p>
<ul>
<li>Their current digitization workflow is based on their microfilm process. The workflow is managed via an internal web-based management system. Every task required for the process is listed, then crossed off and annotated with the staff and date the action was performed.</li>
<li>Collections earmarked for digitization are thoroughly described by a processing archivist.</li>
<li>Finding aids are encoded in <a title="EAD" href="http://www.loc.gov/ead/">EAD</a> and created in XML using <a title="NoteTab" href="http://www.notetab.com/">NoteTab Pro</a> software.</li>
<li><a title="MARC" href="http://en.wikipedia.org/wiki/MARC_standards">MARC</a> records are created when the finding aid is complete. The summary information from the MARC record is used to create the summary of the collection published on the website.</li>
<li>Box numbers and folder numbers are assigned and associated with a finding aid. The number of the box and folder are all a scanning technician needs.</li>
<li>A ‘scanning information worksheet’ provides room for notes from the archivist to the scanning technician.  It provides the opportunity to indicate which documents should not be scanned. Possible reasons for this are duplicate documents or those containing personal identifying information (PIP).</li>
<li>A directory structure is generated by a script based on the finding aid, creating a directory folder for each physical folder which exists for the collection. Images are saved directly into this directory structure. The disk space to hold these images is centrally managed by the Smithsonian and automatically backed up.</li>
<li>All scanning is done in 600dpi color, according to their internal  guidelines. They frequently have internal projects which demand high resolution images for use in publication.</li>
<li>After scanning is complete, the processing archivist does the post scanning review before the images are pushed into the DAM for web publication.</li>
<li>Their policy is to post everything from a digitized collection, but they do support a take-down policy.</li>
<li>A recent improvement was made in January, 2010. At that time they relaunched the site to include all of their collections co-located on the same list, both digitized and non-digitized.</li>
</ul>
<p><span style="text-decoration: underline;">On Demand Digitization:</span></p>
<ul>
<li>Patrons may request the digitization of individual items.</li>
<li>These requests are evaluated by archivists to determine if it is appropriate to digitize the entire folder (or even box) to which the item belongs.</li>
<li>Requests are logged in a paper log.</li>
<li>Item level scanning ties back to an item level record with an item ID. There is an ‘Online Removal Notice’ to create item level stub.</li>
<li>An item level cataloger describes the content after it is scanned.</li>
<li>Unless there is an explicit copyright or donor restriction, the items is put online in the <a title="Archives of American Art Image Gallery" href="http://www.aaa.si.edu/collections/images">Image Gallery</a> (which currently has 12,000 documents).</li>
<li>Access to images is provided by keyword searching.</li>
<li>Individual images are linked back to the archival description for the collection from which they came.</li>
</ul>
<p><strong>Improvements/Changes they wish for: </strong></p>
<ul>
<li>They currently have no flexibility to make changes in the database nimbly. It is a tedious process to change the display and each change requires a programmer.</li>
<li>They would like to consider a move to open source software or to use a central repository – though they have concerns about what other sacrifices this would require.</li>
<li>Show related collections, list connected names (currently the only options for discovery are an A-Z list of creators or keyword search).</li>
<li>Ability to connect to guides and other exhibits.</li>
</ul>
<p><strong>References:</strong></p>
<ul>
<li><a title="Archives of American Art Image Gallery" href="http://www.aaa.si.edu/collections/images">Image Gallery</a></li>
<li><a title="Archives of American Art" href="http://www.aaa.si.edu/">Main Website</a></li>
<li><a title="Digitization Project" href="http://www.aaa.si.edu/collections/projects/terra">Digitization Project</a></li>
<li><a title="Technical Documentation" href="http://www.aaa.si.edu/collections/documentation">Technical Documentation</a> &#8211; shares internal procedures and guidelines</li>
<li><a title="OCLC rapid capture paper" href="http://www.oclc.org/research/publications/library/2011/2011-04r.htm">OCLC rapid capture paper</a></li>
<li><a title="Scanning equipment" href="http://www.digitaltransitions.com/page/divison-of-cultural-heritage-products">Scanning equipment</a></li>
</ul>
<p><em>Image Credit:</em> Alexander Calder papers, Archives of American Art, Smithsonian Institution.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2012/02/03/digitization-program-site-visit-archives-of-american-art/">Digitization Program Site Visit: Archives of American Art</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2012/02/03/digitization-program-site-visit-archives-of-american-art/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Digitization Program Site Visit: University of Maryland</title>
		<link>http://www.spellboundblog.com/2011/12/12/digitization-program-site-visit-university-of-maryland/</link>
		<comments>http://www.spellboundblog.com/2011/12/12/digitization-program-site-visit-university-of-maryland/#comments</comments>
		<pubDate>Mon, 12 Dec 2011 04:58:26 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=1194</guid>
		<description><![CDATA[I recently had the opportunity to visit with staff of the University of Maryland, College Park&#8217;s Digital Collections digitization program along with a group of my colleagues from the World Bank. This is a report on that site visit. It is my hope that these details can help others planning digitization projects &#8211; much as [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2011/12/12/digitization-program-site-visit-university-of-maryland/">Digitization Program Site Visit: University of Maryland</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a href="http://digital.lib.umd.edu/archivesum/?pid=umd:2258"><img class="size-full wp-image-1200    alignright" style="margin-left: 3px; margin-right: 3px;" title="University Archives, Special Collections, University of Maryland Libraries" src="http://www.spellboundblog.com/wp-content/uploads/2011/11/univarch.000969.0001.jpg" alt="" width="320" height="251" /></a></p>
<p style="text-align: left;">I recently had the opportunity to visit with staff of the University of Maryland, College Park&#8217;s Digital Collections digitization program along with a group of my colleagues from the World Bank. This is a report on that site visit. It is my hope that these details can help others planning digitization projects &#8211; much as it is informing our own internal planning.</p>
<p><strong>Date of Visit:</strong> October 13, 2011</p>
<p><strong>Destination:</strong> University of Maryland, Digital Collections</p>
<p><strong>University of Maryland Hosts:</strong></p>
<ul>
<li><a title="Jennie Levine Knies, Manager, Digital Collections" href="http://www.linkedin.com/in/jenniealevine">Jennie Levine Knies, Manager, Digital Collections</a></li>
<li><a title="Alexandra Carter, Digital Imaging Librarian" href="http://www.linkedin.com/pub/alexandra-carter/1a/814/1b3">Alexandra Carter, Digital Imaging Librarian</a></li>
</ul>
<p><strong>Summary: </strong> This visit was two hours in length and consisted of a one hour presentation and Q&amp;A session with Jennie Levine Knies, Manager of Digital Collections followed by a one hour tour and Q&amp;A session with Alexandra Carter, Digital Imaging Librarian.</p>
<p><strong>Background: </strong>The <a title="Digital Collections" href="http://digital.lib.umd.edu/">Digital Collections of the University of Maryland</a> was launched in 2006 using <a title="Fedora" href="http://www.fedora-commons.org/">Fedora Commons</a>. It is distinct from the ‘Digital Repository at the University of Maryland’, aka <a title="DRUM" href="http://drum.lib.umd.edu/">DRUM</a>, which is built on <a title="DSpace" href="http://www.dspace.org/">DSpace</a>. DRUM contains faculty-deposited documents, a library-managed collection of UMD theses and dissertations, and collections of technical reports. The Digital Collections project focuses on digitization of photographs, postcards, manuscripts &amp; correspondence – mostly based on patron demand. In addition, materials are selected for digitization based on the need for thematic collections to support events, such as their recent civil war exhibition.</p>
<p>After a period of full funding, there has been a fall off in funding which has prevented any additional changes to the Fedora system.</p>
<p>Another project at UMD involves digitization of Japanese childrens&#8217; books (<a href="http://digital.lib.umd.edu/prange.jsp">George W. Prange Collection</a>) and currently uses “in house outsourcing”. In this scenario, contractors bring all their equipment and staff on site to perform the digitization process.</p>
<p><strong>Standard Procedures:</strong></p>
<ul>
<li>Requests must be made using a combination of the ‘Digital Request Cover Sheet’ and ‘<a title="Digital Surrogate Request Sheet" href="http://www.lib.umd.edu/special/forms/digorderform.pdf">Digital Surrogate Request Sheet</a>. These sheets are then reviewed for completeness by the curator under whose jurisdiction the collection falls. Space on the request forms is provided so that the curator may add additional notes to aid in the digitization process. They decide if it is worth digitizing an entire folder when only specific item(s) are requested. Standard policy is to aim for two week turnaround for digitization based on patron request.</li>
<li>The digital request is given a code name for easy reference. They choose these names alphabetically.</li>
<li>Staff are assigned to digitize materials. This work is often done by student workers using one of three <a title="Epson Expression 10000XL" href="http://www.epson.com/cgi-bin/Store/jsp/Product.do?BV_UseBVCookie=yes&amp;sku=E10000XL-PH">Epson 10000 XL</a> flatbed scanners. There is also a <a title="Zeutschel OS 12000" href="http://www.zeutschel.com/products/book_copiers_os12000_bc.html">Zeutschel OS 12000</a> overhead scanner available for materials which cannot be handled by the flatbed scanners.</li>
<li>Alexandra reviews all scans for quality.</li>
<li>Metadata is reviewed by another individual.</li>
<li>When both the metadata &amp; image quality has been reviewed, materials are published online.</li>
</ul>
<p><strong>Improvements/Changes they wish for: </strong></p>
<ul>
<li>Easier way to create a web ‘home’ for collections, currently many do not have a main page and creating one requires the involvement of the IT department.</li>
<li>Option for users to save images being viewed</li>
<li>Option to upload content to their website in PDF format</li>
<li>Way to associate transcriptions with individual pages</li>
<li>More granularity for workflow: currently the only status they have to indicate that a folder or item is ready for review is ‘Pending’. Since there are multiple quality control activities that must be performed by different staff, currently they must make manual lists to track what phases of QA are complete for which digitized content.</li>
<li>Reduce data entry.</li>
<li>Support for description at both the folder and item level at the same time. Currently description is only permitted either at the folder level OR at the item level.</li>
<li>Enable search and sorting by date added to system. This data is captured, but not exposed.</li>
</ul>
<p><strong>Lessons Learned:</strong></p>
<ul>
<li>Should have adopted an existing metadata standard rather than creating their own.</li>
<li>People do not use the ‘browse terms’ – do not spend a lot of time working on this</li>
</ul>
<p><strong>Resources:</strong></p>
<ul>
<li><a title="Digital Content Guidelines" href="http://www.lib.umd.edu/dcr/publications/SelectionCriteriaforDigitalObjects2010.pdf">Digital Content Guidelines: Selection Criteria for Digital Objects</a></li>
</ul>
<p><em>Image Credit:</em> Women students in a green house during a Horticulture class at the University of Maryland, 1925. University Archives, Special Collections, University of Maryland Libraries</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2011/12/12/digitization-program-site-visit-university-of-maryland/">Digitization Program Site Visit: University of Maryland</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2011/12/12/digitization-program-site-visit-university-of-maryland/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SXSW Panel Proposal &#8211; Archival Records Online: Context is King</title>
		<link>http://www.spellboundblog.com/2011/08/31/sxsw-panel-proposal-archival-records-online-context-is-king/</link>
		<comments>http://www.spellboundblog.com/2011/08/31/sxsw-panel-proposal-archival-records-online-context-is-king/#comments</comments>
		<pubDate>Wed, 31 Aug 2011 23:46:47 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[context]]></category>
		<category><![CDATA[interface design]]></category>
		<category><![CDATA[SXSW]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=1170</guid>
		<description><![CDATA[I have a panel up for evaluation on the SXSW Interactive Panel Picker titled Archival Records Online: Context is King. The evaluation process for SXSW panels is based on a combination of staff choice, advisory board recommendations and public votes. As you can see from the pie chart shown here (thank you SXSW website for [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2011/08/31/sxsw-panel-proposal-archival-records-online-context-is-king/">SXSW Panel Proposal &#8211; Archival Records Online: Context is King</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a href="http://panelpicker.sxsw.com"><img class="alignright size-full wp-image-1173" title="Panel Picker Pie Chart" src="http://www.spellboundblog.com/wp-content/uploads/2011/08/PanelPicker12_pie_RED.jpg" alt="" width="145" height="173" /></a>I have a panel up for evaluation on the SXSW Interactive Panel Picker titled <a title="SXSWi: Archival Records Online: Context is King" href="http://panelpicker.sxsw.com/ideas/view/12801">Archival Records Online: Context is King</a>. The evaluation process for SXSW panels is based on a combination of staff choice, advisory board recommendations and public votes. As you can see from the pie chart shown here (thank you SXSW website for the great graphic), <strong>30%</strong> of the selection criteria is based on public votes. That is where you come in. Voting is open through 11:59 pm Central Daylight Time on Friday, September 2. To vote in favor of my panel, all you need to do is create a free account over on <a title="SXSW Panel Picker" href="http://panelpicker.sxsw.com/">SXSW Panel Picker</a> and then find <a title="SXSWi: Archival Records Online: Context is King" href="http://panelpicker.sxsw.com/ideas/view/12801">Archival Records Online: Context is King</a> and give it a big thumbs up.</p>
<p>If my panel is selected, I intend this session to give me the chance to review all of the following:</p>
<ol>
<li>What are the special design requirements of archival records?</li>
<li>What are the biggest challenges to publishing archival records online?</li>
<li>How can archivists, designers and developers collaborate to build successful web sites?</li>
<li>Why is metadata important?</li>
<li>How can search engine optimization (SEO) inform the design process?</li>
</ol>
<p>All of this ties into what I have been pondering, writing about and researching for the past few years related to getting archival records online. So many people are doing such amazing work in this space. I want to show off the best of the best and give attendees some takeaways to help them build websites that make it easy to see the context of anything they find in their search.</p>
<p>While archival records have a very particular dependence on the effective communication of context &#8211; I also think that this is a lesson that can improve interface design across the board. These are issues that UI and IA folks are always going to be worrying about. SXSW is such a great opportunity for cross pollination. Conferences outside the normal archives, records management and library conference circuit give us a chance to bring fresh eyes and attention to the work being done in our corner of the world.</p>
<p>If you like the idea of this session, please take a few minutes to go sign up at the <a title="SXSW Panel Picker" href="http://panelpicker.sxsw.com/">SXSW Panel Picker</a> and give <a title="SXSWi: Archival Records Online: Context is King" href="http://panelpicker.sxsw.com/ideas/view/12801">Archival Records Online: Context is King</a> a thumbs up. You don&#8217;t need to be planning to attend in order to cast your vote, though after you start reading through all the great panel ideas you might change your mind!</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2011/08/31/sxsw-panel-proposal-archival-records-online-context-is-king/">SXSW Panel Proposal &#8211; Archival Records Online: Context is King</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2011/08/31/sxsw-panel-proposal-archival-records-online-context-is-king/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Support EAD Tagging Research</title>
		<link>http://www.spellboundblog.com/2010/12/06/support-ead-tagging-research/</link>
		<comments>http://www.spellboundblog.com/2010/12/06/support-ead-tagging-research/#comments</comments>
		<pubDate>Mon, 06 Dec 2010 15:27:44 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[archival community]]></category>
		<category><![CDATA[ArchivesZ]]></category>
		<category><![CDATA[EAD]]></category>
		<category><![CDATA[metadata]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=1069</guid>
		<description><![CDATA[In case you haven&#8217;t seen this request via other channels, please consider supporting the research effort described below into how different organizations encode finding aids using EAD. As someone who has dug into the gory details of eleven institutions&#8217; finding aids to extract data for my ArchivesZ project, I am here to tell you that [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/12/06/support-ead-tagging-research/">Support EAD Tagging Research</a></p>
]]></description>
			<content:encoded><![CDATA[<p>In case you haven&#8217;t seen this request via other channels, please consider supporting the research effort described below into how different organizations encode finding aids using EAD. As someone who has dug into the gory details of eleven institutions&#8217; finding aids to extract data for my ArchivesZ project, I am here to tell you that this work is VERY important. With better standards in place we will have a better foundation upon which to create interesting new tools and services to support archivists and researchers.</p>
<p>Is part of your job is to encode finding aids in EAD? Then please ask if you can send a dozen of them to the researchers on this project!</p>
<blockquote><p><strong>Seeking EAD records from repositories that have implemented EAD</strong></p>
<p>Standards have been entering the archival lexicon at a fast pace to ensure data reliability, enable data aggregation, and manage data over the long term. However, we have not yet examined the use of these standards across the archival community. As we  move into the next phase of standards-creation, a broad look at current implementations will help to inform the next  generation of these standards. To do this, Kathy Wisser (Simmons College) and Jackie Dean (UNC Chapel Hill) are conducting research on EAD tag usage in the encoding community.</p>
<p>This project is intended to inform the TS-EAD revision process of the standard, and results will be disseminated through traditional publication avenues.</p>
<p>We are seeking a sample of encoded finding aids from institutions that have implemented EAD.  If you are willing to participate in this project, please submit via electronic mail 12 to 15 finding aids to eadtagresearch@gmail.com by December 15, 2010.</p>
<p>The goal of the project is to identify encoding behavior and <strong>not</strong> to evaluate the quality of the encoding or the content of the finding aid. We will be noting the presence and absence of elements and attributes and the way that elements are used within the context of an EAD instance.</p>
<p>All results will be <strong>anonymized</strong>; no institution-specific information will be linked to the results.  Institutions willing to participate will be acknowledged.</p>
<p>In order to obtain an accurate account of the use of the standard, we are looking for EAD instances from as many institutions as possible. We hope you will consider contributing to this effort.</p>
<p>If you have any questions about the project, please contact:</p>
<ul>
<li>Kathy Wisser (Simmons College &#8211; wisser@simmons.edu)</li>
<li>Jackie Dean (UNC Chapel Hill &#8211; jdean@email.unc.edu)</li>
</ul>
</blockquote>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/12/06/support-ead-tagging-research/">Support EAD Tagging Research</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2010/12/06/support-ead-tagging-research/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Topic Modeling, Auto-Classification and Archival Description</title>
		<link>http://www.spellboundblog.com/2010/04/27/topic-modeling-auto-classification-archival-description/</link>
		<comments>http://www.spellboundblog.com/2010/04/27/topic-modeling-auto-classification-archival-description/#comments</comments>
		<pubDate>Tue, 27 Apr 2010 06:28:08 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[interface design]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[text mining]]></category>
		<category><![CDATA[what if]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=963</guid>
		<description><![CDATA[In an example of Twitter serendipity, @silverasm&#8216;s (Aditi Muralidharan) tweet pointed me to @historying&#8216;s blog post about Topic Modeling. In this post Cameron Blevins explains the results of using the topic modeling feature of UMass Amherst&#8216;s MAchine Learning for LanguagE Toolkit (MALLET) on the text of Martha Ballard’s Diary. I have spent lot of time [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/04/27/topic-modeling-auto-classification-archival-description/">Topic Modeling, Auto-Classification and Archival Description</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a href="http://mallet.cs.umass.edu/index.php"><img class="alignright size-full wp-image-964" title="MALLET logo" src="http://www.spellboundblog.com/wp-content/uploads/2010/04/logo3.png" alt="" width="215" height="95" /></a>In an example of Twitter serendipity, <a title="Twitter: silverasm" href="http://twitter.com/silverasm">@silverasm</a>&#8216;s (Aditi Muralidharan) <a title="tweet about text mining" href="http://twitter.com/silverasm/statuses/12842112825">tweet</a> pointed me to <a title="Twitter: historying" href="http://twitter.com/historying">@historying</a>&#8216;s <a title="Topic Modeling Martha Ballard’s Diary" href="http://historying.org/2010/04/01/topic-modeling-martha-ballards-diary/">blog post about Topic Modeling</a>. In this post Cameron Blevins explains the results of using the <a title="MALLET: Topic Modeling" href="http://mallet.cs.umass.edu/topics.php">topic modeling</a> feature of <a title="UMass Amherst" href="http://www.umass.edu/">UMass Amherst</a>&#8216;s <a title="MAchine Learning for LanguagE Toolkit" href="http://mallet.cs.umass.edu/index.php">MAchine Learning for LanguagE Toolkit</a> (MALLET) on the text of <a title="Martha Ballard's Diary Online" href="http://dohistory.org/diary/">Martha Ballard’s Diary</a>.</p>
<p>I have spent lot of time thinking about how to generate thematic overviews of groups of archival collections. My information visualization project, <a title="ArchivesZ Blog Posts" href="http://www.spellboundblog.com/category/archivesz/">ArchivesZ</a>, aims to provide ways of understanding aggregated archival description data, both from a single institution or across institutional boundaries. Now I find myself wondering if text mining with a tool like MALLET might generate smart topic groupings more elegantly than fighting with the wide range of non-standardized collection subjects.</p>
<p><strong>Topic Modeling with MALLET</strong></p>
<p>To get a sense of what MALLET generates, see the excerpt below from Blevins&#8217;s post:</p>
<blockquote><p>With some tinkering, MALLET generated a list of thirty topics  comprised of twenty words each, which I then labeled with a descriptive  title. Below is a quick sample of what the program<em> </em>“thinks” are  some of the topics in the diary:</p>
<ul>
<li><strong>MIDWIFERY:</strong> birth deld safe morn receivd calld left  cleverly pm labour fine reward arivd infant expected recd shee born  patient</li>
<li><strong>CHURCH: </strong>meeting attended  afternoon reverend worship foren mr famely performd vers attend public  supper st service lecture discoarst administred supt</li>
<li><strong>DEATH:</strong> day yesterday  informd morn years death ye hear expired expird weak dead las past heard  days drowned departed evinn</li>
<li><strong>GARDENING:</strong> gardin sett  worked clear beens corn warm planted matters cucumbers gatherd potatoes  plants ou sowd door squash wed seeds</li>
</ul>
</blockquote>
<p>He goes on to explain that &#8220;MALLET also allows us to track those topics across the text.&#8221; What if, instead of text mining a diary, we pumped the descriptions of every archival collection from a single institution into MALLET. Of course we would need a good list of stop words including such common terms as archives, history, sources and records. But I wonder how the topics MALLET suggests would compare to the official subjects associated with each collection? Could this give us a broad overview of the topics covered by a specific repository and give us a new way to build paths to the collections based on topic?</p>
<p><strong>Auto-Classification Using Castanet</strong></p>
<p>Text miner <a title="Aditi Muralidharan" href="http://www.cs.berkeley.edu/~aditi/">Aditi Muralidharan</a> also posted recently on this theme in <a title="Castanet: automatically generating a browsing structure for a collection" href="http://mininghumanities.com/2010/04/24/castanet-automatically-generating-a-browsing-structure-for-a-collection/">Castanet: automatically generating a browsing structure for a collection</a> and explains:</p>
<blockquote><p>Castanet automatically carves a sub-structure from the hierarchical  concept dictionary, WordNet (<a href="http://wordnet.princeton.edu/">http://wordnet.princeton.edu</a>),  and matches items in the collection to one or many appropriate places  within that hierarchy. Then, after some automated trimming and  flattening, the result is a hierarchical browsing system.</p></blockquote>
<p>I have heard of Castanet before via the <a title="Flamenco Search Interface Project" href="http://flamenco.berkeley.edu/">Flamenco Search Interface Project</a>. Apparently Muralidharan did a project using Castanet last summer to create <a href="http://go2.wordpress.com/?id=725X1342&amp;site=textdigihum.wordpress.com&amp;url=http%3A%2F%2Forange.sims.berkeley.edu%2Fcgi-bin%2Fflamenco.cgi%2Fflickr%2FFlamenco&amp;sref=http%3A%2F%2Fmininghumanities.com%2F2010%2F04%2F24%2Fcastanet-automatically-generating-a-browsing-structure-for-a-collection%2F">a category system</a> for <a title="Flickr Commons" href="http://www.flickr.com/commons">Flickr Commons</a> images based on the images&#8217;  tags which is then rendered using a Flamenco interface. I include a partial screen-shot below to give you a taste of what the navigation of images feels like a few levels down in the hierarchy. I love the classification of &#8216;Group Action&#8217; then filtered by a sub-classification of &#8216;Commerce&#8217;. The first images shown are of &#8216;horse trading&#8217; &#8211; with additional headings and images beneath them as well as additional filter options on the left.</p>
<p style="text-align: center;"><a title="Flickr Commons: group_action &gt; commerce" href="http://orange.sims.berkeley.edu/cgi-bin/flamenco.cgi/flickr/Flamenco?q=actX:322&amp;group=actX"><img class="aligncenter size-full wp-image-966" title="Flickr Commons Images via Canasta &amp; Flamenco" src="http://www.spellboundblog.com/wp-content/uploads/2010/04/flickr-canasta.jpg" alt="" width="547" height="308" /></a></p>
<p><strong>What If?</strong></p>
<p>What if we pulled all the English language archival descriptions from around the world as our original data set. If we used this data for topic modeling, our subjects clusters would be cross-institutional. Maybe we could map the local institution assigned subjects to the topic model generated topics for each collection and get a sort of automated crosswalk for finding related collections. If we used the local institution assigned subjects from the archival descriptions for Canasta style auto-classification, maybe we could generate a way to hierarchically browse collections topically.</p>
<p>Both MALLET and Flamenco are open source (I am not sure of the status of Castanet) and, as I discovered working on ArchivesZ, many institutions will share their archival description data for a good cause. So &#8211; is this a good cause? I need to tease these ideas out a bit more, but what do you all think of it at first blush? Feasible? Interesting? Worthwhile experiments?</p>
<p><em>Image Credits:</em> MALLET logo from <a title="MALLET Homepage" href="http://mallet.cs.umass.edu/index.php">MALLET homepage</a>. Images in screen shot from <a title="Flickr Commons" href="http://www.flickr.com/commons">Flickr Commons</a> with no known copyright.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/04/27/topic-modeling-auto-classification-archival-description/">Topic Modeling, Auto-Classification and Archival Description</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2010/04/27/topic-modeling-auto-classification-archival-description/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>National Archives Transitions to Flickr Commons Membership</title>
		<link>http://www.spellboundblog.com/2010/02/16/national-archives-transitions-flickr-commons/</link>
		<comments>http://www.spellboundblog.com/2010/02/16/national-archives-transitions-flickr-commons/#comments</comments>
		<pubDate>Tue, 16 Feb 2010 06:57:51 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[GIS]]></category>
		<category><![CDATA[learning technology]]></category>
		<category><![CDATA[outreach]]></category>
		<category><![CDATA[photography]]></category>
		<category><![CDATA[virtual collaboration]]></category>
		<category><![CDATA[web 2.0]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=824</guid>
		<description><![CDATA[Even with the recent announcement that the Flickr Commons is not currently accepting new applications, there are clearly still applications being processed. NARA has been on Flickr since February of 2009 and loaded 49 sets of images. As announced in a recent press release, on the first of February 2010 Flickr flipped the switch and all the images in the The U.S. National Archives' photostream was shifted over into the Commons.<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/02/16/national-archives-transitions-flickr-commons/">National Archives Transitions to Flickr Commons Membership</a></p>
]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a href="http://www.flickr.com/photos/usnationalarchives/3660777810/"><img class="size-full wp-image-852 aligncenter" title="Flickr Commons: NARA: Ladies in Gas Masks" src="http://www.spellboundblog.com/wp-content/uploads/2010/02/3660777810_2049a5718f.jpg" alt="Ladies in Gas Masks" width="428" height="424" /></a></p>
<p style="text-align: left;">Even with the recent announcement that the <a title="Flickr Commons not accepting new applications" href="http://www.archivesnext.com/?p=932">Flickr Commons is not currently accepting new applications</a>, there are clearly still applications being processed. NARA has been on Flickr since February of 2009 and loaded 49 sets of images. As announced in a recent <a title="Press Release: NARA joins Flickr Commons" href="http://www.archives.gov/press/press-releases/2010/nr10-57.html">press release</a>, on the first of February 2010 Flickr flipped the switch and all the images in the <a title="Flickr: The U.S. National Archives' photostream" href="http://www.flickr.com/photos/usnationalarchives/">The U.S. National Archives&#8217; photostream</a> was shifted over into the <a title="Flickr Commons" href="http://www.flickr.com/commons">Commons</a>.</p>
<p>The <a title="NARA Flickr Commons Sets" href="http://www.flickr.com/photos/usnationalarchives/sets/">49 sets</a> are sorted into <a title="NARA Flickr Commons Collections" href="http://www.flickr.com/photos/usnationalarchives/collections/">4 collections</a>:</p>
<ul>
<li><a title="Historical Photographs and Documents" href="http://www.flickr.com/photos/usnationalarchives/collections/72157620415261553/">Historical Photographs and Documents</a> (19 sets) &#8211; including NARA favorites like <a title="Flickr Commons: We Can Do It!" href="http://www.flickr.com/photos/usnationalarchives/3678696585/in/set-72157620680828773/">Rosie the Riveter</a> and <a title="Flickr Commons: Nixon and Elvis" href="http://www.flickr.com/photos/usnationalarchives/3679494978/in/set-72157620680828773/">Nixon and Elvis</a> and documents from regional archives across the country.</li>
<li><a title="DOCUMERICA Project by the Environmental Protection Agency " href="http://www.flickr.com/photos/usnationalarchives/collections/72157620729903309/">DOCUMERICA Project by the Environmental Protection Agency</a> (27 sets) &#8211; one set dedicated to top picks and the rest organized by photographer. Interestingly, NARA&#8217;s website has indexed the 15,000+ images from this project by <a title="Documerica by subject" href="http://www.archives.gov/research/arc/topics/environment/documerica-topics.html">subject</a> and by <a title="Documerica by location" href="http://www.archives.gov/research/arc/topics/environment/documerica-geographic.html">location</a>. I wonder how the picked which image from DOCUMERICA to port over to Flickr?</li>
<li><a title="Mathew Brady Civil War Photographs" href="http://www.flickr.com/photos/usnationalarchives/collections/72157622495226723/">Mathew Brady Civil War Photographs</a> <a href="http://www.flickr.com/photos/usnationalarchives/3995277577/in/set-72157622549882756"><img class="alignright size-full wp-image-850" title="Poplar Grove Church" src="http://www.spellboundblog.com/wp-content/uploads/2010/02/3995277577_2c4b28e495_m.jpg" alt="" width="240" height="193" /></a>(2 sets) &#8211; currently 473 out of the 6,066 digitized Mathew Brady images are uploaded into the Commons. The images posted in the Commons are available in a much higher resolution than they are within ARC. A great example from this collection is the image of the <a title="Flickr Commons: Poplar Church" href="http://www.flickr.com/photos/usnationalarchives/3995277577/in/set-72157622549882756">Poplar Church</a> (image shown to right) available as a 600 x 483 GIF on ARC and as a 3000 x 2416 JPG on Flickr. This image also has gotten a nice set of comments and tags.</li>
<li><a title="Flickr: Development and Public Works" href="http://www.flickr.com/photos/usnationalarchives/collections/72157622960946660/">Development and Public Works</a> (1 set) &#8211; the only set in this collection consists of images taken to support the Flathead Irrigation Project. &#8220;The Project was initiated to determine rights and distribute water originating on the Flathead Indian Agency in Montana to both tribal and non-tribal land.&#8221; These images seem to be the same resolution on both archives.gov and Flickr.</li>
</ul>
<p>In honor of this transition, NARA posted a new set of <a title="Ansel Adams Photographs" href="http://www.flickr.com/photos/usnationalarchives/sets/72157623296214442/">220 Ansel Adams photographs</a>. One of the first comments on the set was &#8220;low-res scans? Pretty big letdown.&#8221; Fine question. As noted above, other images from NARA in the Commons much larger than the 600 x 522 that seems to be available for the Ansel Adams images. It would be great to have a clear explanation about available resolutions published along with each new set of images.</p>
<p>NARA has published this simple <a title="NARA Flickr Commons Rights Statement" href="http://www.archives.gov/social-media/flickr-faqs.html#9">rights statement for all NARA images in the Commons</a>:</p>
<blockquote><p>All of the U.S. National Archives&#8217; images that are part of <a href="http://www.archives.gov/global-pages/exit.html?link=%20http://www.flickr.com/commons/">The Flickr Commons</a> are marked &#8220;no known copyright restrictions.&#8221; This means the U.S. National Archives is unaware of any copyright restrictions on the publication, distribution, or re-use of those particular photos. Their use restriction status in our online catalog is &#8220;unrestricted.&#8221; Therefore, no written permission is required to use them.</p></blockquote>
<p>NARA has also posted an official <a title="NARA Flickr Comment Policy" href="http://www.archives.gov/social-media/photo-comment-policy.html">Photo Comment and Posting Policy</a> and a fairly extensive <a title="NARA Flickr FAQ" href="http://www.archives.gov/social-media/flickr-faqs.html">FAQ</a> about the images they have post on Flickr. I do wish that there was a simpler way to request reprints of images from the Commons. Most of the NARA images have this standard sentence &#8211; but for someone not familiar with NARA and more accustom to one click ordering, the instructions seem very complex:</p>
<blockquote><p>For information about ordering reproductions of photographs held by the Still Picture Unit, visit: <a rel="nofollow" href="http://www.archives.gov/research/order/still-pictures.html">www.archives.gov/research/order/still-pictures.html</a></p></blockquote>
<p><a href="http://www.flickr.com/photos/35740357@N03/3952613735"><img class="alignright size-full wp-image-861" title="World Trade Center" src="http://www.spellboundblog.com/wp-content/uploads/2010/02/3952613735_73a0f8767b_m.jpg" alt="" width="163" height="240" /></a>I also wish that more of the images had location information assigned &#8211; only 113 of the images show up on the fun to explore <a title="NARA Flickr Map View" href="http://www.flickr.com/photos/usnationalarchives/map/">map view</a>. At first glance it looks as if this information is populated only for images taken near airports. There are many images that include a location based subject in the image description posted on Flickr, yet do not include geographic metadata that would permit the image to be shown on a map. <a href="http://www.flickr.com/photos/35740357@N03/3952613735"><img class="alignleft size-full wp-image-866" title="World Trade Center Tags" src="http://www.spellboundblog.com/wp-content/uploads/2010/02/world-trade-center-tags.jpg" alt="" width="123" height="144" /></a>The one image I did find that was not at an airport but did include geographic metadata is this <a title="Flickr World Trade Center" href="http://www.flickr.com/photos/35740357@N03/3952613735">image of the World Trade Center</a> assigned to the <a title="Flickr: NYC Financial District" href="http://www.flickr.com/places/United%20States/New%20York/New%20York/Financial%20District/">NYC Financial District Flickr Location</a>. While I could add a location related tag to NARA&#8217;s images, there does not appear any way for the general public to suggest location metadata.</p>
<p>One odd note about this and other World Trade Center images &#8211; the auto-generated tags have broken up the building name very oddly as shown in my screen clip on the left.</p>
<p>Another fun way way to explore the NARA Flickr images is to visit <a title="NARA Flickr Archives Page" href="http://www.flickr.com/photos/usnationalarchives/archives/">the &#8216;Archives&#8217; page</a> (slightly hilariously titled &#8220;U.S. National Archives&#8217; Archives&#8221;). Here we can browse photos based on when they were uploaded to Flickr or when they were taken. Those images that include a specific date can be viewed on a calendar (such as these <a title="NARA Flickr Images from 1918" href="http://www.flickr.com/photos/usnationalarchives/archives/date-taken/1918/calendar/">images from 1918</a>) or in a list view (those same <a title="NARA Flickr Images from 1918 - List View" href="http://www.flickr.com/photos/usnationalarchives/archives/date-taken/1918/">images from 1918 as a list</a>), while those taken &#8216;circa&#8217; a year can be viewed in a list with all other images from sometime that year (such as these <a title="NARA Flickr Images Circa 1824" href="http://www.flickr.com/photos/usnationalarchives/archives/circa/1824/">images from circa 1824</a>).</p>
<p>Beyond all the additional tags and content collected via comments on these images, I think that being able to find NARA images based on a map, calendar or tag is the real magic of the commons. The increased opportunities for access to these images cannot be overstated.</p>
<p><a href="http://www.spellboundblog.com/wp-content/uploads/2010/02/3953239497_35477bd7b7_m.jpg"><img class="alignleft size-full wp-image-865" title="Sunflower" src="http://www.spellboundblog.com/wp-content/uploads/2010/02/3953239497_35477bd7b7_m.jpg" alt="" width="240" height="163" /></a>Take this image of a <a title="NARA Flickr Sunflower" href="http://www.flickr.com/photos/usnationalarchives/3953239497/in/set-72157622453808416/">sunflower</a>. If you <a title="NARA page for the sunflower image" href="http://arcweb.archives.gov/arc/action/ExternalIdSearch?id=544590">visit this image on archives.gov</a>, you can certainly find the image and view it &#8211; but good luck finding all the images of flowers as quickly as this <a title="NARA Flickr Tag Flower" href="http://www.flickr.com/photos/usnationalarchives/tags/flower/">Flickr tag page for NARA images of flowers</a> can. Even looking at the special <a title="Documerica By Topic" href="http://www.archives.gov/research/arc/topics/environment/documerica-topics.html">Documerica by Topic</a> page doesn&#8217;t get me much closer to finding an image of a flower.</p>
<p>It will be fun to watch what else NARA chooses to upload to the Commons. I vote for more images that are assigned metadata such that they show up on the map and calendar. I will also put your mind at ease by telling you that the lovely ladies at the top of this post are their because their image is one of the most popular uploaded by NARA to date (based on it having been marked a favorite by 88 individuals). The only image I could find with more fans was the classic <a title="Flickr: Nixon and Elvis" href="http://www.flickr.com/photos/usnationalarchives/3679494978/in/set-72157620680828773/">image of Nixon and Elvis</a> with 250 fans at the time of this posting.</p>
<p>What is your favorite NARA Commons image? Please post a link in the comments and if I get enough I will set up a gallery of Spellbound Fan Favorites!</p>
<p><em>Image Credits:</em> All images within this blog post are pulled from NARA&#8217;s images on the Flickr Commons. Please click on the images to see their specific details.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/02/16/national-archives-transitions-flickr-commons/">National Archives Transitions to Flickr Commons Membership</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2010/02/16/national-archives-transitions-flickr-commons/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>SEO Evaluation of an Archival Website: Looking at UMBC&#8217;s Digital Collections</title>
		<link>http://www.spellboundblog.com/2009/09/12/seo-evaluation-archival-websites-umbc/</link>
		<comments>http://www.spellboundblog.com/2009/09/12/seo-evaluation-archival-websites-umbc/#comments</comments>
		<pubDate>Sat, 12 Sep 2009 07:21:32 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[context]]></category>
		<category><![CDATA[interface design]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=719</guid>
		<description><![CDATA[Each week brings announcements of archives launching new websites. Today both my email and Twitter told me about  University of Maryland, Baltimore County&#8217;s new Digital Collections site. Who can resist peeking at new materials available online? I have spent much of the past year learning the details of Search Engine Optimization. Usually shortened to SEO, [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2009/09/12/seo-evaluation-archival-websites-umbc/">SEO Evaluation of an Archival Website: Looking at UMBC&#8217;s Digital Collections</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a title="Flickr Commons Nationaal Archief: Do-It-Yourself-Woman" href="http://www.flickr.com/photos/nationaalarchief/3333357969/"><img class="alignright size-full wp-image-732" title="Flickr Commons: Do-it-yourself-woman" src="http://www.spellboundblog.com/wp-content/uploads/2009/09/3333357969_99f9a5c49a.jpg" alt="Flickr Commons: Do-it-yourself-woman" width="282" height="370" /></a>Each week brings announcements of archives launching new websites. Today both my email and Twitter told me about  <a title="UMBC Digital Collections" href="http://contentdm.ad.umbc.edu/">University of Maryland, Baltimore County&#8217;s new Digital Collections</a> site. Who can resist peeking at new materials available online?</p>
<p>I have spent much of the past year learning the details of <a title="Wikipedia: Search Engine Optimization" href="http://en.wikipedia.org/wiki/Search_engine_optimization">Search Engine Optimization</a>. Usually shortened to SEO, this simply refers to the use of techniques which improve the traffic sent to a website via <a title="Wikipedia: Organic Search" href="http://en.wikipedia.org/wiki/Organic_search">organic search</a>. Want your webpage to show up at the top of the list for a specific search in Google? You want to work on your SEO.</p>
<p>So when I look at new archives website, I can&#8217;t help but keep an eye open for how well the site is optimized for search engines.</p>
<p>I hope that UMBC will forgive me for nitpicking their new site. A lot of their choices are great for SEO,  but they also have room for improvement.</p>
<p><strong>Things Done Well for SEO<br />
</strong></p>
<ul>
<li><strong>Home Page Title &amp; Description</strong>: The site&#8217;s home page has a good meta description. This is the text displayed below the link on a search results page &#8211; as shown below:<img class="alignnone size-full wp-image-723" title="UMBC Digital Collection Google Result" src="http://www.spellboundblog.com/wp-content/uploads/2009/09/umbc_google_result.jpg" alt="UMBC Digital Collection Google Result" width="450" height="83" /></li>
<li><strong>Unique Page Titles At Collection Level</strong>: Each photography collection homepage has a unique page title and a nice block of explanatory text. Google can only read words &#8211; so the more unique text on a page, the better the job Google can do in figuring out what your page is about. Example: <a title="Ardsley Park Album" href="http://contentdm.ad.umbc.edu/ardsley.php">Ardsley Park Album</a></li>
<li><strong>Good <a title="Wikipedia: Anchor Text" href="http://en.wikipedia.org/wiki/Anchor_text">anchor text</a></strong>: (also known as link text) The words used in anchor text tells search engines information about the destination page. For example, the blue text below is anchor text.<a title="Back view of Bretz's portable wet plate case " href="http://contentdm.ad.umbc.edu/u?/georgebretz,63"> </a><a title="Back view of Bretz's portable wet plate case " href="http://contentdm.ad.umbc.edu/u?/georgebretz,63"><img class="size-full wp-image-724 aligncenter" title="UMBC Anchor Text Example" src="http://www.spellboundblog.com/wp-content/uploads/2009/09/UMBC-anchor-text.jpg" alt="UMBC Anchor Text Example" width="215" height="191" /></a></li>
</ul>
<p><strong>Areas for SEO Improvement</strong></p>
<ul>
<li><strong>Unique Page Titles At Item Level</strong>: Individual images and documents all use a generic page title such as &#8216;UMBC | Digital Archive | Document Viewer&#8217;. Document Example: <a title="Accidental Death of an Anarchist" href="http://contentdm.ad.umbc.edu/u?/theatreprod,1080">Accidental Death of an Anarchist</a> Image Example: <a title="Image: 10 year old Bootblack" href="http://contentdm.ad.umbc.edu/u?/hinecoll,3957">10 year old Bootblack</a></li>
<li><strong>H1 Tags</strong>: In the HTML of each page, the dominant heading of the page should use the &lt;h1&gt; tag. This helps Google know the phrase you are targeting with this page. It is your 2nd best place to emphasize your content after the page title. In the case of the item pages, there seems to often be a headline type title at the top of the page &#8211; but it currently is not an demarcated with an &lt;h1&gt; tag.</li>
<li><strong>Think About Search Results and Indexing</strong>: Pages displaying <a title="UMBC Digital Collections: Search for Bootblack" href="http://contentdm.ad.umbc.edu/cdm4/results.php?CISOOP1=all&amp;CISOBOX1=bootblack&amp;CISOFIELD1=CISOSEARCHALL&amp;CISOOP2=exact&amp;CISOBOX2=&amp;CISOFIELD2=CISOSEARCHALL&amp;CISOOP3=any&amp;CISOBOX3=&amp;CISOFIELD3=CISOSEARCHALL&amp;CISOOP4=none&amp;CISOBOX4=&amp;CISOFIELD4=CISOSEARCHALL&amp;CISOROOT=all&amp;t=a">results of internal searches</a> on your site are not likely to be useful as indexed pages in Google. The thinking here is that they can dilute the focus on the item and collection level pages on your site if Google also has many search results pages in the index. If UMBC wanted their search pages to be indexed, then those pages&#8217; URLs should be simplified and the search results pages need a page title that somehow includes the search criteria. There are two ways that I know of to disable this indexing &#8211; <a title="Wikipedia: Robots Exclusion Standard" href="http://en.wikipedia.org/wiki/Robots_exclusion_standard">blocking via the site&#8217;s robots.txt file</a> or via a <a title="Robots Meta Tag" href="http://www.robotstxt.org/meta.html">robots meta tag</a> in the header of the search results page. Both of these methods tell obliging search engines to not crawl certain parts of your site.</li>
</ul>
<p><strong>Final Thoughts<br />
</strong></p>
<p>There are plenty of other things that UMBC could do to support this new website. They could create an XML sitemap of all their pages and submit it to Google (maybe they already have). They might re-title some of their pages based on using a tool like <a title="Google Insight into Search" href="http://www.google.com/insights/search/#">Google Insight</a> to see what variations of a phrase is searched on most frequently. My goal here was to give you a taste of the sorts of things that catch my eye. Also, SEO is still more of an art than a science &#8211; so you will sometimes notice that what one SEO expert recommends is the opposite of what the next expert would tell you.</p>
<p>In many cases changes, such as the Unique Page Title at the Item Level mentioned above, may not even be possible due to software or programmer resource limitations. The trick is to take advantage of every option that is available. There are also trade-offs to be made. UMBC&#8217;s site provides some very slick interfaces for viewing the details of a group of documents, such as <a title="Theatre Department Production Materials Archive" href="http://contentdm.ad.umbc.edu/cdm4/browse.php?CISOROOT=/theatreprod">theater programs and other materials related to a theatrical production</a>. The imlementation elegantly handles the situation of multiple scanned images which relate to a coherent set of documents. Sometimes you can&#8217;t have both your innovative UI and perfect SEO. Then it gets down to what your goals are for your website. Are you trying to make a specific community of existing users happy by providing them with tools they can use? Or does your mission focus more on reaching out to a broader audience?</p>
<p>There is no silver bullet to search engine optimization. It just takes knowledge of the available tools and techniques combined with a willingness to keep learning and experimenting. Like the &#8216;<a title="Doe-het-zelf vrouw /Do-it-yourself-woman" href="http://www.flickr.com/photos/nationaalarchief/3333357969/">Do-It-Yourself-Woman</a>&#8216; pictured above in the <a title="Flickr Commons: Nationaal Archief" href="http://www.flickr.com/people/nationaalarchief/">Nationaal Archief</a>&#8216;s photo I found out on the Flickr Commons, you too can learn the basics and do-it-yourself. A great starting point is <a title="Google SEO Guide" href="http://www.google.com/webmasters/docs/search-engine-optimization-starter-guide.pdf">Google&#8217;s free SEO Guide</a>. Also, please remember that the best time to plan your SEO strategy is before you have built your site in the first place!</p>
<p>I would love to do research on how much progress archives websites can make in their organic search traffic after SEO improvements. My thinking is to take a snapshot of a month of <a title="Wikipedia: Analytics" href="http://en.wikipedia.org/wiki/Analytics">analytics</a> (the statistics that tell you how many people are visiting your website) and then apply some SEO inspired changes. After a suitable delay (it takes some time for SEO to do its job) we consider another month of analytics to determine any change in organic traffic.</p>
<p>Do you want me to do a quick review of your archives website to see if there is room for SEO improvement? Please <a title="Contact Jeanne" href="http://www.spellboundblog.com/contact/">contact me</a> or add a comment to this post. I feel like there is a conference presentation in all this if we can find a good set of websites to optimize.</p>
<p>Finally, thank you to unsuspecting UMBC &#8211; your new website really is beautiful.</p>
<p><em>Image credit: <a title="Doe-het-zelf vrouw /Do-it-yourself-woman" href="http://www.flickr.com/photos/nationaalarchief/3333357969/">Doe-het-zelf vrouw /Do-it-yourself-woman</a> from Nationaal Archief on Flickr Commons.</em></p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2009/09/12/seo-evaluation-archival-websites-umbc/">SEO Evaluation of an Archival Website: Looking at UMBC&#8217;s Digital Collections</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2009/09/12/seo-evaluation-archival-websites-umbc/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Archival Collections Online: Reaching Audiences Beyond The Edge of Campus  (SAA09: Session 405)</title>
		<link>http://www.spellboundblog.com/2009/08/25/archival-collections-online-reaching-new-audiences/</link>
		<comments>http://www.spellboundblog.com/2009/08/25/archival-collections-online-reaching-new-audiences/#comments</comments>
		<pubDate>Tue, 25 Aug 2009 05:07:57 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[archival community]]></category>
		<category><![CDATA[interface design]]></category>
		<category><![CDATA[learning technology]]></category>
		<category><![CDATA[outreach]]></category>
		<category><![CDATA[SAA2009]]></category>
		<category><![CDATA[virtual collaboration]]></category>
		<category><![CDATA[web 2.0]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=648</guid>
		<description><![CDATA[Expanding Your Local and Global Audiences (Session 405, SAA 2009) shared how three institutions of higher education are using the web to reach out to new audiences. While the general public may still hold close the stereotype of archives as of rooms full of boxes of paper (not so different from this Duke image on [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2009/08/25/archival-collections-online-reaching-new-audiences/">Archival Collections Online: Reaching Audiences Beyond The Edge of Campus  (SAA09: Session 405)</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/dukeyearlook/3706334377/"><img class="alignright size-full wp-image-655" title="The Archivist's Life, 23 May 1954" src="http://www.spellboundblog.com/wp-content/uploads/2009/08/3706334377_939ce4aa98.jpg" alt="The Archivist's Life, 23 May 1954" width="269" height="330" /></a><a title="Session 405: Expanding Your Local and Global Audience" href="http://saa.archivists.org/Scripts/4Disapi.dll/4DCGI/events/eventdetail.html?Action=Events_Detail&amp;InvID_W=1072">Expanding Your Local and Global Audiences</a> (Session 405, SAA 2009) shared how three institutions of higher education are using the web to reach out to new audiences. While the general public may still hold close the stereotype of archives as of rooms full of boxes of paper (not so different from this Duke image on Flickr: &#8220;Mattie Russell, curator of manuscripts, and Jay Luvaas, director of the Flowers Collection, examine the papers of Senator Willis Smith in the library vault.&#8221;), the presenters in this session are focused on expanding peoples&#8217; experience of archives beyond boxes of papers locked away in a vault. They are using the web as a tool to reach beyond the walls of their reading rooms and the edges of their campuses.</p>
<p><a title="Duke RBMSCL" href="http://library.duke.edu/specialcollections/index.html">Duke University Rare Books, Manuscript &amp; Special Collections Library (RBMSCL)</a> : <a title="Lynn Eaton" href="http://library.duke.edu/apps/directory/staff/751/">Lynn Eaton</a> (Reference Archivist)</p>
<p>While I didn&#8217;t find my way into this session until the start of the next speaker&#8217;s presentation, Lynn was kind enough to share with me her personal printout of her presentation slides. The links below and any associated commentary are based solely on my own interpretation of the various screen-shots included.</p>
<ul>
<li><a title="Duke Digital Collections" href="http://library.duke.edu/digitalcollections/">Duke Digital Collections</a></li>
<li><a title="RBMSCL Finding Aids" href="http://library.duke.edu/digitalcollections/rbmscl/inv/">RBMSCL Finding Aids</a></li>
<li><a title="AdViews: A Digital Archive of Vintage Television Commercials" href="http://library.duke.edu/digitalcollections/adviews/">AdViews: A Digital Archive of Vintage Television Commercials</a> &#8211; this includes <a href="http://library.duke.edu/digitalcollections/adviews/interviews.html">interviews with experts</a>, a <a href="http://dn.duke.edu/adviewsquiz/">TV ads quiz</a> and a wide range of <a href="http://deimos3.apple.com/WebObjects/Core.woa/Browse/new.duke.edu.2256724776">TV ads available via iTunes U</a>.</li>
<li><a title="Flickr: Duke Yearlook" href="http://www.flickr.com/photos/dukeyearlook/collections/72157619799420213/">Duke Yearlook</a> &#8211; a set of Flickr collections displaying images from the Duke University Archives, each focused on a decade or theme related to Duke&#8217;s history.</li>
<li><a title="YouTube: Duke University Libraries Channel" href="http://www.youtube.com/user/DukeUnivLibraries">Duke University Libraries YouTube Channel</a>: example <a title="Duke Exhibit: &quot;A Century of Sex Appeals&quot; " href="http://www.youtube.com/user/DukeUnivLibraries#play/user/8A67AFF8AE54578F/6/_lpnpMyx8MI">Duke Exhibit: &#8220;A Century of Sex Appeals&#8221;</a></li>
<li><a title="Duke Digital Collections on DukeMobile" href="http://www.youtube.com/watch?v=iHK3E4N7w6o">Duke Digital Collections on DukeMobile iPhone application</a> &#8211; This wasn&#8217;t included in the presentation&#8217;s slides &#8211; but I spotted it on the YouTube Channel. I downloaded the DukeMobile app onto my iTouch and had a great time exploring the Duke Digital Collections included in the images section of the app. I think it was</li>
</ul>
<p><a title="UNLV Digital Collections" href="http://digital.library.unlv.edu/">University of Nevada Las Vegas (UNLV) Digital Collections</a>: <a title="Tom Sommer" href="http://www.library.unlv.edu/about/staff/libstafinfo.php?style=other&amp;personid=141">Tom Sommer</a> (University and Technical Services Archivist)</p>
<p>UNLV has experimented with new technologies as they appear. Tom made a point of saying that when they started seeing others provide a feature on their websites, UNLV would find a way to try it out. A great example of this is the addition of a tag cloud and google map to The Boomtown Years collection listed below.</p>
<ul>
<li><a title="Howard Hughes Digital Collection" href="http://digital.library.unlv.edu/hughes/">Howard Hughes Digital Collection</a> &#8211; Images displayed in this online exhibition about Howard Hughes, such as this <a title="Portrait of Howard Hughes" href="http://digital.library.unlv.edu/u?/hughes,60">portrait of Howard Hughes</a>, feature the opportunity both to rate and comment on the image. In addition, they provide an RSS feed for every possible metadata attribute (such as location, subject and media type)</li>
<li> <a title="Southern Nevada: The Boomtown Years" href="http://digital.library.unlv.edu/boomtown/">Southern Nevada: The Boomtown Years</a> &#8211; in addition to ratings and comments, this collection adds on display of recent comments, tagging and a google map which ties images to locations in southern Nevada.</li>
<li><a title="UNLV Special Collections Facebook Page" href="http://www.facebook.com/pages/Las-Vegas-NV/UNLV-Special-Collections/70053571047">UNLV Special Collections Facebook Page</a> &#8211; shares news and updates about projects &#8211; launched 2 months ago</li>
</ul>
<p><a title="Marist College Archives and Special Collections" href="http://library.marist.edu/archives/index.html">Marist College Archives and Special Collections</a>: <a title="John Ansley" href="http://library.marist.edu/archives/contact.html">John Ansley</a> (Head, Archives and Special Collections)</p>
<p>Marist first launched their website in 2001 to raise awareness of their collections. They also used listserves and the on-campus newspaper. Utlimately their best tactic was working one-on-one with professors whose interests intersected with their collections. This led to contact with special interest groups. Working with the special interest groups led to new tag and metadata values for their collections.</p>
<ul>
<li><a title="Hidden in Plain Sight" href="http://library.marist.edu/archives/gill/foreword.html">Hidden in Plain Sight</a> &#8211; online exhibit about fore-edge painting. Includes <a title="Introduction to Edge Painting" href="http://library.marist.edu/archives/gill/intro.html">videos as part of introduction</a> since it is hard to understand through still images. The <a title="Bibliography of Fore-Edge Painting" href="http://library.marist.edu/archives/gill/bibliography.html">bibliography</a> receives the most hits.</li>
<li><a title="Marist Environmental History Project" href="http://library.marist.edu/archives/mehp/index.html">Marist Environmental History Project</a> &#8211; this ongoing project aims to document who has what information about environmental history. The site includes an extensive <a title="Environmental History Primary Sources" href="http://library.marist.edu/archives/mehp/primarysources.html">list of primary sources</a> as well as a 24 minute oral history:  <a title="The Enduring Storm:  The Story of the Storm King Case and the People Who Launched the Modern Environmental Movement" href="http://http://library.marist.edu/archives/mehp/Audio%20Documentary/The%20Enduring%20Storm%20FINAL%20VERSION.mp3">The Enduring Storm:  The Story of the Storm King Case and the People Who Launched the Modern Environmental Movement (mp3)</a>.</li>
<li><a title="Intercollegiate Rowing Association Poughkeepsie Regatta" href="http://library.marist.edu/archives/regatta/index.html">Intercollegiate Rowing Association Poughkeepsie Regatta</a> &#8211; timeline used to guide users to who won each race, PDFs of programs, and extensive bibliographies (including an <a title="NYT newspaper article index" href="http://library.marist.edu/archives/regatta/newspapers.html">index of 1000+ NYT articles</a> about the regatta).</li>
<li><a title="Lowell Thomas Travelogues" href="http://library.marist.edu/archives/LTT/index.html">Lowell Thomas Travelogues</a> &#8211; a household name during the golden age of radio, <a title="Lowell Thomas Biography" href="http://library.marist.edu/archives/LTT/ltbiography.html">Lowell Thomas</a> created extensive multimedia travelogues of his travels around the world. He is credited with making <a title="T. E. Lawrence" href="http://en.wikipedia.org/wiki/T._E._Lawrence">T. E. Lawrence</a> famous as &#8216;Lawrence of Arabia&#8217;. The site was launched as a teaser to the over 1000 linear feet of photos, audio, video &amp; other records which will be available to researchers in October 2009. For a taste of what is coming, check out <a title="Lowell Thomas Travelogue Video Clip" href="http://library.marist.edu/archives/LTT/clip%203.html">this Lowell Thomas travelogue video clip</a> &#8211; my favorite quote from which is &#8220;&#8230;come with me on a magic carpet out to the land of history, mystery and romance.&#8221;</li>
</ul>
<p><strong>My Thoughts</strong></p>
<p>The archivists at all three of these educational institutions have tried new things and worked hard to share their materials with people beyond the traditional range of a reading room. The promise of the web, and all the tools and techniques it supports, is still being uncovered. It will be up to innovative archivists to keep discovering ways to push the envelope and welcome new audiences from all the corners of the globe.</p>
<p><em>Image Credit:</em> <a rel="cc:attributionURL" href="http://www.flickr.com/photos/dukeyearlook/">http://www.flickr.com/photos/dukeyearlook/</a> / <a rel="license" href="http://creativecommons.org/licenses/by-nc-sa/2.0/">CC BY-NC-SA 2.0</a></p>
<p><em>As is the case with all my session summaries from <a title="SAA2009 Posts" href="http://www.spellboundblog.com/category/saa2009/">SAA2009</a>, please accept my apologies in advance for any cases in which I misquote, overly simplify or miss points altogether in the post above. These sessions move fast and my main goal is to capture the core of the ideas presented and exchanged. Feel free to contact me about corrections to my summary either via comments on this post or via <a title="Contact Jeanne" href="http://www.spellboundblog.com/contact/">my contact form</a>.</em></p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2009/08/25/archival-collections-online-reaching-new-audiences/">Archival Collections Online: Reaching Audiences Beyond The Edge of Campus  (SAA09: Session 405)</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2009/08/25/archival-collections-online-reaching-new-audiences/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DH2009: Digital Curiosities and Amateur Collections</title>
		<link>http://www.spellboundblog.com/2009/06/29/dh2009-digital-curiosities/</link>
		<comments>http://www.spellboundblog.com/2009/06/29/dh2009-digital-curiosities/#comments</comments>
		<pubDate>Tue, 30 Jun 2009 02:25:33 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[at risk records]]></category>
		<category><![CDATA[DH2009]]></category>
		<category><![CDATA[digital humanities]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[learning technology]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[outreach]]></category>
		<category><![CDATA[virtual collaboration]]></category>
		<category><![CDATA[web 2.0]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2009/06/25/dh2009-digital-curiosities/</guid>
		<description><![CDATA[Session Title: Digital Curiosities: Resource Creation Via Amateur Digitisation Speaker: Melissa Terras Overview: Review of 100 virtual museum websites and multiple flickr groups plus surveys of amateur website creators, memory institutions and Arts &#38; Humanities academics leads to new perspective on digitization and creation of collections online by dedicated enthusiasts. Session Highlights Areas of &#8220;Amateur&#8221; [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2009/06/29/dh2009-digital-curiosities/">DH2009: Digital Curiosities and Amateur Collections</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a title="Flickr Image from Curio Cabinet Group by mms0131" href="http://www.flickr.com/photos/mms0131/500142786/in/set-72157605079911413/"><img class="alignright size-full wp-image-617" title="Flickr Image from Curio Cabinet Group by mms0131" src="http://www.spellboundblog.com/wp-content/uploads/2009/06/curio-image.jpg" alt="curio-image" width="282" height="398" /></a><strong>Session Title:</strong> Digital Curiosities: Resource Creation Via Amateur Digitisation<br />
<strong>Speaker:</strong> <a title="Dr Melissa Terras" href="http://www.ucl.ac.uk/infostudies/melissa-terras/">Melissa Terras</a></p>
<p><strong>Overview:</strong> Review of 100 virtual museum websites and multiple flickr groups plus surveys of amateur website creators, memory institutions and Arts &amp; Humanities academics leads to new perspective on digitization and creation of collections online by dedicated enthusiasts.</p>
<p><strong>Session Highlights</strong></p>
<p>Areas of &#8220;Amateur&#8221; endeavor  have a long history of launching collections, such as:</p>
<ul>
<li>cabinet of curiosities</li>
<li>foundation of astronomical research</li>
<li>british flora and amateur botanists</li>
<li>weather observations</li>
<li>open source software movement</li>
</ul>
<p>Being an amateur doesn&#8217;t necessarily mean being bad at what you do!</p>
<p>Within the realm of self-defined museums some common topics often emerge:</p>
<ul>
<li>ephemera (advertising, packaging, nostalgia)</li>
<li>comics</li>
<li>technology &#8211; especially old tech, there is a surprising trend of being fascinated by technology approximately 10 years older than the collector</li>
<li>personal and &#8220;embarrassing&#8221; collections</li>
<li>genealogy</li>
</ul>
<p>For these self-defined museums the scope is self-defined &#8211; these are self-delineated collections. Virtual museums can document aspects of cultural heritage considered socially taboo or in some way too sensitive to collect. A great example of this is the <a title="Museum of Menstruation" href="http://www.mum.org/">Museum of Menstruation</a> which claims to have been created 14 years ago and is currently trying to establish a <a title="Future of MUM" href="http://www.mum.org/future.htm">public permenant display for the public</a>.</p>
<p>Platforms have evolved over the life of the web, starting with static html, then blogs and now Flickr images as a mode of presentation.</p>
<p>This is a list of successful amateur collections online:</p>
<ul>
<li><a title="Today's Inspiration" href="http://todaysinspiration.blogspot.com/">Today&#8217;s Inspiration</a> &#8211; illustration from the 40&#8242;s and 50&#8242;s</li>
<li><a title="JonWilliamson.com" href="http://jonwilliamson.com/">JonWilliamson.com</a> &#8211; advertising 1940s-1960s</li>
<li><a title="Pulp Fiction Flickr Group" href="http://www.flickr.com/groups/pulpfiction/pool/">Pulp Fiction Flickr Group</a> &#8211; 882 members who provide basic metadata and often label stuff within the image &#8211; currently contains 3,385 items.</li>
<li><a title="Curio Cabinet Flickr Group" href="http://www.flickr.com/groups/curiocabinet/">Curio Cabinet Flickr Group</a> &#8211; 1,206 members and 5,537 items</li>
</ul>
<p><a title="VADS (Visual Arts Data Service)" href="http://www.vads.ac.uk/">Visual Arts Data Service</a> (VADS) is a more traditional site created by a cultural heritage institution. It contains 100,000+ images copyright cleared for use in teaching, learning and research in the UK. VADS is a very detailed static source of images with metadata, but provides no interaction.</p>
<p>Amateurs do provide metadata, but it is intuitive metadata. It might not fit into rigid buckets of data, but that doesn&#8217;t meant that the metadata available isn&#8217;t useful.</p>
<p>What are the boundaries between amateur and professional? Work vs hobby?</p>
<p>Many of these amateur sites get much more traffic than most standard museum sites. More than 50% of museum digitized images are never visited.</p>
<p>Memory institutions are starting to put things into the wider online community:</p>
<ul>
<li><a title="Smithsonian Institution" href="http://www.si.edu/">Smithsonian</a>: photos in <a title="Flickr Commons: Smithsonian" href="http://www.flickr.com/photos/smithsonian/">Smithsonian Flickr Commons</a></li>
<li><a title="Tate Online" href="http://www.tate.org.uk/">Tate</a>: The <a title="How We Are Now" href="http://www.tate.org.uk/britain/exhibitions/howweare/slideshow.shtm">How We Are Now</a> project invited the public to contribute photos to the <a title="Flickr: How We Are Now Group" href="http://www.flickr.com/groups/howwearenow/">How We Are Flickr Group</a>. The images were <a title="Flickr Photos Streamed in the Tate" href="http://www.flickr.com/photos/tategallery/507813139/in/set-72157600238798389/">streamed to screens</a> within the <a title="How We Are: Photographing Britain" href="http://www.tate.org.uk/britain/exhibitions/howweare/default.shtm">How We Are: Photographing Britain exhibit</a> and 40 photos were chosen to be included as the last set of photos in the physical exhibit.</li>
<li><a title="Victoria &amp; Albert Museum" href="http://www.vam.ac.uk/">Victoria &amp; Albert Museum</a>: created a <a title="Flickr: Photos from Victoria &amp; Albert Museum" href="http://www.flickr.com/groups/va_museum/">Flicrk group of photos taken at the V&amp;A museum</a> along with a long list of other <a title="V&amp;A Flickr Groups and Streams" href="http://www.vam.ac.uk/activ_events/do_online/flickr_group/index.html">V&amp;A Flickr groups and streams</a></li>
<li>Oxford University&#8217;s <a title="Oxford Great War Archive" href="http://www.oucs.ox.ac.uk/ww1lit/gwa">Great War Archive</a>: contains 6,500 items contributed by the public and related to the First World War.</li>
<li><a title="Facebook" href="http://www.facebook.com/">Facebook</a> and <a title="Twitter" href="http://twitter.com/">Twitter</a> are being used more often for informing the community about their collections</li>
</ul>
<p>Much of amateur research has been driven by advances in technology. A great example of this is the advent of affordable <a title="Wikipedia: metal detector" href="http://en.wikipedia.org/wiki/Metal_detector">metal detectors</a> led to dramatic changes in archaeology. The internet and Web 2.0 technology are arming a whole new generation of enthusists who can find one another and collaborate more easily than might ever have been dreamed of 20 years ago.</p>
<p><strong>Next Steps &amp; Conclusions</strong></p>
<p>Future research will involve looking at the psychology of collection: archives vs collections. For now it is important to realize that institutions are not the only hosts of &#8220;worthwhile&#8221; digital objects. Pro-am (aka, pro-amateur) are doing better with using web 2.0 &amp; getting more traffic.</p>
<p>What can memory institutions learn from this?</p>
<ul>
<li>interact with user communities</li>
<li>use the &#8216;grand central stations&#8217; of flickr, twitter, facebook</li>
<li>usability of flickr is better than what most memory institutions build for themselves</li>
</ul>
<p><strong>My Thoughts</strong></p>
<p>This session considers the ways cultural memory institution can take advantage of the web by looking at what the successful enthusiasts are achieving. This research-backed approach confirms what I would have expected. Libraries, museums and archives are leaving a lot on the table when it comes to putting their collections online. Sites run by non-professionals are doing an amazing job of drawing in new audiences, keeping people around and then initiating conversation within that audience.</p>
<p>The Flickr Commons is a big step forward, but it isn&#8217;t the only option. There are also varying opinions about <a title="Flckr Commons Discussion: Question re Crowdsourcing: fail or win?" href="http://www.flickr.com/groups/flickrcommons/discuss/72157620593449864/">how successful the crowdsourcing aspect of the Flickr Commons is for memory institutions</a>. A lot of this goes back to to a core question &#8220;how do we know if we have succeeded?&#8221;. There is much to be said for setting out clear goals when launching online initiatives. Is your goal increased traffic to your site or crowdsourcing of metadata? A great example of an initiative whose goal is clearly collection of crowdsourced metadata is the <a title="German Federal Archives, Crowdsourcing &amp; the Wikimedia Commons" href="http://www.spellboundblog.com/2009/01/26/german-federal-archives-crowdsourcing-wikimedia-commons/">German Federal Archives who chose to use the Wikimedia Commons for their photo metadata initiative</a>.</p>
<p>If you are trying to extend your mission of providing access to materials to the public, then how do you measure success? Putting your materials in what Melissa called &#8220;grand central stations&#8221; (or what I have also heard termed &#8220;public crosswalks&#8221;) definitely increases the chances of serendipitous discovery by new individuals. That said, we can see from the successful blogs mentioned above that tackling a niche with enthusiasm and consistent posting can go a long way to building a following. JonWilliamson.com seems to have only launched back in November of 2008 with a post featuring a <a title="JonWilliamson.com: Scotch Tape Christmas ad from 1951" href="http://jonwilliamson.com/template_permalink.asp?id=88">Scotch Tape Christmas ad from 1951</a>. The author posted in May of 2009 that his <a title="JonWilliamson.com: 100,000 Hits n Flickr" href="http://jonwilliamson.com/template_archives_cat.asp?cat=25">images in Flickr had surpassed 100,000 views</a>.</p>
<p>To conclude this post I leave you with a list of inspirational digitized collections online that were created by various cultural heritage institutions:</p>
<ul>
<li><a title="Publishers' Bindings Online" href="http://bindings.lib.ua.edu/">Publishers&#8217; Bindings Online</a> &#8211; discussed in <a title="SAA2007: Publishers’ Bindings Online – Digitization, Collaboration, Standardization and Community Building (Session 707)" href="http://www.spellboundblog.com/2007/09/22/saa2007-publishers%E2%80%99-bindings-online-digitization-collaboration-standardization-and-community-building-session-707/">SAA2007&#8242;s Session: Publishers’ Bindings Online – Digitization, Collaboration, Standardization and Community Building</a>, a multi-institutional project that includes <a title="PBO Galleries" href="http://bindings.lib.ua.edu/gallery2.html">galleries</a> of topical images combined with an essay that gives the images context. Two of my favorites are:
<ul>
<li><a title="From Domestic Goddesses to Suffragists: The Story of Women Told on Bookbindings, 1820-1920" href="http://bindings.lib.ua.edu/gallery/women.html">From Domestic Goddesses to Suffragists: The Story of Women Told on Bookbindings, 1820-1920</a></li>
<li><a title="Indians, the Frontier, and the West in American Bookbindings" href="http://bindings.lib.ua.edu/gallery/west.html">Indians, the Frontier, and the West in American Bookbindings</a></li>
</ul>
</li>
<li><a title="Calisphere" href="http://www.calisphere.universityofcalifornia.edu/">Calisphere</a> &#8211; more than 150,000 digitized items <span>organized for easy use by K-12 teachers. This is especially interesting in that it represents items already available in <a title="Online Archive of California" href="http://oac4.cdlib.org/">Online Archive of California</a>, but organized in a way to make them easy to find and use with their target audience in mind.</span></li>
<li><span><a title="Yiddish Books Online" href="http://yiddishbookcenter.org/+yb">Yiddish Books Online</a> &#8211; A project by the <a title="National Yiddish Book Center" href="http://www.yiddishbookcenter.org">National Yiddish Book Center</a> that uses the Internet Archive as a platform to host </span>11,000 digitized out-of-print Yiddish books. This project is a nice cross between a branded custom site and a grand-central station</li>
</ul>
<p>Have a favorite online collection website? Please share it in the comments below.</p>
<p><strong><span style="font-weight: normal;"><em>As is the case with all my session summaries from <a title="Digital Humanities 2009" href="http://www.mith2.umd.edu/dh09/">DH2009</a>, please accept my apologies in advance for any cases in which I misquote, overly simplify or miss points altogether in the post above. These sessions move fast and my main goal is to capture the core of the ideas presented and exchanged. Feel free to contact me about corrections to my summary either via comments on this post or via my <a title="Contact Jeanne" href="../2009/06/25/contact/">contact form</a>.</em></span></strong></p>
<p><strong><em>Image credit:</em></strong> <a rel="cc:attributionURL" href="http://www.flickr.com/photos/mms0131/">http://www.flickr.com/photos/mms0131/</a> / <a rel="license" href="http://creativecommons.org/licenses/by-nc-nd/2.0/">CC BY-NC-ND 2.0</a></p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2009/06/29/dh2009-digital-curiosities/">DH2009: Digital Curiosities and Amateur Collections</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2009/06/29/dh2009-digital-curiosities/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Yahoo &amp; Google&#8217;s Search for Reusable Images and the Flickr Commons</title>
		<link>http://www.spellboundblog.com/2009/06/13/yahoo-google-search-reusable-images-flickr-commons/</link>
		<comments>http://www.spellboundblog.com/2009/06/13/yahoo-google-search-reusable-images-flickr-commons/#comments</comments>
		<pubDate>Sat, 13 Jun 2009 17:03:27 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[copyright]]></category>
		<category><![CDATA[photography]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=584</guid>
		<description><![CDATA[When I read about Yahoo Image Search&#8217;s recent addition of a filter to return only creative commons Flickr images, I got all excited about what this might mean for images in the Flickr Commons. So I raced off to the Yahoo Image Search page to see how it works. The short answer is that the [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2009/06/13/yahoo-google-search-reusable-images-flickr-commons/">Yahoo &#038; Google&#8217;s Search for Reusable Images and the Flickr Commons</a></p>
]]></description>
			<content:encoded><![CDATA[<p>When I read about Yahoo Image Search&#8217;s recent <a title="Yahoo Search Blog: Find Images to Use and Reuse with the New Creative Commons Filter" href="http://www.ysearchblog.com/2009/05/26/find-images-to-use-and-reuse-with-the-new-creative-commons-filter/">addition of a filter to return only creative commons Flickr images</a>, I got all excited about what this might mean for images in the <a title="Flickr Commons" href="http://www.flickr.com/commons">Flickr Commons</a>. So I raced off to the <a title="Yahoo Image Search" href="http://images.search.yahoo.com">Yahoo Image Search</a> page to see how it works. The short answer is that the new special rights setting of  <a title="Flickr Commons: No Known Rights Restrictions" href="http://www.flickr.com/commons/usage/">no known copyright restrictions</a> that they created for members of the Flickr Commons apparently doesn&#8217;t count.</p>
<p>For my test I searched for an exact match on <a title="Yahoo Image Search: Ticket with portrait of George Washington" href="http://images.search.yahoo.com/search/images?p=%22Ticket+with+portrait+of+George+Washington%22&amp;fr=sfp&amp;ei=utf-8&amp;x=wrt&amp;y=Search">&#8220;Ticket with portrait of George Washington&#8221;</a>. This returns one result &#8211; the <a title="Flickr: Ticket with portrait of George Washington" href="http://www.flickr.com/photos/field_museum_library/3589463633/">one image in Flickr</a> with the same name, from <a title="Flickr Commons: The Field Museum" href="http://www.flickr.com/photos/field_museum_library/">The Field Museum</a> in Flickr Commons. If you click on the &#8216;More Filters&#8217; link, you will see other ways to filter your <img class="size-full wp-image-593 alignleft" title="Creator permits reuse - Yahoo image search" src="http://www.spellboundblog.com/wp-content/uploads/2009/06/cc-img.JPG" alt="Creator permits reuse - Yahoo image search" width="243" height="121" />results &#8211; including the option to restrict your results to only include images whose creators permit reuse.</p>
<p>Next I clicked in the &#8216;Creator allows reuse&#8217; and my one result disappeared! Quite disappointing in my book.</p>
<p>Google is also getting onto the &#8216;make it easy to search for reusable images&#8217; bandwagon. <a title="Search Engine Land" href="http://searchengineland.com/">Search Engine Land</a> reported that <a title="Search Engine Land: Google Images Quietly Adds Creative Commons Filter" href="http://searchengineland.com/google-images-quietly-adds-creative-commons-filter-20847">Google Images Quietly Adds Creative Commons Filter</a>. That post pointed me to <a title="Google Operating System Blog" href="http://googlesystem.blogspot.com/">Google  Operating System</a>&#8216;s search interface that lets you <a title="Find Creative Commons Images in Google Image Search" href="http://googlesystem.blogspot.com/2009/06/find-creative-commons-images-in-google.html">play with the options that Google has available</a>. After a clicking through to some of the images returned by a <a title="Google Image Search: Archives" href="http://images.google.com/images?as_rights=(cc_publicdomain|cc_attribute|cc_sharealike|cc_noncommercial|cc_nonderived)&amp;hl=en&amp;sa=1&amp;q=archives&amp;btnG=Search+Images&amp;aq=f&amp;oq=">Google Image Search for creative commons images of archives</a>, the way the Google model <em>appears </em>to work is to look for <a title="Creative Commons" href="http://creativecommons.org/">creative commons</a> badges or links on the page with the image. I even found Flickr creative commons images, but when I tried to find my Flickr Commons image of the ticket used above for my Yahoo image search experiment it wasn&#8217;t returned by Google either.</p>
<p>So if an archives (or museum or library) posts images on a page that indicates that the content is licensed under creative commons, it seems those images will then appear in Google&#8217;s image search as reusable. That is good news! Another way to get users to find your public domain images.</p>
<p>The question I am left is how to resolve the gap between <a title="Flickr Commons: Rights Statement" href="http://www.flickr.com/commons/usage/">Flickr Commons&#8217; &#8216;no known copyright restrictions  rights statement</a> and both Google and Yahoo&#8217;s definition of reusable content.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2009/06/13/yahoo-google-search-reusable-images-flickr-commons/">Yahoo &#038; Google&#8217;s Search for Reusable Images and the Flickr Commons</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2009/06/13/yahoo-google-search-reusable-images-flickr-commons/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

