<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Spellbound Blog &#187; search</title>
	<atom:link href="http://www.spellboundblog.com/category/search/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.spellboundblog.com</link>
	<description>Archives, Digital Humanities, Cultural Heritage, Technology</description>
	<lastBuildDate>Fri, 03 Feb 2012 05:43:15 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Creative Funding for Text-Mining and Visualization Project</title>
		<link>http://www.spellboundblog.com/2011/01/16/creative-funding-text-mining-visualization/</link>
		<comments>http://www.spellboundblog.com/2011/01/16/creative-funding-text-mining-visualization/#comments</comments>
		<pubDate>Sun, 16 Jan 2011 15:51:29 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[digital humanities]]></category>
		<category><![CDATA[funding]]></category>
		<category><![CDATA[GIS]]></category>
		<category><![CDATA[information visualization]]></category>
		<category><![CDATA[interface design]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[text mining]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=1074</guid>
		<description><![CDATA[The Hip-Hop word count project on Kickstarter.com caught my eye because it seems to be a really interesting new model for funding a digital humanities project. You can watch the video below &#8211; but the core of the project tackles assorted metadata from 40,000 rap songs from 1979 to the present including stats about each [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2011/01/16/creative-funding-text-mining-visualization/">Creative Funding for Text-Mining and Visualization Project</a></p>
]]></description>
			<content:encoded><![CDATA[<p><iframe frameborder="0" height="380px" align="right" src="https://www.kickstarter.com/projects/1801076626/the-hip-hop-word-count-a-searchable-rap-almanac/widget/card.html" width="220px"></iframe>The<a href="http://kck.st/g3M9lv"> Hip-Hop word count project</a> on <a href="http://www.kickstarter.com">Kickstarter.com</a> caught my eye because it seems to be a really interesting new model for funding a digital humanities project. You can watch the video below &#8211; but the core of the project tackles assorted metadata from 40,000 rap songs from 1979 to the present including stats about each song (word count, syllables, education level, etc), individual words, artist location and date. This information aims to become a public online almanac fueled by visualizations.</p>
<p>I am a backer of this project, and you can be too. As of the original writing of this post, they are currently 47% funded twenty-eight days out from their deadline. For those of you not familiar with <a href="http://www.kickstarter.com">Kickstarter</a>, people can post <a href="https://www.kickstarter.com/help/faq#WhoCanFundTheiProjOnKick">creative projects</a> and provide rewards for their funders. The funding only goes through if they reach their goal within the time limit &#8211; otherwise nothing happens, a model they call &#8216;all-or-nothing funding&#8217;.</p>
<p>What will the money be spent on?</p>
<ul>
<li>45% for PHP programmers who have been coding the custom web interface</li>
<li>35% for interface designers</li>
<li>10% for data acquisition &amp; data clean up</li>
<li>10% for hosting bills</li>
</ul>
<p>They aim for a five month time-line to move from their existing functional prototype to something viable to release to the public.</p>
<p>I am also intrigued by ways that the work on this project might be leveraged in the future to support similar text-mining projects that tie in location and date. How about doing the same thing with civil war letters? How about mining the lyrics from Broadway musical songs? </p>
<p>If this all sounds interesting, take a look at the video below and read more on the <a href="http://kck.st/g3M9lv">Hip-Hop Word Count Kickstarter home page</a>. If half the people who follow my RSS feed pitch in $10, this project would be funded. Take a look and consider pitching in. If this project doesn&#8217;t speak to you &#8211; take a look around <a href="http://www.kickstarter.com">Kickstarter</a> for something else you might want to support.</p>
<p><center><iframe frameborder="0" height="410px" src="https://www.kickstarter.com/projects/1801076626/the-hip-hop-word-count-a-searchable-rap-almanac/widget/video.html" width="480px"></iframe></center></p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2011/01/16/creative-funding-text-mining-visualization/">Creative Funding for Text-Mining and Visualization Project</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2011/01/16/creative-funding-text-mining-visualization/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Topic Modeling, Auto-Classification and Archival Description</title>
		<link>http://www.spellboundblog.com/2010/04/27/topic-modeling-auto-classification-archival-description/</link>
		<comments>http://www.spellboundblog.com/2010/04/27/topic-modeling-auto-classification-archival-description/#comments</comments>
		<pubDate>Tue, 27 Apr 2010 06:28:08 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[interface design]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[text mining]]></category>
		<category><![CDATA[what if]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=963</guid>
		<description><![CDATA[In an example of Twitter serendipity, @silverasm&#8216;s (Aditi Muralidharan) tweet pointed me to @historying&#8216;s blog post about Topic Modeling. In this post Cameron Blevins explains the results of using the topic modeling feature of UMass Amherst&#8216;s MAchine Learning for LanguagE Toolkit (MALLET) on the text of Martha Ballard’s Diary. I have spent lot of time [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/04/27/topic-modeling-auto-classification-archival-description/">Topic Modeling, Auto-Classification and Archival Description</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a href="http://mallet.cs.umass.edu/index.php"><img class="alignright size-full wp-image-964" title="MALLET logo" src="http://www.spellboundblog.com/wp-content/uploads/2010/04/logo3.png" alt="" width="215" height="95" /></a>In an example of Twitter serendipity, <a title="Twitter: silverasm" href="http://twitter.com/silverasm">@silverasm</a>&#8216;s (Aditi Muralidharan) <a title="tweet about text mining" href="http://twitter.com/silverasm/statuses/12842112825">tweet</a> pointed me to <a title="Twitter: historying" href="http://twitter.com/historying">@historying</a>&#8216;s <a title="Topic Modeling Martha Ballard’s Diary" href="http://historying.org/2010/04/01/topic-modeling-martha-ballards-diary/">blog post about Topic Modeling</a>. In this post Cameron Blevins explains the results of using the <a title="MALLET: Topic Modeling" href="http://mallet.cs.umass.edu/topics.php">topic modeling</a> feature of <a title="UMass Amherst" href="http://www.umass.edu/">UMass Amherst</a>&#8216;s <a title="MAchine Learning for LanguagE Toolkit" href="http://mallet.cs.umass.edu/index.php">MAchine Learning for LanguagE Toolkit</a> (MALLET) on the text of <a title="Martha Ballard's Diary Online" href="http://dohistory.org/diary/">Martha Ballard’s Diary</a>.</p>
<p>I have spent lot of time thinking about how to generate thematic overviews of groups of archival collections. My information visualization project, <a title="ArchivesZ Blog Posts" href="http://www.spellboundblog.com/category/archivesz/">ArchivesZ</a>, aims to provide ways of understanding aggregated archival description data, both from a single institution or across institutional boundaries. Now I find myself wondering if text mining with a tool like MALLET might generate smart topic groupings more elegantly than fighting with the wide range of non-standardized collection subjects.</p>
<p><strong>Topic Modeling with MALLET</strong></p>
<p>To get a sense of what MALLET generates, see the excerpt below from Blevins&#8217;s post:</p>
<blockquote><p>With some tinkering, MALLET generated a list of thirty topics  comprised of twenty words each, which I then labeled with a descriptive  title. Below is a quick sample of what the program<em> </em>“thinks” are  some of the topics in the diary:</p>
<ul>
<li><strong>MIDWIFERY:</strong> birth deld safe morn receivd calld left  cleverly pm labour fine reward arivd infant expected recd shee born  patient</li>
<li><strong>CHURCH: </strong>meeting attended  afternoon reverend worship foren mr famely performd vers attend public  supper st service lecture discoarst administred supt</li>
<li><strong>DEATH:</strong> day yesterday  informd morn years death ye hear expired expird weak dead las past heard  days drowned departed evinn</li>
<li><strong>GARDENING:</strong> gardin sett  worked clear beens corn warm planted matters cucumbers gatherd potatoes  plants ou sowd door squash wed seeds</li>
</ul>
</blockquote>
<p>He goes on to explain that &#8220;MALLET also allows us to track those topics across the text.&#8221; What if, instead of text mining a diary, we pumped the descriptions of every archival collection from a single institution into MALLET. Of course we would need a good list of stop words including such common terms as archives, history, sources and records. But I wonder how the topics MALLET suggests would compare to the official subjects associated with each collection? Could this give us a broad overview of the topics covered by a specific repository and give us a new way to build paths to the collections based on topic?</p>
<p><strong>Auto-Classification Using Castanet</strong></p>
<p>Text miner <a title="Aditi Muralidharan" href="http://www.cs.berkeley.edu/~aditi/">Aditi Muralidharan</a> also posted recently on this theme in <a title="Castanet: automatically generating a browsing structure for a collection" href="http://mininghumanities.com/2010/04/24/castanet-automatically-generating-a-browsing-structure-for-a-collection/">Castanet: automatically generating a browsing structure for a collection</a> and explains:</p>
<blockquote><p>Castanet automatically carves a sub-structure from the hierarchical  concept dictionary, WordNet (<a href="http://wordnet.princeton.edu/">http://wordnet.princeton.edu</a>),  and matches items in the collection to one or many appropriate places  within that hierarchy. Then, after some automated trimming and  flattening, the result is a hierarchical browsing system.</p></blockquote>
<p>I have heard of Castanet before via the <a title="Flamenco Search Interface Project" href="http://flamenco.berkeley.edu/">Flamenco Search Interface Project</a>. Apparently Muralidharan did a project using Castanet last summer to create <a href="http://go2.wordpress.com/?id=725X1342&amp;site=textdigihum.wordpress.com&amp;url=http%3A%2F%2Forange.sims.berkeley.edu%2Fcgi-bin%2Fflamenco.cgi%2Fflickr%2FFlamenco&amp;sref=http%3A%2F%2Fmininghumanities.com%2F2010%2F04%2F24%2Fcastanet-automatically-generating-a-browsing-structure-for-a-collection%2F">a category system</a> for <a title="Flickr Commons" href="http://www.flickr.com/commons">Flickr Commons</a> images based on the images&#8217;  tags which is then rendered using a Flamenco interface. I include a partial screen-shot below to give you a taste of what the navigation of images feels like a few levels down in the hierarchy. I love the classification of &#8216;Group Action&#8217; then filtered by a sub-classification of &#8216;Commerce&#8217;. The first images shown are of &#8216;horse trading&#8217; &#8211; with additional headings and images beneath them as well as additional filter options on the left.</p>
<p style="text-align: center;"><a title="Flickr Commons: group_action &gt; commerce" href="http://orange.sims.berkeley.edu/cgi-bin/flamenco.cgi/flickr/Flamenco?q=actX:322&amp;group=actX"><img class="aligncenter size-full wp-image-966" title="Flickr Commons Images via Canasta &amp; Flamenco" src="http://www.spellboundblog.com/wp-content/uploads/2010/04/flickr-canasta.jpg" alt="" width="547" height="308" /></a></p>
<p><strong>What If?</strong></p>
<p>What if we pulled all the English language archival descriptions from around the world as our original data set. If we used this data for topic modeling, our subjects clusters would be cross-institutional. Maybe we could map the local institution assigned subjects to the topic model generated topics for each collection and get a sort of automated crosswalk for finding related collections. If we used the local institution assigned subjects from the archival descriptions for Canasta style auto-classification, maybe we could generate a way to hierarchically browse collections topically.</p>
<p>Both MALLET and Flamenco are open source (I am not sure of the status of Castanet) and, as I discovered working on ArchivesZ, many institutions will share their archival description data for a good cause. So &#8211; is this a good cause? I need to tease these ideas out a bit more, but what do you all think of it at first blush? Feasible? Interesting? Worthwhile experiments?</p>
<p><em>Image Credits:</em> MALLET logo from <a title="MALLET Homepage" href="http://mallet.cs.umass.edu/index.php">MALLET homepage</a>. Images in screen shot from <a title="Flickr Commons" href="http://www.flickr.com/commons">Flickr Commons</a> with no known copyright.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2010/04/27/topic-modeling-auto-classification-archival-description/">Topic Modeling, Auto-Classification and Archival Description</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2010/04/27/topic-modeling-auto-classification-archival-description/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>SEO Evaluation of an Archival Website: Looking at UMBC&#8217;s Digital Collections</title>
		<link>http://www.spellboundblog.com/2009/09/12/seo-evaluation-archival-websites-umbc/</link>
		<comments>http://www.spellboundblog.com/2009/09/12/seo-evaluation-archival-websites-umbc/#comments</comments>
		<pubDate>Sat, 12 Sep 2009 07:21:32 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[context]]></category>
		<category><![CDATA[interface design]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=719</guid>
		<description><![CDATA[Each week brings announcements of archives launching new websites. Today both my email and Twitter told me about  University of Maryland, Baltimore County&#8217;s new Digital Collections site. Who can resist peeking at new materials available online? I have spent much of the past year learning the details of Search Engine Optimization. Usually shortened to SEO, [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2009/09/12/seo-evaluation-archival-websites-umbc/">SEO Evaluation of an Archival Website: Looking at UMBC&#8217;s Digital Collections</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a title="Flickr Commons Nationaal Archief: Do-It-Yourself-Woman" href="http://www.flickr.com/photos/nationaalarchief/3333357969/"><img class="alignright size-full wp-image-732" title="Flickr Commons: Do-it-yourself-woman" src="http://www.spellboundblog.com/wp-content/uploads/2009/09/3333357969_99f9a5c49a.jpg" alt="Flickr Commons: Do-it-yourself-woman" width="282" height="370" /></a>Each week brings announcements of archives launching new websites. Today both my email and Twitter told me about  <a title="UMBC Digital Collections" href="http://contentdm.ad.umbc.edu/">University of Maryland, Baltimore County&#8217;s new Digital Collections</a> site. Who can resist peeking at new materials available online?</p>
<p>I have spent much of the past year learning the details of <a title="Wikipedia: Search Engine Optimization" href="http://en.wikipedia.org/wiki/Search_engine_optimization">Search Engine Optimization</a>. Usually shortened to SEO, this simply refers to the use of techniques which improve the traffic sent to a website via <a title="Wikipedia: Organic Search" href="http://en.wikipedia.org/wiki/Organic_search">organic search</a>. Want your webpage to show up at the top of the list for a specific search in Google? You want to work on your SEO.</p>
<p>So when I look at new archives website, I can&#8217;t help but keep an eye open for how well the site is optimized for search engines.</p>
<p>I hope that UMBC will forgive me for nitpicking their new site. A lot of their choices are great for SEO,  but they also have room for improvement.</p>
<p><strong>Things Done Well for SEO<br />
</strong></p>
<ul>
<li><strong>Home Page Title &amp; Description</strong>: The site&#8217;s home page has a good meta description. This is the text displayed below the link on a search results page &#8211; as shown below:<img class="alignnone size-full wp-image-723" title="UMBC Digital Collection Google Result" src="http://www.spellboundblog.com/wp-content/uploads/2009/09/umbc_google_result.jpg" alt="UMBC Digital Collection Google Result" width="450" height="83" /></li>
<li><strong>Unique Page Titles At Collection Level</strong>: Each photography collection homepage has a unique page title and a nice block of explanatory text. Google can only read words &#8211; so the more unique text on a page, the better the job Google can do in figuring out what your page is about. Example: <a title="Ardsley Park Album" href="http://contentdm.ad.umbc.edu/ardsley.php">Ardsley Park Album</a></li>
<li><strong>Good <a title="Wikipedia: Anchor Text" href="http://en.wikipedia.org/wiki/Anchor_text">anchor text</a></strong>: (also known as link text) The words used in anchor text tells search engines information about the destination page. For example, the blue text below is anchor text.<a title="Back view of Bretz's portable wet plate case " href="http://contentdm.ad.umbc.edu/u?/georgebretz,63"> </a><a title="Back view of Bretz's portable wet plate case " href="http://contentdm.ad.umbc.edu/u?/georgebretz,63"><img class="size-full wp-image-724 aligncenter" title="UMBC Anchor Text Example" src="http://www.spellboundblog.com/wp-content/uploads/2009/09/UMBC-anchor-text.jpg" alt="UMBC Anchor Text Example" width="215" height="191" /></a></li>
</ul>
<p><strong>Areas for SEO Improvement</strong></p>
<ul>
<li><strong>Unique Page Titles At Item Level</strong>: Individual images and documents all use a generic page title such as &#8216;UMBC | Digital Archive | Document Viewer&#8217;. Document Example: <a title="Accidental Death of an Anarchist" href="http://contentdm.ad.umbc.edu/u?/theatreprod,1080">Accidental Death of an Anarchist</a> Image Example: <a title="Image: 10 year old Bootblack" href="http://contentdm.ad.umbc.edu/u?/hinecoll,3957">10 year old Bootblack</a></li>
<li><strong>H1 Tags</strong>: In the HTML of each page, the dominant heading of the page should use the &lt;h1&gt; tag. This helps Google know the phrase you are targeting with this page. It is your 2nd best place to emphasize your content after the page title. In the case of the item pages, there seems to often be a headline type title at the top of the page &#8211; but it currently is not an demarcated with an &lt;h1&gt; tag.</li>
<li><strong>Think About Search Results and Indexing</strong>: Pages displaying <a title="UMBC Digital Collections: Search for Bootblack" href="http://contentdm.ad.umbc.edu/cdm4/results.php?CISOOP1=all&amp;CISOBOX1=bootblack&amp;CISOFIELD1=CISOSEARCHALL&amp;CISOOP2=exact&amp;CISOBOX2=&amp;CISOFIELD2=CISOSEARCHALL&amp;CISOOP3=any&amp;CISOBOX3=&amp;CISOFIELD3=CISOSEARCHALL&amp;CISOOP4=none&amp;CISOBOX4=&amp;CISOFIELD4=CISOSEARCHALL&amp;CISOROOT=all&amp;t=a">results of internal searches</a> on your site are not likely to be useful as indexed pages in Google. The thinking here is that they can dilute the focus on the item and collection level pages on your site if Google also has many search results pages in the index. If UMBC wanted their search pages to be indexed, then those pages&#8217; URLs should be simplified and the search results pages need a page title that somehow includes the search criteria. There are two ways that I know of to disable this indexing &#8211; <a title="Wikipedia: Robots Exclusion Standard" href="http://en.wikipedia.org/wiki/Robots_exclusion_standard">blocking via the site&#8217;s robots.txt file</a> or via a <a title="Robots Meta Tag" href="http://www.robotstxt.org/meta.html">robots meta tag</a> in the header of the search results page. Both of these methods tell obliging search engines to not crawl certain parts of your site.</li>
</ul>
<p><strong>Final Thoughts<br />
</strong></p>
<p>There are plenty of other things that UMBC could do to support this new website. They could create an XML sitemap of all their pages and submit it to Google (maybe they already have). They might re-title some of their pages based on using a tool like <a title="Google Insight into Search" href="http://www.google.com/insights/search/#">Google Insight</a> to see what variations of a phrase is searched on most frequently. My goal here was to give you a taste of the sorts of things that catch my eye. Also, SEO is still more of an art than a science &#8211; so you will sometimes notice that what one SEO expert recommends is the opposite of what the next expert would tell you.</p>
<p>In many cases changes, such as the Unique Page Title at the Item Level mentioned above, may not even be possible due to software or programmer resource limitations. The trick is to take advantage of every option that is available. There are also trade-offs to be made. UMBC&#8217;s site provides some very slick interfaces for viewing the details of a group of documents, such as <a title="Theatre Department Production Materials Archive" href="http://contentdm.ad.umbc.edu/cdm4/browse.php?CISOROOT=/theatreprod">theater programs and other materials related to a theatrical production</a>. The imlementation elegantly handles the situation of multiple scanned images which relate to a coherent set of documents. Sometimes you can&#8217;t have both your innovative UI and perfect SEO. Then it gets down to what your goals are for your website. Are you trying to make a specific community of existing users happy by providing them with tools they can use? Or does your mission focus more on reaching out to a broader audience?</p>
<p>There is no silver bullet to search engine optimization. It just takes knowledge of the available tools and techniques combined with a willingness to keep learning and experimenting. Like the &#8216;<a title="Doe-het-zelf vrouw /Do-it-yourself-woman" href="http://www.flickr.com/photos/nationaalarchief/3333357969/">Do-It-Yourself-Woman</a>&#8216; pictured above in the <a title="Flickr Commons: Nationaal Archief" href="http://www.flickr.com/people/nationaalarchief/">Nationaal Archief</a>&#8216;s photo I found out on the Flickr Commons, you too can learn the basics and do-it-yourself. A great starting point is <a title="Google SEO Guide" href="http://www.google.com/webmasters/docs/search-engine-optimization-starter-guide.pdf">Google&#8217;s free SEO Guide</a>. Also, please remember that the best time to plan your SEO strategy is before you have built your site in the first place!</p>
<p>I would love to do research on how much progress archives websites can make in their organic search traffic after SEO improvements. My thinking is to take a snapshot of a month of <a title="Wikipedia: Analytics" href="http://en.wikipedia.org/wiki/Analytics">analytics</a> (the statistics that tell you how many people are visiting your website) and then apply some SEO inspired changes. After a suitable delay (it takes some time for SEO to do its job) we consider another month of analytics to determine any change in organic traffic.</p>
<p>Do you want me to do a quick review of your archives website to see if there is room for SEO improvement? Please <a title="Contact Jeanne" href="http://www.spellboundblog.com/contact/">contact me</a> or add a comment to this post. I feel like there is a conference presentation in all this if we can find a good set of websites to optimize.</p>
<p>Finally, thank you to unsuspecting UMBC &#8211; your new website really is beautiful.</p>
<p><em>Image credit: <a title="Doe-het-zelf vrouw /Do-it-yourself-woman" href="http://www.flickr.com/photos/nationaalarchief/3333357969/">Doe-het-zelf vrouw /Do-it-yourself-woman</a> from Nationaal Archief on Flickr Commons.</em></p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2009/09/12/seo-evaluation-archival-websites-umbc/">SEO Evaluation of an Archival Website: Looking at UMBC&#8217;s Digital Collections</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2009/09/12/seo-evaluation-archival-websites-umbc/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Yahoo &amp; Google&#8217;s Search for Reusable Images and the Flickr Commons</title>
		<link>http://www.spellboundblog.com/2009/06/13/yahoo-google-search-reusable-images-flickr-commons/</link>
		<comments>http://www.spellboundblog.com/2009/06/13/yahoo-google-search-reusable-images-flickr-commons/#comments</comments>
		<pubDate>Sat, 13 Jun 2009 17:03:27 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[copyright]]></category>
		<category><![CDATA[photography]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=584</guid>
		<description><![CDATA[When I read about Yahoo Image Search&#8217;s recent addition of a filter to return only creative commons Flickr images, I got all excited about what this might mean for images in the Flickr Commons. So I raced off to the Yahoo Image Search page to see how it works. The short answer is that the [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2009/06/13/yahoo-google-search-reusable-images-flickr-commons/">Yahoo &#038; Google&#8217;s Search for Reusable Images and the Flickr Commons</a></p>
]]></description>
			<content:encoded><![CDATA[<p>When I read about Yahoo Image Search&#8217;s recent <a title="Yahoo Search Blog: Find Images to Use and Reuse with the New Creative Commons Filter" href="http://www.ysearchblog.com/2009/05/26/find-images-to-use-and-reuse-with-the-new-creative-commons-filter/">addition of a filter to return only creative commons Flickr images</a>, I got all excited about what this might mean for images in the <a title="Flickr Commons" href="http://www.flickr.com/commons">Flickr Commons</a>. So I raced off to the <a title="Yahoo Image Search" href="http://images.search.yahoo.com">Yahoo Image Search</a> page to see how it works. The short answer is that the new special rights setting of  <a title="Flickr Commons: No Known Rights Restrictions" href="http://www.flickr.com/commons/usage/">no known copyright restrictions</a> that they created for members of the Flickr Commons apparently doesn&#8217;t count.</p>
<p>For my test I searched for an exact match on <a title="Yahoo Image Search: Ticket with portrait of George Washington" href="http://images.search.yahoo.com/search/images?p=%22Ticket+with+portrait+of+George+Washington%22&amp;fr=sfp&amp;ei=utf-8&amp;x=wrt&amp;y=Search">&#8220;Ticket with portrait of George Washington&#8221;</a>. This returns one result &#8211; the <a title="Flickr: Ticket with portrait of George Washington" href="http://www.flickr.com/photos/field_museum_library/3589463633/">one image in Flickr</a> with the same name, from <a title="Flickr Commons: The Field Museum" href="http://www.flickr.com/photos/field_museum_library/">The Field Museum</a> in Flickr Commons. If you click on the &#8216;More Filters&#8217; link, you will see other ways to filter your <img class="size-full wp-image-593 alignleft" title="Creator permits reuse - Yahoo image search" src="http://www.spellboundblog.com/wp-content/uploads/2009/06/cc-img.JPG" alt="Creator permits reuse - Yahoo image search" width="243" height="121" />results &#8211; including the option to restrict your results to only include images whose creators permit reuse.</p>
<p>Next I clicked in the &#8216;Creator allows reuse&#8217; and my one result disappeared! Quite disappointing in my book.</p>
<p>Google is also getting onto the &#8216;make it easy to search for reusable images&#8217; bandwagon. <a title="Search Engine Land" href="http://searchengineland.com/">Search Engine Land</a> reported that <a title="Search Engine Land: Google Images Quietly Adds Creative Commons Filter" href="http://searchengineland.com/google-images-quietly-adds-creative-commons-filter-20847">Google Images Quietly Adds Creative Commons Filter</a>. That post pointed me to <a title="Google Operating System Blog" href="http://googlesystem.blogspot.com/">Google  Operating System</a>&#8216;s search interface that lets you <a title="Find Creative Commons Images in Google Image Search" href="http://googlesystem.blogspot.com/2009/06/find-creative-commons-images-in-google.html">play with the options that Google has available</a>. After a clicking through to some of the images returned by a <a title="Google Image Search: Archives" href="http://images.google.com/images?as_rights=(cc_publicdomain|cc_attribute|cc_sharealike|cc_noncommercial|cc_nonderived)&amp;hl=en&amp;sa=1&amp;q=archives&amp;btnG=Search+Images&amp;aq=f&amp;oq=">Google Image Search for creative commons images of archives</a>, the way the Google model <em>appears </em>to work is to look for <a title="Creative Commons" href="http://creativecommons.org/">creative commons</a> badges or links on the page with the image. I even found Flickr creative commons images, but when I tried to find my Flickr Commons image of the ticket used above for my Yahoo image search experiment it wasn&#8217;t returned by Google either.</p>
<p>So if an archives (or museum or library) posts images on a page that indicates that the content is licensed under creative commons, it seems those images will then appear in Google&#8217;s image search as reusable. That is good news! Another way to get users to find your public domain images.</p>
<p>The question I am left is how to resolve the gap between <a title="Flickr Commons: Rights Statement" href="http://www.flickr.com/commons/usage/">Flickr Commons&#8217; &#8216;no known copyright restrictions  rights statement</a> and both Google and Yahoo&#8217;s definition of reusable content.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2009/06/13/yahoo-google-search-reusable-images-flickr-commons/">Yahoo &#038; Google&#8217;s Search for Reusable Images and the Flickr Commons</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2009/06/13/yahoo-google-search-reusable-images-flickr-commons/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Sunshine Week 2009: Archives, Records and Other Online Government Information</title>
		<link>http://www.spellboundblog.com/2009/03/18/sunshine-week-archives-records-online-government-information/</link>
		<comments>http://www.spellboundblog.com/2009/03/18/sunshine-week-archives-records-online-government-information/#comments</comments>
		<pubDate>Wed, 18 Mar 2009 04:23:44 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[at risk records]]></category>
		<category><![CDATA[born digital records]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[journalism]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[transcription]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/?p=376</guid>
		<description><![CDATA[Sunshine Week 2009 is a national initiative spearheaded by journalists to &#8220;open a dialogue about the importance of open government and freedom of information&#8221;. The Electronic Frontier Foundation (EFF) chose to mark Sunshine Week this year by announcing the release their new tool for searching EFF&#8217;s FOIA documents. Learn more about EFF&#8217;s efforts to make [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2009/03/18/sunshine-week-archives-records-online-government-information/">Sunshine Week 2009: Archives, Records and Other Online Government Information</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a title="Sunshine Week" href="http://www.sunshineweek.org/"><img class="alignright size-full wp-image-380" title="Sunshine Week" src="http://www.spellboundblog.com/wp-content/uploads/2009/03/sunshine-week-logo.jpg" alt="Sunshine Week" width="99" height="150" /></a> <a href="http://www.sunshineweek.org/sunshineweek/about">Sunshine Week</a> 2009 is a national initiative spearheaded by journalists to &#8220;open a dialogue about the importance of open government and freedom of information&#8221;. The <a title="Electronic Frontier Foundation" href="http://www.eff.org/">Electronic Frontier Foundation</a> (EFF) chose to mark Sunshine Week this year by <a title="EFF Launches Search Tool for Uncovered Government Documents" href="http://www.eff.org/press/archives/2009/03/16">announcing</a> the release their new <a title="Search EFF FOIA Documents" href="http://www.eff.org/issues/foia/search">tool for searching EFF&#8217;s FOIA documents</a>. Learn more about EFF&#8217;s efforts to make open government a reality in this <a title="Help EFF Make Open Government a Reality" href="http://www.eff.org/deeplinks/2009/03/foia">EFF call to action</a>.</p>
<p>The <a title="Sunshine Week 2009 Survey of State Government Info. Online" href="http://sunshineweek.blogs.com/my_weblog/2009/03/most--americans-can-easily-find-videos-of-water-skiing-squirrels-on-the--internet-but-theyll-have-less-luck-finding-out.html">Sunshine Week blog announced</a> the release of a <a title="2009 Survey Of State Government Information Online" href="http://www.sunshineweek.org/sunshineweek/state_govt_online_survey_09">2009 Survey Of State Government Information Online</a>. The survey results explains:</p>
<p style="padding-left: 30px;">Using a <a title="Sunshine Week 2009 Survey Worksheet" href="http://www.sunshineweek.org/files/transparency_worksheet_09.pdf">standardized worksheet</a> surveyors rated each section on its usability, looking at factors such as whether the information was clearly linked, if full reports or only summaries were available, whether viewing and/or downloading was free, and whether the data were current. The categories for the survey were selected for generally serving the overall public good — the kind of information people need for their own health and well-being and that of the community.</p>
<p>See the <a title="Sunshine Week 2009 Survey Worksheet" href="http://www.sunshineweek.org/files/transparency_worksheet_09.pdf">worksheet</a> for details on the categories selected for inclusion in the survey and the <a title="Sunshine Week 2009 Survey Results" href="http://www.sunshineweek.org/sunshineweek/state_govt_online_survey_09">results</a> for lots of interesting tidbits about exactly which states provide access (or not) to various public information online. A few very randomly selected highlights:</p>
<ul>
<li><em>Maryland:</em> Nursing home information, <a href="http://%20mhcc.maryland.gov/consumerinfo/nhguide">mhcc.maryland.gov/consumerinfo/nhguide</a>, got high marks for facilitating online search and for allowing users to &#8220;compare data in a variety of ways.&#8221;</li>
<li><em>Iowa:</em> The state auditor&#8217;s office reportedly offers online more than 5,000 full reports of all its audits dating back to 2001. The audits are easily accessible from tabs on the main Web page, <a href="http://%20www.auditor.iowa.gov/">www.auditor.iowa.gov</a>.</li>
<li><em>Colorado:</em> Bridge inspection reports in Colorado are considered public, but they are not published online. Anyone who wants to see the reports is advised to file an FOI request.</li>
</ul>
<p>All of this made me recall my blog post about the <a title="Understanding Born Digital Records: Journalists and Archivists with Parallel Challenges" href="http://www.spellboundblog.com/2007/02/17/understanding-born-digital-records-journalists-and-archivists-with-parallel-challenges/">parallel goals of journalists and archivists</a> when considering digital public records and databases. I wanted to celebrate Sunshine Week by looking for other online sources of government information. My first stop was the website of the <a title="Council of State Archivists" href="http://www.statearchivists.org/index.htm">Council of State Archivists</a> (CoSA). They had a couple of great resources including:</p>
<ul>
<li>A 2007 <a title="CoSA: State of State Records" href="http://www.statearchivists.org/reports/2007-ARMreport/index.htm">status report on the state of State Records</a> (and it <a href="http://www.statearchivists.org/projects/2008survey/index.htm">looks like a new report should be out soon</a> &#8211; their 2008 survey just closed at the end of January 2009)</li>
<li><a title="CoSA:  	 Directory of State Archives and Records Programs" href="http://www.statearchivists.org/states.htm">Directory of State Archives and Records Programs</a></li>
<li>Details on their <a title="CoSA: Local Government Project" href="http://www.statearchivists.org/lga/index.htm">Local Government Project</a></li>
</ul>
<p>A bit further afield we find <a title="GovernmentDocs.org" href="http://www.governmentdocs.org">GovernmentDocs.org</a> advertised as a &#8220;community government document reviewer system&#8221;. On their <a title="About GovernmentDocs.org" href="http://www.governmentdocs.org/About.aspx">about page</a> we read:</p>
<p style="padding-left: 30px;">With the GovernmentDocs.org system, citizen reviewers can engage in the government accountability process like never before. Registered users can review and comment on documents, adding their insights and expertise to the work of the national nonprofit organizations which are partnering on this project. This new information then becomes instantly searchable. The text of each document is searchable, as well, thanks to a powerful Optical Character Recognition (OCR) functionality.</p>
<p style="padding-left: 30px;">GovernmentDocs.org adds a powerful layer to government transparency and accountability by indexing documents in a user-friendly manner that is remarkably easy to share. Every page of every document has its own unique url, allowing you and other users to link to that page on blogs, send emails about the documents to friends, and expose the information to a wider audience.</p>
<p>Here is an <a title="GovernmentDocs: ESA page 82" href="http://www.governmentdocs.org/Doc/PageView.aspx?DocId=848&amp;PagNum=82">example GovernmentDocs page</a> taken from a request submitted by <a title="CREW" href="http://citizensforethics.org/">CREW</a> (Citizens for Responsibility and Ethics in Washington) regarding the Endangered Species Act. Each GovernmentDocs page has a unique URL, full text transcription of the page and supports comments and reviews. The possibility of building up a community around these records is very real. I am curious to see how many citizen reviewers and comments are associated with these documents a year from now.</p>
<p>Please help celebrate Sunshine Week by exploring all these amazing resources!</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2009/03/18/sunshine-week-archives-records-online-government-information/">Sunshine Week 2009: Archives, Records and Other Online Government Information</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2009/03/18/sunshine-week-archives-records-online-government-information/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Tackles Magazine Archives</title>
		<link>http://www.spellboundblog.com/2008/12/10/google-tackles-magazine-archives/</link>
		<comments>http://www.spellboundblog.com/2008/12/10/google-tackles-magazine-archives/#comments</comments>
		<pubDate>Wed, 10 Dec 2008 06:13:51 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[future-proofing]]></category>
		<category><![CDATA[historical research]]></category>
		<category><![CDATA[information visualization]]></category>
		<category><![CDATA[interface design]]></category>
		<category><![CDATA[journalism]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2008/12/10/google-tackles-magazine-archives/</guid>
		<description><![CDATA[As has been reported around the web today, Google is now digitizing and adding magazines to Google Book Search. This follows on the tails of the recent Google Life Photo archive announcement. I took a look around to see what I could see. I was intrigued by the fact that I couldn&#8217;t see a list [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2008/12/10/google-tackles-magazine-archives/">Google Tackles Magazine Archives</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a title="Google Book Search: Popular Mechanics Jan 1905" href="http://books.google.com/books?id=S98DAAAAMBAJ&amp;printsec=frontcover&amp;source=gbs_summary_r&amp;cad=0_0"><img src="http://www.spellboundblog.com/wp-content/uploads/2008/12/popmech.JPG" alt="Google Book Search: Popular Mechanics Jan 1905 Cover Image" width="275" height="395" align="right" /></a>As has been reported around the web today, Google is now digitizing and adding magazines to <a title="Google Book Search" href="http://books.google.com/books">Google Book Search</a>. This follows on the tails of the recent <a title="Google Life Photo Archive Blog Post" href="http://www.spellboundblog.com/2008/11/22/life-photo-archive-digitized-and-put-online-by-google/">Google Life Photo archive</a> announcement.</p>
<p>I took a look around to see what I could see. I was intrigued by the fact that I couldn&#8217;t see a list of all the magazines in their collection. So I went after the information the hard way and kept reloading the Google Book Search home page until I didn&#8217;t see any new titles displayed in their highlighted magazine section. This is what I came up with, roughly grouped by general topic groupings.</p>
<p>Science and technology:</p>
<ul>
<li><a title="The Bulletin of Atomic Scientists Archive" href="http://books.google.com/books?id=swoAAAAAMBAJ">The Bulletin of the Atomic Scientists</a>: which started out as the Bulletin of the Atomic Scientists of Chicago in December of 1945 through November of 1998</li>
<li><a title="CIO Magazine Archives" href="http://books.google.com/books?id=jwsAAAAAMBAJ">CIO: The Magazine for Information Executives</a>: back to Volume 1, Number 1 from Sept/Oct 1987</li>
<li><a title="Maximum PC Magazine Archives" href="http://books.google.com/books?id=cAIAAAAAMBAJ">Maximum PC</a>: October 1998 through the present</li>
<li><a title="Popular Science Magazine Archive" href="http://books.google.com/books?id=NkmBuPFIfaMC">Popular Science</a>: stretching back to an issue for March of 1872 when it was known as Popular Science Monthly through to February 2008</li>
<li><a title="Popular Mechanics Magazine Archive" href="http://books.google.com/books?id=UtMDAAAAMBAJ">Popular Mechanics</a>: January 1905 through November 2005</li>
</ul>
<p>Lifestyle and city themed:</p>
<ul>
<li><a title="New York Magazine Archive" href="http://books.google.com/books?id=hBgAAAAAMBAJ">New York Magazine</a>:  April 1968 through December 1997. Fascinating that some of the magazines still have the original mailing label on them (see this example from <a title="New York Magazine July 1969 Cover" href="http://books.google.com/books?id=jMcDAAAAMBAJ&amp;printsec=frontcover">a July 1969 issue of New York</a> )</li>
<li><a title="Cincinnati Magazine Archive" href="http://books.google.com/books?id=QB8DAAAAMBAJ">Cincinnati Magazine</a>: January 1971 through December 2005, at which point it seems to switch to being an annual city guide titled Cincinnati USA</li>
<li><a title="Atlanta Magazine Archive" href="http://books.google.com/books?id=ng8AAAAAMBAJ">Atlanta</a>: January 2003 through August 2008 &#8211; and mis-titled &#8216;Atlants&#8217;</li>
<li><a title="Indianapolis Monthly Archive" href="http://books.google.com/books?id=POsCAAAAMBAJ">Indianapolis Monthly</a>: January 1995 to the present</li>
<li><a title="Cruise Travel Magazine Archive" href="http://books.google.com/books?id=jjEDAAAAMBAJ">Cruise Travel</a>: June 1979 through December 2007</li>
</ul>
<p>African American:</p>
<ul>
<li><a title="Ebony Jr! Magazine Archive" href="http://books.google.com/books?id=wr4DAAAAMBAJ">Ebony Jr!</a>: May 1973 through October 1985</li>
<li><a title="Jet Magazine Archive" href="http://books.google.com/books?id=87MDAAAAMBAJ">Jet</a>: November 1961 through October 2008</li>
<li><a title="Black Digest Magazine Archive" href="http://books.google.com/books?id=MbIDAAAAMBAJ">Black Digest</a>: Named &#8216;Negro Digest&#8217; from November 1961 through April 1970, then Black Digest from May 1970 through April 1976.</li>
</ul>
<p>Health, nutrition and organic:</p>
<ul>
<li><a title="Women's Health Magazine Archive" href="http://books.google.com/books?id=wMUDAAAAMBAJ">Women&#8217;s Health</a> and <a title="Men's Health Magazine Archive" href="http://books.google.com/books?id=McgDAAAAMBAJ">Men&#8217;s Health</a>: January 2006 through present. I found it very amusing to be able to scan the covers of all the issues so easily &#8211; true for all of these magazines of course, but funny to see cover after cover of almost identically clad men and women exercising.</li>
<li><a title="Prevention Magazine Archive" href="http://books.google.com/books?id=YccDAAAAMBAJ">Prevention</a>: January 2006 through the present</li>
<li><a title="Better Nutrition Magazine Archive" href="http://books.google.com/books?id=lAUAAAAAMBAJ">Better Nutrition</a>: January 1999 through December 2004</li>
<li><a title="Organic Gardening Magazine Archive" href="http://books.google.com/books?id=esMDAAAAMBAJ">Organic Gardening</a>: November 2005 to the present</li>
<li><a title="Vegetarian Times Magazine Archive" href="http://books.google.com/books?id=FQQAAAAAMBAJ">Vegetarian Times</a>: March1981 through November 2004</li>
</ul>
<p>Sports and the outdoors:</p>
<ul>
<li><a title="Baseball Digest Archive" href="http://books.google.com/books?id=8SsDAAAAMBAJ">Baseball Digest</a>: July 1945 through October 2007</li>
<li><a title="American Cowboy Magazine Archive" href="http://books.google.com/books?id=XeoCAAAAMBAJ">American Cowboy</a>: May 1994 through August 2008</li>
<li><a title="Bicycling Magazine Archive" href="http://books.google.com/books?id=rMUDAAAAMBAJ">Bicycling</a>, <a title="Mountain Bike Magazine" href="http://books.google.com/books?id=ZcQDAAAAMBAJ">Mountain Bike</a> and <a title="Runner's World Magazine Archive" href="http://books.google.com/books?id=FMgDAAAAMBAJ">Runner&#8217;s World</a>: January 2006 through present</li>
</ul>
<p>They of course promise more magazines on the way, so if you are reading this long after mid December 2008  I would assume there are more magazines and more issues available now. I hope that they make it easier to browse just magazines. Once they have a broader array of titles &#8211; how neat would it be to build a virtual news stand for a specific week in history? Shouldn&#8217;t be hard &#8211; they have all the metadata and cover images they need.</p>
<p>I love being able to read the magazine &#8211; advertising and all. They display the covers in batches by decade or 5 year period depending on the number of issues. I also like the Google map provided on each magazines &#8216;about&#8217; page that shows &#8216;Places mentioned in this magazine&#8217; and easily links you directly to the article that mentions the location marked on the map.</p>
<p>I think it is interesting that Google went with more of a PDF single scrolling model rather than an interface that mimics turning pages. In many issues (maybe all?) they have hot-linked the table of contents so that you can scroll down to that section instantly. You can also search within the magazine, though from my short experiments it seems that only the articles are text indexed and the advertisements are not.</p>
<p>Google&#8217;s current model for search is to return results for magazines mixed in with books in Google Book Search results &#8211; but they do let you limit your results to only magazines from their <a title="Advanced Google Book Search" href="http://books.google.com/advanced_book_search">Advanced Search page within Google Book Search</a>. See these results for a quick <a title="Google Book Search: sunscreen in magazines" href="http://books.google.com/books?as_q=sunscreen&amp;num=10&amp;btnG=Google+Search&amp;as_epq=&amp;as_oq=&amp;as_eq=&amp;as_brr=0&amp;as_pt=MAGAZINES&amp;lr=&amp;as_vt=&amp;as_auth=&amp;as_pub=&amp;as_sub=&amp;as_drrb=c&amp;as_miny=&amp;as_maxy=&amp;as_isbn=&amp;as_issn=">search on sunscreen in magazines</a>.</p>
<p>Overall I mark this as a really nice step forward in access to old magazines. As with many visualizations, seeing the about page for any of these magazines made me ask myself new questions.  It will be interesting to see how many magazines sign on to be included and how the interface evolves.</p>
<p>To read more about Google&#8217;s foray into magazine digitization and search take a look at:</p>
<ul>
<li><a title="Tech Crunch: Google Adds Print Magazines To Book Search" href="http://www.techcrunch.com/2008/12/09/google-adds-print-magazines-to-book-search/">Tech Crunch: Google Adds Print Magazines To Book Search </a></li>
<li><a title="Official Google Blog: Search and Find Magazines on Google" href="http://googleblog.blogspot.com/2008/12/search-and-find-magazines-on-google.html">Official Google Blog: Search and Find Magazines on Google</a></li>
<li><a title="Venture Beat Digital Media: Google Book Search: now with magazines!" href="http://venturebeat.com/2008/12/09/google-book-search-now-with-magazines/">Venture Beat Digital Media: Google Book Search: now with magazines!</a></li>
<li><a title="Washington Post/AP: Google updates search index with old magazines" href="http://www.washingtonpost.com/wp-dyn/content/article/2008/12/10/AR2008121000908.html?sub=AR">Washington Post/AP: Google updates search index with old magazines</a></li>
</ul>
<p>For a really nice analysis of the information that Google provides on the magazine pages see <a title="Search Engine Land: Google Book Search Puts Magazines Online" href="http://searchengineland.com/google-book-search-puts-magazines-online-15762.php">Search Engine Land&#8217;s Google Book Search Puts Magazines Online</a>.</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2008/12/10/google-tackles-magazine-archives/">Google Tackles Magazine Archives</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2008/12/10/google-tackles-magazine-archives/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>LIFE Photo Archive Digitized and Put Online by Google</title>
		<link>http://www.spellboundblog.com/2008/11/22/life-photo-archive-digitized-and-put-online-by-google/</link>
		<comments>http://www.spellboundblog.com/2008/11/22/life-photo-archive-digitized-and-put-online-by-google/#comments</comments>
		<pubDate>Sat, 22 Nov 2008 07:00:12 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[controlled vocabularies]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[funding]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[photography]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2008/11/22/life-photo-archive-digitized-and-put-online-by-google/</guid>
		<description><![CDATA[In news that would make any fan of old photographs drool, the Official Google Blog has announced that the LIFE Photo Archive is now available on Google Image Search.  The LIFE Photo Archive&#8217;s home page is neatly organized to encourage you to browse for images by decade, famous people and topics. There really is something [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2008/11/22/life-photo-archive-digitized-and-put-online-by-google/">LIFE Photo Archive Digitized and Put Online by Google</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a href="http://images.google.com/hosted/life/l?imgurl=4be14877a8ea4f88" title="TimeInc: Martha Graham, photo by Gjon Mili, 1941"><img src="http://www.spellboundblog.com/wp-content/uploads/2008/11/martha-graham.jpg" alt="Martha Graham" width="230" align="right" height="292" /></a>In news that would make any fan of old photographs drool, the <a href="http://googleblog.blogspot.com/" title="Official Google Blog">Official Google Blog</a> has announced that the <a href="http://googleblog.blogspot.com/2008/11/life-photo-archive-available-on-google.html" title="LIFE Photo Archive available on Google Image Search">LIFE Photo Archive is now available on Google Image Search</a>.  The <a href="http://images.google.com/hosted/life" title="LIFE Photo Archive">LIFE Photo Archive&#8217;s home page</a> is neatly organized to encourage you to browse for images by decade, famous people and topics.</p>
<p>There really is something for everyone here. I picked this striking image of <a href="http://en.wikipedia.org/wiki/Martha_Graham" title="Wikipedia: Martha Graham">Martha Graham</a> because I love modern dance, but there are also images of war, fashion, sports, landscapes, architecture and tons more. The images currently posted stretch from the 1750s through 2003 and include many that have never before been published.</p>
<p>It is also worth noting that not everything in this collection is a photograph. I found illustrated pages from books like the <a href="http://images.google.com/hosted/life/l?imgurl=3e8435ecdfee184b" title="Queen Summer">Queen Summer by Walter Crane</a>. I also found illustrations like this one of <a href="http://images.google.com/hosted/life/l?imgurl=f218f88474dca5ee" title="Temple of Artemis: Ephesus, Turkey">the ancient temple of Artemis in Ephesus, Turkey</a>.</p>
<p>From the text in Google&#8217;s blog post, it sounds as if Google is doing the digitizing &#8211; while LIFE Inc (or their parent company <a href="http://www.timeinc.com" title="Time Inc">Time Inc</a>) will profit from the sale of prints. The current posted photos represent 20% of all the photos. Ultimately the photo archive is expected to be about 10 million images and stretch to the present day.</p>
<p><a href="http://life.qoop.com/images/13051022" title="TimeInc: Trees in Snow"><img src="http://www.spellboundblog.com/wp-content/uploads/2008/11/tree-snow.jpg" alt="TimeInc: St. Moritz, Switzerland, February 1947" align="left" /></a>TimeInc has partnered with <a href="http://www.qoop.com/" title="QOOP">QOOP</a> to sell framed fine art photographs via links directly from each of the LIFE photo pages within Google. Take a look at the page dedicated to selling you a  <a href="http://life.qoop.com/images/13051022" title="Framed LIFE photo: St. Moritz, Switzerland in 1947">framed art print of St. Moritz, Switzerland in 1947</a>.</p>
<p>I had not heard of QOOP before, but they seem to have a number of <a href="https://my.qoop.com/design/partner_page.php" title="QOOP Partner Options">options available for those who want to be their partner</a>. QOOP also wanted me to join their &#8216;Social Commerce Revolution&#8217;. In order to join the revolution I had to create an account (it is free and easy it pointed out). To create my account I had to give them my email address, password and birth date. Not so bad. On the next screen they required that I enter my mailing address and phone number. I don&#8217;t really want to give them this information &#8211; but I am curious about this revolution I have been promised. And when I was done &#8211; a whole lot of nothing happened. I think that I am supposed to use QOOP to create and market products. Is it an affiliate program? Is it an artsier <a href="http://www.cafepress.com" title="CafePress">CafePress</a>? Do I need to contribute my own images or can I use those of others? I am still not sure.</p>
<p>I do like their integration with the LIFE images, but I think that there is clearly more work to be done before they are going to foster a &#8216;Social Commerce Revolution&#8217; anytime soon.</p>
<p><a href="http://www.timeinc.com/pressroom/detail.php?id=releases/11182008.php" title="TimeInc Press Release">Time Inc&#8217;s press release</a> includes the following details:</p>
<blockquote><p>The LIFE Photo Archive featured on Google will be among the largest professional photography collections on the Web and one of the largest scanning projects ever undertaken. Millions of images have been scanned and made available on Google Image Search today with all 10 million images to be available in the coming months.</p>
<p>“For 70 years, LIFE has been about one thing, and that’s the power of photography to tell a story,” says Andy Blau, LIFE’s President. “LIFE will now reach a broader audience and engage them online with the incredible depth and breadth of the LIFE Photo Archive from serious world events, to Hollywood celebrities to whimsical photographs.” Time Inc. EVP, John Squires adds: “We’re delighted Google recognized the rich value of our photo archive and worked with us to bring it to millions of consumers. Consistent with the launch of the TIME Archive, PEOPLE Archive and the SI Vault, this initiative continues our efforts to build valuable new revenue opportunities from our rich heritage.”</p>
<p>All keywords are translated into 16 different   languages<strong>. </strong>LIFE’s Photo Archive will be scanned and available on Google Image Search free for personal and research purposes. Copyright and ownership of all images will remain with Time Inc.</p></blockquote>
<p>Google uses a special notation to support search across the LIFE collection &#8211; all you do is include source:life as one of your search terms within the Google Image search box. Each photo has a rich set of metadata including a description and the keywords mentioned in the press release above.</p>
<p>When you click on one of the keywords (which are actually called &#8216;Labels&#8217; in the Google interface) it submits a search within the LIFE collection &#8211; but does NOT restrict that search to only the keywords. Rather, it seems to search across all the text associated with each image. For example &#8211; the label &#8216;Feathers&#8217; on assorted images links to this URL: <a href="http://images.google.com/images?&amp;hl=en&amp;q=Feathers+source:life" title="LIFE Photo Archive search for Feathers">http://images.google.com/images?q=Feathers+source:life</a>. This search returns many images, including one of <a href="http://images.google.com/hosted/life/l?imgurl=835d55936404eae4" title="Debutante Marilyn Lowe wearing a dress made from feathers">Debutante Marilyn Lowe wearing a dress made from feathers</a> which is not in fact assigned the label &#8216;Feathers&#8217;, but obviously does include the word feathers in the description.</p>
<p>For those accustomed to hotlinked tag-like terms only retrieving content that also is assigned that term (see <a href="http://flickr.com/commons/tags/snow" title="Flickr Commons Tag Snow">all the images in the Flickr Commons tagged with &#8216;snow&#8217;</a>), this might be a bit confusing. Also in contrast with the <a href="http://flickr.com/commons" title="Flickr Commons">Flickr Commons</a>, Google does not offer the opportunity for users to assign additional labels/keywords to the images. If you have signed on with a Google account, you <em>can </em>assign images within the LIFE collection a star rating. I don&#8217;t see how this is used right now, but I expect that over time they will leverage these ratings to sort the Google hosted image search results.</p>
<p>Since I spend a lot of time these days organizing controlled vocabularies, seeing the keywords assigned to these images makes me wish I could see Time Inc&#8217;s full and organized set of terms. My favorite spotted so far? <a href="http://images.google.com/images?q=Lines+Of+People+source:life" title="LIFE Photo Archive of Lines of People">Lines of People</a> &#8211; as typified by this photograph of <a href="http://images.google.com/hosted/life/l?imgurl=e2699913c595da58" title="Models wearing checked outfits">Models wearing checked outfits</a> from 1958.</p>
<p>It will be interesting to see what other partnerships crop up in the next year to digitize other major collections. I am also very curious to know if people actually buy framed fine art prints at the current cost of $79.99 for an 8&#8243;x12&#8243; inch print in a 13&#8243;x16&#8243; frame. Who knows if Time Inc will be forthcoming with their degree of success on this front &#8211; but it will likely be an good test case for other major collections looking to recoup some of the cost of their digitization efforts and find a new revenue stream.</p>
<p><em>Image Credit: The copyright of both of the images shown above belong to Time Inc. Please click through to view details about each image, including the photographer&#8217;s name and the option to purchase your own print of the image. </em></p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2008/11/22/life-photo-archive-digitized-and-put-online-by-google/">LIFE Photo Archive Digitized and Put Online by Google</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2008/11/22/life-photo-archive-digitized-and-put-online-by-google/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Celebrating NASA&#8217;s 50th with NASA Images</title>
		<link>http://www.spellboundblog.com/2008/09/25/celebrating-nasas-50th-with-nasa-images/</link>
		<comments>http://www.spellboundblog.com/2008/09/25/celebrating-nasas-50th-with-nasa-images/#comments</comments>
		<pubDate>Thu, 25 Sep 2008 23:37:29 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[photography]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2008/09/25/celebrating-nasas-50th-with-nasa-images/</guid>
		<description><![CDATA[October 1st, 2008 marks the 50th anniversary of the National Aeronautics and Space Administration (NASA). It is interesting to contrast the feel of the NASA 50th Anniversary Web Site with the 15 NARA/NASA videos currently posted on Google Video, but my favorite site for celebrating NASA&#8217;s 50 year journey is the amazing NASA Images website. [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2008/09/25/celebrating-nasas-50th-with-nasa-images/">Celebrating NASA&#8217;s 50th with NASA Images</a></p>
]]></description>
			<content:encoded><![CDATA[<p><img src="http://www.spellboundblog.com/wp-content/uploads/2008/09/nasa.jpg" alt="NASA 50th" width="220" align="right" height="304" />October 1st, 2008 marks the <a href="http://blogs.nasa.gov/cm/blog/Shana%27s-Blog/posts/post_1190659953822.html" title="NASA Blog: NASA’s 50th Anniversary">50th anniversary of the National Aeronautics and Space Administration</a> (NASA). It is interesting to contrast the feel of the <a href="http://www.nasa.gov/50th/home/index.html" title="NASA 50th Anniversary Web Site">NASA 50th Anniversary Web Site</a> with the 15 <a href="http://video.google.com/videosearch?q=owner%3Anara+type%3Anasa#q=owner%3Anara%20type%3Anasa&amp;so=1" title="Google Video: NARA/NASA">NARA/NASA videos currently posted on Google Video</a>, but my favorite site for celebrating NASA&#8217;s 50 year journey is the amazing <a href="http://www.nasaimages.org/" title="NASA Images">NASA Images</a> website.</p>
<p>I learned about this site on a tour of the <a href="http://www.archive.org" title="Internet Archive">Internet Archive</a> while out in California for <a href="http://www.spellboundblog.com/category/saa2008/" title="SAA 2008">SAA 2008</a>. It contains still and moving images from across NASA. For the first time these visual materials have been pulled together and methodically assigned appropriate metadata. This means that you can do really nice advanced searches and faceted browsing of search results. Before this effort, there was no standardized set of attributes describing these visual materials being created across NASA.</p>
<p>The <a href="http://www.nasaimages.org/About.html">NASA Images about page</a> explains:</p>
<blockquote><p> NASA Images is a service of Internet Archive ( www.archive.org ), a non-profit library, to offer public access to NASA&#8217;s images, videos and audio collections. NASA Images is constantly growing with the addition of current media from NASA as well as newly digitized media from the archives of the NASA Centers.</p>
<p>The goal of NASA Images is to increase our understanding of the earth, our solar system and the universe beyond in order to benefit humanity.</p></blockquote>
<p>It goes on to explain that the Internet Archive receives no financial support from NASA and that NASA Images is currently funded through a grant from the <a href="http://www.guidestar.org/pqShowGsReport.do?partner=grantexplorer&amp;grantType=grant&amp;ein=91-1816164&amp;gotoNext=/reports/partners/grantexplorer/showGrants.jsp" title="GuideStar: Kahle/Austin Foundation">Kahle-Austin Foundation</a>. They are currently looking for new grants and sponsorships to fund upcoming projects.</p>
<p>Also, according to their published <a href="http://www.nasaimages.org/Terms.html" title="NASA Images Terms">Terms and Conditions</a>, they have made an effort to only include non-copyrighted images (though they don&#8217;t guarantee it). This is an amazing wealth of images and movies available for public use. The terms state that &#8220;You may use this NASA imagery for educational or informational purposes, including photo collections, textbooks, public exhibits and Internet Web pages (personal or otherwise). &#8221;</p>
<blockquote></blockquote>
<p>I have embedded below <a href="http://www.nasaimages.org/luna/servlet/detail/NVA2~4~4~6195~106721:Movie--The-Active-Sun?pgs=50&amp;res=1&amp;cic=nasaNAS%7E10%7E10%2CnasaNAS%7E12%7E12%2CnasaNAS%7E13%7E13%2CnasaNAS%7E16%7E16%2CnasaNAS%7E20%7E20%2CnasaNAS%7E22%7E22%2CnasaNAS%7E2%7E2%2CnasaNAS%7E4%7E4%2CnasaNAS%7E5%7E5%2CnasaNAS%7E6%7E6%2CnasaNAS%7E7%7E7%2CnasaNAS%7E8%7E8%2CnasaNAS%7E9%7E9%2CNSVS%7E3%7E3%2CNVA2%7E13%7E13%2CNVA2%7E1%7E1%2CNVA2%7E4%7E4%2CNVA2%7E8%7E8%2CNVA2%7E9%7E9" title="NASA Movie: The Active Sun">footage of plumes of hot gas shooting across the surface of the sun</a>. Check out this photo of the <a href="http://nasaimages.org/luna/servlet/detail/nasaNAS~5~5~22981~127221:Original-7-Astronauts-in-Spacesuits?pgs=50&amp;res=1&amp;cic=nasaNAS%7E10%7E10%2CnasaNAS%7E12%7E12%2CnasaNAS%7E13%7E13%2CnasaNAS%7E16%7E16%2CnasaNAS%7E20%7E20%2CnasaNAS%7E22%7E22%2CnasaNAS%7E2%7E2%2CnasaNAS%7E4%7E4%2CnasaNAS%7E5%7E5%2CnasaNAS%7E6%7E6%2CnasaNAS%7E7%7E7%2CnasaNAS%7E8%7E8%2CnasaNAS%7E9%7E9%2CNSVS%7E3%7E3%2CNVA2%7E13%7E13%2CNVA2%7E1%7E1%2CNVA2%7E4%7E4%2CNVA2%7E8%7E8%2CNVA2%7E9%7E9" title="NASA Images: Original 7 Astronauts - 1968">original 7 astronauts in their very shiny spacesuits from 1968</a>. Happy Golden Anniversary NASA!</p>
<p><iframe src="http://www.nasaimages.org/luna/servlet/detail/NVA2~4~4~6195~106721:Movie--The-Active-Sun?embedded=true&amp;widgetFormat=javascript&amp;widgetType=detail&amp;controls=1&amp;nsip=1" id="widgetPreview" border="0px" style="border: 0px solid white" width="525" frameborder="0" height="350"></iframe></p>
<p><em>Image Credit:  <a href="http://www.nasa.gov/vision/earth/technologies/nextfest_2007.html" title="NASA 50th Anniversary Logo">NASA 50th Anniversary logo</a> designed by Crabtree + Company. <a href="http://www.crabtreecompany.com/nasa50.htm" title="Thoughts Behind NASA 50 Logo Creation">Read more about Crabtree + Company&#8217;s thoughts behind the creation of the logo</a>.</em></p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2008/09/25/celebrating-nasas-50th-with-nasa-images/">Celebrating NASA&#8217;s 50th with NASA Images</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2008/09/25/celebrating-nasas-50th-with-nasa-images/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>SAA2008: Revealing Archival Collections at the Web&#8217;s Surface (Session 102)</title>
		<link>http://www.spellboundblog.com/2008/09/02/saa2008-revealing-archival-collections-at-the-webs-surface-session-102/</link>
		<comments>http://www.spellboundblog.com/2008/09/02/saa2008-revealing-archival-collections-at-the-webs-surface-session-102/#comments</comments>
		<pubDate>Wed, 03 Sep 2008 01:40:14 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[context]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[learning technology]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[outreach]]></category>
		<category><![CDATA[photography]]></category>
		<category><![CDATA[SAA2008]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[teacher resources]]></category>
		<category><![CDATA[virtual collaboration]]></category>
		<category><![CDATA[web 2.0]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2008/09/02/saa2008-revealing-archival-collections-at-the-webs-surface-session-102/</guid>
		<description><![CDATA[The official title of Session 102 was We&#8217;re Not the Destination, We&#8217;re the Journey: Revealing Archival Collections at the Web&#8217;s Surface. If you attended this session or don&#8217;t want to read through the details, you can skip to the end and just read my thoughts on this session. California Digital Library The first presentation was [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2008/09/02/saa2008-revealing-archival-collections-at-the-webs-surface-session-102/">SAA2008: Revealing Archival Collections at the Web&#8217;s Surface (Session 102)</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a href="http://content.cdlib.org/ark:/13030/kt1w10180b/?brand=calisphere" title="Calisphere: Vigilance Commitee Warning"></a></p>
<p style="text-align: center"><a href="http://content.cdlib.org/ark:/13030/kt1w10180b/?brand=calisphere" title="Calisphere: Vigilance Commitee Warning"><img src="http://www.spellboundblog.com/wp-content/uploads/2008/09/vigilance-committee.jpg" alt="Calisphere: Vigilance Committee Warning, April 1906" width="344" height="265" /></a></p>
<p>The official title of Session 102 was <a href="http://www.ibiblio.org/saawiki/2008/index.php/Session:_102:_%E2%80%9CWe%E2%80%99re_Not_the_Destination%2C_We%E2%80%99re_the_Journey%E2%80%9D:_Revealing_Archival_Collections_at_the_Web%E2%80%99s_Surface" title="Session 102: We're Not the Destination, We're the Journey">We&#8217;re Not the Destination, We&#8217;re the Journey: Revealing Archival Collections at the Web&#8217;s Surface.</a> If you attended this session or don&#8217;t want to read through the details, you can skip to the end and just read <a href="http://www.spellboundblog.com/2008/09/02/saa2008-revealing-archival-collections-at-the-webs-surface-session-102/#mythoughts" title="My thoughts on session 102">my thoughts on this session</a>.</p>
<p><strong>California Digital Library</strong></p>
<p>The first presentation was by <a href="http://www.universityofcalifornia.edu/cgi-bin/ph_mod.pl?Qname=Lena+Zentall" title="Lena Zentall">Lena Zentall</a> of the <a href="http://www.cdlib.org/" title="California Digital Library">California Digital Library</a> (CDL). I believe it was titled something like &#8220;Untitled &lt;snappy name here&gt;&#8221;. CDL is increasing visibility of primary sources by targeting primary sources to specific audiences. Lena described how they view the URL as a line to reel in new audiences. She started with an overview of how archival content traditionally makes its way online.</p>
<p>Start with a box -&gt; described by finding aids -&gt; digital copies of finding aids put on line and cherry picked individual items are digitized to be featured online.</p>
<p><strong>Two Audiences, Two Sites</strong></p>
<p>CDL has taken a new approach. They have two sites for two very different audiences:</p>
<ul>
<li><a href="http://www.oac.cdlib.org/">Online Archive of California</a> (OAC): presents both finding aids and digitized primary sources and targets archivists, historians &amp; researchers</li>
<li><a href="http://www.calisphere.universityofcalifornia.edu/">Calisphere</a> &#8211; only takes primary sources (for now) and targets k-12 teachers, lifelong learners, and undergraduates</li>
</ul>
<p>Collections can have home in several places. For example, the items about the Chinese in California can be found in:</p>
<ul>
<li>OAC: <a href="http://content.cdlib.org/view?docId=kt5p3019m2&amp;doc.view=items" title="OAC: Chinese in California">Guide to the Chinese in California Virtual Collection</a></li>
<li>Calisphere: As a subset of the <a href="http://www.calisphere.universityofcalifornia.edu/calcultures/ethnic_groups/ethnic2.html" title="Calisphere: California Cultures - Asian Americans">California Cultures: Asian Americans</a> collection, including the <a href="http://www.calisphere.universityofcalifornia.edu/themed_collections/subtopic2a.html" title="Calisphere: Chinese Exclusion Act">Chinese Exclusion Act</a></li>
<li>Library of Congress American Memory: <a href="http://lcweb2.loc.gov/ammem/award99/cubhtml/" title="Library of Congress American Memory: The Chinese in California">The Chinese in California, 1850-1925</a></li>
</ul>
<p>Calisphere has created themed collections to highlight superstar digital objects. They pull images out of the finding aids and rearrange them for the target audience. These images are hand picked and associated with an essay. They pick striking objects with good metadata. This is what their audience wants &#8211; the teachers asked for it. Another example themed collection is the <a href="http://www.calisphere.universityofcalifornia.edu/themed_collections/subtopic1a.html" title="Calisphere: Goldrush - Murder &amp; Mayhem">Goldrush Murder &amp; Mayhem</a> collection which includes this photo of the &#8220;old time San Francisco pickpocket&#8221; <a href="http://content.cdlib.org/ark:/13030/tf22900704/" title="Calisphere: Jennie Hastings">Jennie Hastings</a>.</p>
<p><strong>Hidden Gems: Untitled and No Metadata</strong></p>
<p>The next part of the presentation discussed what happens to items that are untitled and associated with no metadata. Lena showed us the results when you <a href="http://content.cdlib.org/search?style=oac-img&amp;facet-type-tab-join=or&amp;facet-type-tab=image+cartographic+mixed&amp;fieldList=text+keywords+title+description&amp;keyword=untitled" title="OAC: search for untitled">searched the OAC images for for untitled.</a> I found 12,315 items when I did this search. They really only live in the context of the finding aid. Of course the challenge is that people use words to find images. These hidden gems can be helped by inheriting the metadata of their parent container (such as collection level information) when there is nothing else.</p>
<p><strong>3 Approaches</strong></p>
<ol>
<li><span style="text-decoration: underline">Digitize and release content to the web:</span> low effort (after infrastructure is set up), very high return on investment. Over 40% of Calisphere traffic generated by google searches&#8230; but when users follow the link from google then they find the rich context.</li>
<li>     <span style="text-decoration: underline">Align with other aggregators:</span> &#8211; low/medium effort, medium return. Calipshere content is also being pulled into aggregators. They can also pull back new data that is added by 3rd party partners &#8211; such as reading level added on a teacher site. These are three examples of Murder and Mayhem content in three different partner sites:
<ul>
<li><a href="http://www.clrn.org/weblinks/details.cfm?id=1645" title="CLRN: Murder &amp; Mayhem">CLRN: Murder and Mayhem</a> &#8211; (California Learning Resources Network)</li>
<li><a href="http://www.k12hsn.org/edzone/media/gallery.php/images/18481865-gold-rush-era-murder-and-mayhem-3" title="EdZone: Murder &amp; Mayhem">EdZone: Murder &amp; Mayhem</a></li>
<li><a href="http://www.oercommons.org/libraries/murder-and-mayhem" title="OER: Murder and Mayhem">OER: Murder and Mayhem</a> (Open Educational Resources) lets users add tags and search by keywords</li>
</ul>
</li>
<li><span style="text-decoration: underline">Cherry-picking the best items:</span> high effort, promising returns &#8211; but it is also harder to measure the returns</li>
</ol>
<p><strong>Finding New Audiences and New Volunteers</strong></p>
<p>The next step is to reach beyond standard cultural and education venues and move into different ares of the internet. For example, the CDL added links to <a href="http://www.wikipedia.org/" title="Wikipedia">Wikipedia</a>. The perception of those involved with this effort was that it was a very convoluted process with lots of mysterious rules. They were unsure if the links would remain in place. It sometimes seemed like a lot of work when the links might just be removed. They added 33 links and found 53 links made by others not affiliated with the CDL. On the plus side, links like this puts the digital objects in a very specific context. Traffic initiated from these Wikipedia entries is almost certainly individuals seeking detailed information in the specific topic they are researching.</p>
<p>The next frontier involves blogs. CDL digital items are now featured in blogs, but soon CDL will be creating a blog for Calisphere to tell the story behind individual pictures. The final stop for this talk was an inspirational blog: <a href="http://mustachesofthenineteenthcentury.blogspot.com/" title="Mustaches of the 19th Century">Mustaches of the Nineteenth Century</a>. This blog was presented as a way to achieve the fame that primary sources dream about.</p>
<p><strong>Library of Congress</strong></p>
<p><strong><span style="font-weight: normal">The second presentation, by Helena Zinkham from the L<a href="http://www.loc.gov/rr/print/" title="LOC Prints and Photographs Division">ibrary of Congress Prints and Photographs division</a>, was titled &#8220;The New Friends for Old Photos &#8211; putting pictures in your path with the Flickr commons and Web 2.0&#8243;. This talk focused on the pilot project of putting Library of Congress photos on Flickr in the new <a href="http://www.flickr.com/commons" title="The Flickr Commons">Flickr Commons</a>.</span></strong></p>
<p>People who want photos don&#8217;t think of libraries or archives. They go to museums and stock photo agencies. Helena wants to help people realize that archives are a great source of images.</p>
<p>There has been increasing progress with hidden collections. Lots of digitization and work with metadata has been done to help items make their way online. But this begs the question of whether we are just creating new hidden collections in corners of the Internet that the average person will never come in contact with. Collections like <a href="http://archivegrid.org/" title="ArchiveGrid">ArchiveGrid</a>, <a href="http://www.diglib.org/aquifer/" title="DLF Aquifer">DLF Aquifer</a>, and <a href="http://www.oac.cdlib.org/" title="OAC">OAC</a>. The descriptions need to get out of the catalogs &#8211; most people find content on the web.. we need to put the images on the web in the path of the users.</p>
<p>The Flickr commons satisfied Helena&#8217;s desire to pull people in from Flickr back to discover the catalog world of archives. Flickr can be considered a virtual reading room and platform for a virtual volunteer corp. Helena showed the example of the image <a href="http://www.flickr.com/photos/library_of_congress/2163450764/" title="LOC: Weavers at Work">Weavers at Work</a>. The comments on this photo included:</p>
<ul>
<li>information that photo is of blind women weaving rugs</li>
<li>the photographer&#8217;s great grandchild identified the photographer as Percy Byron</li>
<li>the start of a discussion about what the cabinet or instrument might be shown to the far right of the photo</li>
</ul>
<p>These commenters are new friends worth making!</p>
<p><strong>Pros of Web 2.0</strong></p>
<ul>
<li>make collection available</li>
<li>gain information about collections &#8211; participatory description</li>
<li>increase the visibility of specific photos</li>
<li>win support for cultural heritage organizations</li>
</ul>
<p><strong>Risks of Web 2.0</strong></p>
<ul>
<li>disrespect for collections (smart aleck chat)</li>
<li>loss of meaning</li>
<li>reduce revenue from photo sales</li>
<li>excludes undigitized collections</li>
<li>higher costs (more money and time)</li>
<li>less chance for us to have fun as history detectives &#8211; other people are doing &#8216;our&#8217; work</li>
</ul>
<p>read powerhouse museums&#8217; 3 month report about their experience. &#8230; Helena will post info about the nuts and bolts on the SAA site, but she also directed the audience to <a href="http://www.powerhousemuseum.com/dmsblog/index.php/2008/07/21/commons-on-flickr-a-report-some-concepts-and-an-faq-the-first-3-months-from-the-powerhouse-museum/" title="Powerhouse Museum: Commons on Flicker - a report">Powerhouse Museum&#8217;s Commons on Flickr First 3 Months Report.</a></p>
<p><strong>Flickr Basics</strong></p>
<p>Helena asked the session attendees who was familiar with flicker? Most of the room raised their hands. Who has accounts? Still good number. Who is adding archival content? A sprinkling of hands were raised.</p>
<p>Helena then explores Flickr basics and showed off the following neat search examples:</p>
<ul>
<li>A search for <a href="http://www.google.com/search?q=germany+schaefer" title="Google Search: germany schaefer">germany schaefer in Google</a> finds Flickr photos (as well as Flickr photo comments). The <a href="http://www.flickr.com/photos/library_of_congress/2162646403/" title="LOC: Flickr Germany Schaefer Photo">LOC Germany Schaefer photo</a> was returned 4th on my list when I did the search when writing up this post.</li>
<li>A search for <a href="http://flickr.com/search/?q=houston+house" title="Flickr Search: Houston House">houston house</a> search within Flickr co-mingles old and current photos</li>
</ul>
<p><strong>Logistics and Statistics</strong></p>
<p>The LOC liked Flickr and felt it was a good fit because photographs are the main focus of the site. They did need one big change. Because LOC is not the owner or photographer (unlike most photo contributors), they needed a way to express that clearly. Flickr responded by creating <a href="http://flickr.com/commons" title="Flickr: The Commons">The Commons</a>. They also created a new rights statement of &#8216;no known copyright restrictions&#8217; for members of The Commons to use. This is different from public domain. Flickr also appears (based on my hunt through the links) to permit each institutions in The Commons to link to their own explanation about what they mean by &#8216;no known copyright restrictions&#8217;. LOC deep links to a specific section of their <a href="http://www.loc.gov/rr/print/195_copr.html#noknown" title="LOC: Assessing the risk of using a P&amp;P image">Copyright and Other Restrictions page for Prints &amp; Photographs</a>. George Eastman House has a special <a href="http://www.eastmanhouse.org/flickrstatement.php" title="George Eastman House &amp; The Commons on Flickr">George Eastman House &amp; The Commons on Flickr</a> page about copyright, as does the <a href="http://www.brooklynmuseum.org/flickr_commons.php" title="About Brooklyn Museum and The Commons on Flickr">Brooklyn Museum</a>.</p>
<p>Statistics from the first 6 moths on Flickr:</p>
<ul>
<li>3,500 LOC photos posted</li>
<li>8 million views</li>
<li>30,000 favorites for 80% of the photos</li>
<li>14,000 Flickr members made LOC a contact</li>
<li>5,000 comments (3,300 people)</li>
<li>12,500 unique tags (59,000 total)</li>
<li>500 catalog records updated &#8211; Helena indicated that this could be considered a new kind of backlog, &#8220;but a backlog you can come to like&#8221;</li>
<li>20% increased traffic to p&amp;p online catalog</li>
</ul>
<p>There are 30,000 more photos from <a href="http://www.loc.gov/rr/print/coll/022.html" title="George Grantham Bain Collection">Bain News Service</a> on the way, but they are only adding fifty photos a week. This number was recommended by Flickr as the largest they would want to push at any one time. This goes back to the tolerance of people who have Flickr in their friend photo stream. Fifty photos is about as many as people want to get at any one time. More than that and you increase the likelihood that people would remove you from their stream instead of be overwhelmed. They would have no chance to really look at more than that.</p>
<p>Contributors to The Commons can choose which features to enable. For example, the <a href="http://flickr.com/photos/george_eastman_house/2720795144/" title="Flickr: Portrait of Hine">Portrait of Hine as small child standing by drum</a> shows how george eastman house chooses to send people back to their institution for prints.</p>
<p>How much does it cost?</p>
<ul>
<li>a Flickr pro account costs $24.95 a year</li>
<li>digitization costs</li>
<li>time: daily moderation on the account &#8211; LOC checks every day for uncivil discourse which takes about 10 minutes</li>
<li>15-20 hours a week to pull data from comments to update metadata</li>
</ul>
<p><strong>Flickr Comments</strong></p>
<p>One of the greatest parts of this presentation was the examination of ways in which flicker users contributed through comments. Here are some examples:</p>
<ul>
<li><a href="http://flickr.com/photos/library_of_congress/2477625961/" title="Flickr Commons: Auto Polo (LOC)">Auto Polo</a>: &#8211; comment includes link to an a<a href="http://www.jalopyjournal.com/forum/showthread.php?t=245777" title="Jalopy Journal: Auto Polo">uto polo thread on the Jalopy Journal&#8217;s message board</a> which includes newspaper images and an extended discussion.</li>
<li><a href="http://flickr.com/photos/library_of_congress/2178249475/" title="Flickr Commons: Sylvia Sweets Tea Room (LOC)">Sylvia Sweets Tea Room</a> &#8211; includes a very extensive history of the business added by the daughter of the original proprietor</li>
<li><a href="http://flickr.com/photos/library_of_congress/2179172498/" title="Flickr Commons: Negro boy near Cincinnati, Ohio (LOC)">Negro boy near Cincinnati, Ohio</a> &#8211; the comments include a deep conversation about the title of the photo and the context of this title at the time it was taken (1942 or 1943).</li>
<li><a href="http://flickr.com/photos/library_of_congress/2163452890/" title="Flickr Commons: Jones Barn where dynamite was found (LOC)">Jones Barn where dynamite was found</a> &#8211; Flickr members found the context and news article to go with this photo</li>
<li><a href="http://flickr.com/photos/library_of_congress/2459827892/" title="Flickr Commons: Al Palzer (LOC)">Al Palzer</a> &#8211; this photo&#8217;s original title was Al Palser &#8211; but the misspelling was pointed out in the comments. The comments also include a response from the LOC noting that the boxer&#8217;s name would be updated in the original catalog record.</li>
</ul>
<p><strong>Other Promotion Approaches</strong></p>
<p>The Library of Congress has now started linking out from the LOC catalog entries to the Flickr image so that it is easy for users to discover any conversations associated with the Flickr version. <a href="http://flickr.com/photos/powerhouse_museum/" title="Flickr Commons: Powerhouse Museum">Powerhouse museum</a> has a <a href="http://www.powerhousemuseum.com/imageservices/" title="Powerhouse Museum Photo of the Day">Photo of the Day blog</a> to highlight images from their collection. The <a href="http://www.brooklynmuseum.org/" title="Brooklyn Museum">Brooklyn Museum</a> encourages people to upload photos of things happening in Brooklyn. Then and now photos can be taken &#8211; in this case see <a href="http://flickr.com/photos/library_of_congress/2179041336/" title="Flickr Commons: Factory buildings in Lowell, MA 1940/41">factory buildings in Lowell, Massachusetts in December 1940/January 1941</a> and then again in <a href="http://www.flickr.com/photos/pcarlson/2209255147/" title="Flickr: factory buildings in Lowell, MA January 2008">January of 2008</a>.</p>
<p>The key to 2.0 is frequent, new content and interaction from archival staff. Helena is open to new ideas about how to use Flickr and closed with saying that Web 2.0 is right in our path.</p>
<p><strong>Questions and Answers</strong></p>
<p><strong>Question:</strong> What is their view of the accuracy/inaccuracy user generated tags and comments?</p>
<p><strong>Answer:</strong> Study done in the past comparing accuracy of official cataloging to comments &#8211; even if people make mistakes, but others will correct them.. LOC has a &#8216;hands off&#8217; policy to not delete/change stuff unless it is defamatory or spam. Only 3 instances of this so far. LOC is citing the source as &#8216;Flickr commons&#8217; and also include commenters&#8217; sources &#8211; which are actually a lot more varied than you might expect (like the Jalopy Journal).</p>
<p><strong>Question:</strong> Are you worried about an increase demand in staff time as you add more photos?</p>
<p><strong>Answer:</strong> Yes.. there will be an increase in demand.. but the Flickr comments are there and since LOC is adding links back out to those records they are available for researchers even if they are not added to the original catalog record. Maybe they need more staff? depends on goals. Could work with expert teams and look for &#8216;formal trusted&#8217; volunteers. A great example was the baseball history association who took photos and contributed expert information in a spreadsheet (if I heard correctly they gave LOC a spreadsheet identifying team, game, date and opponent for more than 3000 photos).</p>
<p><strong>Question:</strong> Isn&#8217;t the link from the LOC catalog record to Flickr enough? Why update the LOC catalog records at all?</p>
<p><strong>Answer:</strong> They are really only updating when it is a mistake (like Palser&#8217;s name mistake). Flickr also provides APIs and LOC pulls all the comments and tags into external database so that LOC can choose how to use the information over time.</p>
<p><strong>Question:</strong> What are your thoughts and concerns about the longevity of Flickr as a platform?</p>
<p><strong>Answer:</strong> What grows fast can die fast. Their perspective: Flickr is a copy.. and LOC has an extract of all the tags and comments &#8211; nothing lost if it disappears.</p>
<p><strong>Question:</strong> Calipshere: how do they work with teachers to learn their needs and their satisfaction with the work that is done?</p>
<p><strong>Answer:</strong> They hired Berkeley experts to talk to teachers about what they wanted. They used interviews and created personas to capture the audience needs. Targeting the K-12 audience was aimed at being a success by being clear about their audience. Teachers used to print out images, but now they do more with powerpoint and iPods plugged into TV in the classroom. The teachers say they are happy with the theme collections and they want more. They have an advisory board with teachers.. they use surveys and watch the bboards.</p>
<p><strong>Question:</strong> Is there a crossover between Calishpere and OAC users?</p>
<p><strong>Answer:</strong> They almost didn&#8217;t cross link to the finding aids from within Calisphere.. but they decided the information was so important. Reason for the upcoming blog &#8211; want to tell the story behind the photos.</p>
<p><strong>Question:</strong> Do they have anlytics/evidence of pulling people back to their sites?</p>
<p><strong>Answer:</strong> Yes.. they can see increases in usage from everything they have done.</p>
<p><strong>Question:</strong> When you download the comments &#8211; are they dated so you can only look at the new ones? How hard was it to change the title in your catalog?</p>
<p><strong>Answer:</strong> Everything is time/date stamped when you pull info out of Flickr. Quick and easy to update.. 10 minutes per picture to do the updates.. Flickr members are doing a great job with citations.</p>
<p><strong>Question:</strong> Do you have advice about how to get historical society folks who are concerned about loosing the admission fee for people coming in to do research on board with these web 2.0 approaches?</p>
<p><strong>Answer:</strong> You show them alternative revenue streams. In the museum world .. they realized that they weren&#8217;t making money from reproductions and a change is in process to let people use images for publishing.. all about improving the brand recognition. Helena: I would love ideas from people using Flickr.. and to hear from people who are dealing with multiple audiences.</p>
<p><strong>Question:</strong> Have you had complaints? Any specifically from copyright holders?</p>
<p><strong>Answer:</strong> Yes.. they have had complaints.. one &#8220;Why haven&#8217;t you cleaned up the photos?&#8221; LOC position is to provide the version they have.. and it is up to others to cleanup and do what they like with the photos. They also point out that instead of perfecting photos, they are spending money on providing access to more photos.</p>
<p><strong>Question:</strong> Expectations of service. Are people expecting that if they ask a question about a photo that they will get an answer from a LOC representative?</p>
<p><strong>Answer:</strong> Do you have to respond to everyone who asked to be a contact? No.. perhaps different expectations for institutions. They currently add a comment when they are updating the original catalog records. Might acknoledge big contributors (more than 10 photos) at the end of the pilot via a direct e-mail to individuals.</p>
<p><strong>Question:</strong> Have people complained about rights &#8211; that is my grandmother.. don&#8217;t put it on the web?</p>
<p><strong>Answer:</strong> No. They do have a policy in place. Most people are &#8216;pleased as punch&#8217; to learn that their family heritage is alive and well. OAC: They haven&#8217;t had anyone ask to take the content down. In the case that people provide feedback for updates &#8211; since OAC is an aggregation of items from so many institutions &#8211; they have to pass corrections info along to original keeper of the metadata and leave it in their hands to do updates.</p>
<p><strong>Question:</strong> Is there a fear that interest will decrease as more photos are added to the commons?</p>
<p><strong>Answer:</strong> Bloggers in the web were in love with the idea that the photos would go into Flickr. There was a big peak at the start &#8211; but views and comments are still steady (but smaller) . The more additions.. more communities that will be touched. The Powerhouse Museum experienced a tripling of their traffic after posting images in the Flickr Commons.</p>
<p><strong>Question:</strong> Have people come into the reading room because of the Flickr pilot?</p>
<p><strong>Answer:</strong> Maybe? We don&#8217;t know. Lena said she did!</p>
<p><strong>Question:</strong> Are we teaching the teachers how to teach with photos?</p>
<p><strong>Answer:</strong> Calisphere has provided links to info about using primary sources and analysis tools.. resources for teachers. (Follow-up: Are they clicking those links? Good question!)</p>
<p><strong>Question:</strong> Are you contacting the people who post negative comments?</p>
<p><strong>Answer:</strong> Yes.. and most of them were more spam.</p>
<h2 id="mythoughts">My Thoughts</h2>
<p><strong><em>Culture of Online Communities</em></strong></p>
<p>There are a few different ideas I wanted to share related to the material from this presentation. First, I noticed that the online culture of both Flickr and Wikipedia were called out as having a clear impact. They are in fact two very different communities. In the case of the LOC and Flickr we heard that part of what seemed to keep the comments constructive and friendly was that Flickr&#8217;s users strive to keep a &#8216;play nice&#8217; atmosphere in place. In contrast, we heard that Wikipedia was perceived as confusing and unpredictable when the CDL staff was updating pages to add links back to their primary sources. They never felt certain that the links they were working so hard to add wouldn&#8217;t be removed the next day.</p>
<p>These are just two examples of ways in which the archival community is beginning to bump into various online communities. We need to really understand the cultural rules for each of the communities in which we want to participate. Another excellent example of this was the revelation that LOC should only upload 50 new images a week into Flickr because of the way in which users view new images uploaded by their friends. It would be unfortunate for LOC to loose many of its Flickr friends because it overwhelmed their Flickr feeds with 1,000 images.</p>
<p><strong><em>Personas: Targeting Real People</em></strong></p>
<p>I was also very pleased to hear Lena discuss the creation of personas to define and target the audiences they want to serve. If you want to listen to a great presentation on personas &#8211; give a listen to the <a href="http://www.iasummit.org/2008/" title="IA Summit 2008">IA Summit 2008</a> presentation <a href="http://boxesandarrows.com/view/ia-summit-2008-day-3" title="IA Summit 2008: Data driven design research personas">Data driven design research personas</a> (2nd podcast down on the page) while going though the <a href="http://www.slideshare.net/toddwarfel/data-driven-personas-summit-08" title="IA Summit 2008: Slides for Data driven design research personas">presentation slides up on slideshare</a>. I promise it is a very accessible talk (ie, low on jargon and tech &#8211; high on real life examples) and very worth your time. It was one of the best sessions I saw at that conference.</p>
<p><strong><em>Finding Images Without Words</em></strong></p>
<p>While today it <em>is</em> generally true that people must use words to find images &#8211; someday people will be able to use images to find images. An example of this work in progress is an experimental service named <a href="http://labs.systemone.at/retrievr/about" title="retrievr - search by sketch, search by image">retrievr</a>. You can already use this tool to search for Flickr images either by uploading an image or by creating a sketch you want to match. Another <a href="http://www.xcavator.net/Photo-Search#" title="Xcavator.net: Search Stock Photography">interesting image search interface is found over on Xcavator.net</a>. You pick a photo as your starting point &#8211; and then you can even trace a subsection of the image to be used for subsequent image matching. We are not there yet &#8211; but we will be someday. I can only image the number of Untitled images that will finally be found!</p>
<p><strong><em>Vigilance</em></strong></p>
<p>Your reward for reading this far is discovering my rationale for using the image I included at the top of this post. I think that many people are worried that we must be like the San Jose Vigilance Committee of 1906 &#8211; on our guard to stop people from stealing images from cultural heritage institutions when they are posted online. I would argue that the two projects described in this session show the benefits of a more open attitude. The Internet isn&#8217;t the wild west anymore. We should stop treating it that way. We don&#8217;t need Vigilance Committees online &#8211; we need ambassadors, interpreters and brave pioneers like Lena, Helena and the amazing teams of people who made the projects they described come to life.</p>
<p><em>Image credit: History San Jose Research Library via <a href="http://content.cdlib.org/ark:/13030/kt1w10180b/?brand=calisphere" title="Calisphere: Vigilance Commitee Warning">Calisphere</a>.</em></p>
<p><em>As is the case with all my session summaries from SAA2008, please accept my apologies in advance for any cases in which I misquote, overly simplify or miss points altogether in the post above. These sessions move fast and my main goal is to capture the core of the ideas presented and exchanged. Feel free to contact me about corrections to my summary either via comments on this post or via my <a href="http://www.spellboundblog.com/contact/" title="Contact Jeanne">contact form</a>.</em></p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2008/09/02/saa2008-revealing-archival-collections-at-the-webs-surface-session-102/">SAA2008: Revealing Archival Collections at the Web&#8217;s Surface (Session 102)</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2008/09/02/saa2008-revealing-archival-collections-at-the-webs-surface-session-102/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Freebase Parallax Search Interface: Exploring Olympic Games Facts</title>
		<link>http://www.spellboundblog.com/2008/08/16/freebase-parallax-search-olympic-games-facts/</link>
		<comments>http://www.spellboundblog.com/2008/08/16/freebase-parallax-search-olympic-games-facts/#comments</comments>
		<pubDate>Sat, 16 Aug 2008 05:05:50 +0000</pubDate>
		<dc:creator>Jeanne</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[information visualization]]></category>
		<category><![CDATA[interface design]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://www.spellboundblog.com/2008/08/16/freebase-parallax-search-olympic-games-facts/</guid>
		<description><![CDATA[Well-formed data posted about a new Freebase project named Parallax. This new search interface takes faceted browsing another step &#8211; in this case making it easy to jump sideways from one dataset to another related dataset. Parallax still includes filters on the left side &#8211; but the twist comes from the opportunity to select what [...]<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2008/08/16/freebase-parallax-search-olympic-games-facts/">Freebase Parallax Search Interface: Exploring Olympic Games Facts</a></p>
]]></description>
			<content:encoded><![CDATA[<p><a href="http://well-formed-data.net/archives/153/parallax" title="Well-formed data post">Well-formed data posted</a> about a new <a href="http://www.freebase.com/" title="Freebase">Freebase</a> project named <a href="http://mqlx.com/~david/parallax/index.html" title="Parallax">Parallax</a>. This new search interface takes <a href="http://en.wikipedia.org/wiki/Faceted_browser" title="Wikipedia: Faceted Browser">faceted browsing</a> another step &#8211; in this case making it easy to jump sideways from one dataset to another related dataset. Parallax still includes filters on the left side &#8211; but the twist comes from the opportunity to select what are called &#8216;Connections&#8217; from the list in the upper right hand corner of the search results page.</p>
<p>This sort of thing makes the most sense when you can see examples. The creator of Parallax has published a great little <a href="http://www.vimeo.com/1513562" title="Vimeo: Freebase Parallax">video tour</a>, but I also wanted to show you some neat data sets that were very easy to discover and embed in my blog. Since so many people are thinking about the Olympics right now, I thought I would start by exploring the <a href="http://mqlx.com/~david/parallax/browse.html?state=!((d:(t:/user/jg/default_domain/olympic_games),s:(v:!((c:ThumbnailView,s:())),vi:0)))" title="Parallax: Olympic Games Collection">Olympic Games Collection</a> from Freebase. Below I have two data sets. On the left you will see a list of Olympic Games &#8211; and on the right you will see a list of Olympic event venues. <em>(NOTE: to those reading this through a feed reader &#8211; you will likely have to click through to view the lists)</em></p>
<p><iframe src="http://mqlx.com/~david/parallax/thumbnail-view-embed-thumbnails.html?thumbsize=80&amp;query=%7B%22id%22%3Anull%2C%22limit%22%3A100%2C%22type%22%3A%22%2Fuser%2Fjg%2Fdefault_domain%2Folympic_games%22%7D" width="48%" height="300"></iframe><iframe src="http://mqlx.com/~david/parallax/thumbnail-view-embed-thumbnails.html?thumbsize=80&amp;query=%7B%22id%22%3Anull%2C%22limit%22%3A100%2C%22!%2Folympics%2Folympic_games%2Fvenues%22%3A%5B%7B%22type%22%3A%22%2Fuser%2Fjg%2Fdefault_domain%2Folympic_games%22%2C%22id%22%3Anull%2C%22name%22%3Anull%7D%5D%7D" width="48%" height="300"></iframe></p>
<p>Now lets take a real sidestep and pull up a list of sports teams who use a former Olympic facility as a venue. This is the sort of question that you could figure out on your own, but it would be a pain in the neck to do by hand. See the list on the left below which took just as long to create as it took me to spot that Team (venue) was on the list of &#8216;more connections&#8217; when my list of Olympic Venues was being displayed. The frame on the right below displays the one Olympic Venue that Freebase knows to have won an award (in this case the <a href="http://www.freebase.com/view/guid/9202a8c04000641f80000000086e5a54">Structural Special Award</a>).</p>
<p><iframe src="http://mqlx.com/~david/parallax/thumbnail-view-embed-tiles.html?thumbsize=50&amp;query=%7B%22id%22%3Anull%2C%22limit%22%3A100%2C%22!%2Fsports%2Fsports_facility%2Fteams%22%3A%5B%7B%22!%2Folympics%2Folympic_games%2Fvenues%22%3A%5B%7B%22type%22%3A%22%2Fuser%2Fjg%2Fdefault_domain%2Folympic_games%22%7D%5D%2C%22id%22%3Anull%2C%22name%22%3Anull%7D%5D%7D" width="48%" height="300"></iframe><iframe src="http://mqlx.com/~david/parallax/thumbnail-view-embed-thumbnails.html?thumbsize=80&amp;query=%7B%22id%22%3Anull%2C%22limit%22%3A100%2C%22!%2Faward%2Faward_honor%2Fhonored_for%22%3A%5B%7B%22!%2Faward%2Faward_winning_work%2Fawards_won%22%3A%5B%7B%22!%2Folympics%2Folympic_games%2Fvenues%22%3A%5B%7B%22type%22%3A%22%2Fuser%2Fjg%2Fdefault_domain%2Folympic_games%22%7D%5D%2C%22id%22%3Anull%2C%22name%22%3Anull%7D%5D%7D%5D%7D" width="48%" height="300"></iframe></p>
<p>Of course the lists above are only as good as the data behind them, but you can see how interesting it could be to use Parallax to explore connected information. Now take this idea to the world of archives and libraries, <a href="http://en.wikipedia.org/wiki/OPAC" title="Wikipedia: OPAC">OPACs</a> and finding aids and imagine the sorts of questions you can start asking. Yes &#8211; it does depend on the data being connected, but that is happening more and more all the time. The promise of the semantic web is structured data everywhere we turn.</p>
<p>Go play with Parallax. Look at <a href="http://mqlx.com/~david/parallax/browse.html?state=!((d:(t:/venture_capital/venture_funded_company),s:(v:!((c:ThumbnailView,s:())),vi:0)))" title="Parallax: Venture Funded Companies">Venture Funded Companies</a> and then look at all the <a href="http://mqlx.com/~david/parallax/browse.html?state=!((d:(t:/venture_capital/venture_funded_company),s:(v:!((c:ThumbnailView,s:())),vi:0)),(d:(l:'Games%20Published',p:!((f:!t,p:/cvg/cvg_publisher/games_published))),s:(v:!((c:ThumbnailView,s:())),vi:0)))" title="Parallax: Games Developed by Venture Funded Companies">Games Developed by those companies</a>. Examine the list of  <a href="http://mqlx.com/~david/parallax/browse.html?state=!((d:(t:/user/joshuamclark/default_domain/bird),s:(v:!((c:ThumbnailView,s:())),vi:0)))" title="Parallax: Bird Species">Bird Species</a> and then see what <a href="http://mqlx.com/~david/parallax/browse.html?state=!((d:(t:/user/joshuamclark/default_domain/bird),s:(v:!((c:ThumbnailView,s:())),vi:0)),(d:(l:School,p:!((f:!t,p:/education/school_mascot/school))),s:(v:!((c:ThumbnailView,s:())),vi:0)))" title="Parallax: Schools with Bird Mascots">schools have bird mascots</a>&#8230; and THEN see a <a href="http://mqlx.com/~david/parallax/browse.html?state=!((d:(t:/user/joshuamclark/default_domain/bird),s:(v:!((c:ThumbnailView,s:())),vi:0)),(d:(l:School,p:!((f:!t,p:/education/school_mascot/school))),s:(v:!((c:ThumbnailView,s:())),vi:0)),(d:(l:Person,p:!((f:!t,p:/business/employer/employees),(f:!t,p:/business/employment_tenure/person))),s:(v:!((c:ThumbnailView,s:())),vi:0)))" title="Parallax: People who Attended Schools with Bird Mascots">list of famous people who went to schools that have bird mascots</a>.</p>
<p>Put in your own search from the <a href="http://mqlx.com/~david/parallax/" title="Parallax">Parallax homepage</a> and play with the available connections.  Map and timeline views are also available &#8211; though they only work if your data includes location and temporal data, respectively. If you find a great sequence of data sets &#8211; please share them!</p>
<p>This post is from from: <a href="http://www.spellboundblog.com">Spellbound Blog</a>.<br/><br/><a href="http://www.spellboundblog.com/2008/08/16/freebase-parallax-search-olympic-games-facts/">Freebase Parallax Search Interface: Exploring Olympic Games Facts</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.spellboundblog.com/2008/08/16/freebase-parallax-search-olympic-games-facts/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

