<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Bitwacker Associates</title>
	<atom:link href="http://bitwacker.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://bitwacker.wordpress.com</link>
	<description>Applied Web Science</description>
	<lastBuildDate>Fri, 27 Jan 2012 18:57:58 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='bitwacker.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://1.gravatar.com/blavatar/1b5aadad4fd48e3eba0a568bb66f8e5f?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>Bitwacker Associates</title>
		<link>http://bitwacker.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://bitwacker.wordpress.com/osd.xml" title="Bitwacker Associates" />
	<atom:link rel='hub' href='http://bitwacker.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Elsevier/Tetherless World Health &amp; Life Sciences Hackathon (27-28 June 2011)</title>
		<link>http://bitwacker.wordpress.com/2011/06/20/elseviertetherless-world-health-life-sciences-hackathon-27-28-june-2011/</link>
		<comments>http://bitwacker.wordpress.com/2011/06/20/elseviertetherless-world-health-life-sciences-hackathon-27-28-june-2011/#comments</comments>
		<pubDate>Mon, 20 Jun 2011 12:53:57 +0000</pubDate>
		<dc:creator>John Erickson</dc:creator>
				<category><![CDATA[computer science]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[elsevier]]></category>
		<category><![CDATA[hackathon]]></category>
		<category><![CDATA[health]]></category>
		<category><![CDATA[lice sciences]]></category>
		<category><![CDATA[RPI]]></category>
		<category><![CDATA[TWCHack11]]></category>
		<category><![CDATA[TWCRPI]]></category>

		<guid isPermaLink="false">http://bitwacker.wordpress.com/?p=393</guid>
		<description><![CDATA[Create Apps; Win Prizes! The Tetherless World Constellation at RPI is pleased to announce that TWC and the SciVerse team at Elsevier are planning a Health and Life Sciences-themed, 24-hour hackathon to be held 27-28 June 2011. The event is sponsored by Elsevier and held at Pat&#8217;s Barn, on the campus of the Rensselaer Technology Park. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=393&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><strong><em>Create Apps; Win Prizes!</em></strong></p>
<p><a href="http://twcsciverse2011.eventbrite.com/"><img src="http://bitwacker.files.wordpress.com/2011/06/logo_blog.gif?w=500" alt="Elsevier/TWC Health &amp; Life Sciences Hackathon" title="Elsevier/TWC Health &amp; Life Sciences Hackathon"   class="alignleft size-full wp-image-394" /></a>The <a href="http://tw.rpi.edu">Tetherless World Constellation</a> at RPI is pleased to announce that TWC and the <a href="http://www.hub.sciverse.com/">SciVerse</a> team at Elsevier are planning a Health and Life Sciences-themed, <a title="TWC Elsevier Hackathon 2011" href="http://tw.rpi.edu/web/event/TWCElsevierHackathonJune2011" target="_blank">24-hour hackathon</a> to be held <strong>27-28 June 2011</strong>. The event is sponsored by Elsevier and held at <a href="http://www.rpitechpark.com/RensselaerTechnologyPark-PatsBarn.php">Pat&#8217;s Barn,</a> on the campus of the Rensselaer Technology Park.</p>
<p>After a short tutorial period by TWC RPI staff and distinguished guests, participants will compete with each other to develop Semantic Web mashups using <a href="http://linkeddata.org">linked data</a> from TWC and other sources, web APIs from Elsevier SciVerse, and visualization and other resources from around the Web. </p>
<p><strong><em>Prizes</em></strong><br />
The contest will encompass building apps utilizing the SciVerse API and other resources in multiple categories, including Health and Life Sciences and Open classes.  Overall, there will be three winners:</p>
<ul>
<li>First place:  $1500</li>
<li>Second place:  $1000</li>
<li>Third place:  $500</li>
</ul>
<p><strong><em>Judging</em></strong><br />
A distinguished panel of judges has assembled that includes domain experts, faculty and senior representatives from Elsevier:</p>
<ul>
<li>Paolo Ciccarese (Scientist and Senior Software Engineer, Mass General Hospital; Faculty, Harvard Medical School)
</li>
<li>Chris Baker (Research Chair, Innovatia)</li>
<li>Bob Powers (Semantics Engineer, Consultant at Predictive Medicine)
</li>
<li>M. Scott Marshall (Department of Medical Statistics and Bioinformatics, Leiden University Medical Center)</li>
<li>Ora Lassila (Principal Technologist, Nokia; co-author of the W3C RDF specification)</li>
<li>Elizabeth Brooks (Head of Computing &amp; IT, UHI, Scotland)</li>
<li>Hajo Oltmanns (Elsevier: SVP Health Sciences Strategy)</li>
<li>Scott Virkler (Elsevier: SVP e-Products Global Medical Research)</li>
<li>Helen Moran (Elsevier: VP Smart Content Strategy)</li>
</ul>
<p><strong><em>Refreshments</em></strong><br />
All attendees will be provided lunch, dinner, and midnight snack on 27 June and breakfast and lunch on 28 June.</p>
<p><strong><em>Travel Assistance</em></strong><br />
A small amount of <a href="http://tw.rpi.edu/web/event/TWCElsevierHackathonJune2011/Assistance">travel assistance</a> will be made available for students and non-profits on a <em>competitive basis</em>. Please see our <a href="http://tw.rpi.edu/web/event/TWCElsevierHackathonJune2011/Assistance">Travel Assistance</a> page or <a href="mailto:olyerickson@gmail.com">contact us</a> for further details.</p>
<p><strong><em>Travel and Lodging Information</em></strong><br />
See the <a title="&lt;a title=" href="http://tw.rpi.edu/web/event/TWCElsevierHackathonJune2011" target="_blank">Elsevier/Tetherless World Health and Life Sciences Hackathon web site</a> for specific information about <a href="http://tw.rpi.edu/web/event/TWCElsevierHackathonJune2011/Transportation">transportation</a> and <a href="http://tw.rpi.edu/web/event/TWCElsevierHackathonJune2011/Lodging">lodging</a> near the venue. <em>Please note that the Hackathon runs for <strong>24 hours,</strong> so it is unlikely that participants will want lodging on the night of 27 June&#8230;</em></p>
<p><strong><em>Contacts</em></strong><br />
Please browse to the <strong>Contacts</strong> area of the <a href="http://tw.rpi.edu/web/event/TWCElsevierHackathonJune2011/Contact" target="_blank">Elsevier/Tetherless World Health and Life Sciences Hackathon web site</a> or follow the EventBright <a href="http://www.eventbrite.com/contact-organizer?eid=1672248741">event organizer</a> link if you have questions. </p>
<p><strong><em>Follow us on Twitter!</em></strong><br />
The hash for this event is <a href="http://search.twitter.com/search?q=%23TWCHack11">#TWCHack11</a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bitwacker.wordpress.com/393/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bitwacker.wordpress.com/393/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bitwacker.wordpress.com/393/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bitwacker.wordpress.com/393/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/bitwacker.wordpress.com/393/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/bitwacker.wordpress.com/393/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/bitwacker.wordpress.com/393/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/bitwacker.wordpress.com/393/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bitwacker.wordpress.com/393/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bitwacker.wordpress.com/393/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bitwacker.wordpress.com/393/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bitwacker.wordpress.com/393/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bitwacker.wordpress.com/393/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bitwacker.wordpress.com/393/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=393&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://bitwacker.wordpress.com/2011/06/20/elseviertetherless-world-health-life-sciences-hackathon-27-28-june-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/e2ab144cff31bc669eebb5de34f7bfc9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">olyerickson</media:title>
		</media:content>

		<media:content url="http://bitwacker.files.wordpress.com/2011/06/logo_blog.gif" medium="image">
			<media:title type="html">Elsevier/TWC Health &#38; Life Sciences Hackathon</media:title>
		</media:content>
	</item>
		<item>
		<title>Energizing Innovation Research through Linked Open Patent Data</title>
		<link>http://bitwacker.wordpress.com/2011/05/31/energizing-innovation-research-through-linked-open-patent-data/</link>
		<comments>http://bitwacker.wordpress.com/2011/05/31/energizing-innovation-research-through-linked-open-patent-data/#comments</comments>
		<pubDate>Wed, 01 Jun 2011 02:35:51 +0000</pubDate>
		<dc:creator>John Erickson</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://bitwacker.wordpress.com/?p=376</guid>
		<description><![CDATA[Please note this is a DRAFT and may change throughout the day (1 June 2011) On June 17 I will be joining other researchers at a Patent Data Workshop jointly hosted by the USPTO and NSF at the U.S. Patent &#38; Trademark Office in Alexandria, VA. This workshop, supported by the USPTO Office of Chief [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=376&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><b><i>Please note this is a DRAFT and may change throughout the day (1 June 2011)</i></b></p>
<p><a href="http://bitwacker.files.wordpress.com/2011/05/uspto_logo.jpg"><img src="http://bitwacker.files.wordpress.com/2011/05/uspto_logo.jpg?w=150&#038;h=150" alt="" title="uspto_logo" width="150" height="150" class="alignleft size-thumbnail wp-image-387" /></a>On June 17 I will be joining other researchers at a <b>Patent Data Workshop</b> jointly hosted by the <a href="http://www.uspto.gov/">USPTO</a> and <a href="http://www.nsf.gov/">NSF</a> at the U.S. Patent &amp; Trademark Office in Alexandria, VA. This workshop, supported by the USPTO <a href="http://www.uspto.gov/ip/officechiefecon/index.jsp">Office of Chief Economist</a> and the <a href="http://scienceofsciencepolicy.net/">Science of Science and Innovation Policy Program</a> (SciSIP) at the NSF, will bring researchers together to share their ideas on how to facilitate the more efficient use of patent and trademark data, and ultimately to improve both the quantity and caliber of innovation policy scholarship. </p>
<p>The stated goals of this workshop include:</p>
<ol>
<li>Creating an <i>information exchange infrastructure</i> for both the production and informed evaluation of transparent, high-quality research into innovation;
<li>Promoting an <i>intellectual environment</i> particularly hospitable to high-impact quantitative studies;
<li>Creating a distinct <i>community</i> with well-developed research norms and cumulative influence; and
<li>Championing the <i>development of a platform</i> to support a robust body of empirical research into the economic and social consequences of innovation.
</ol>
<p>Each participant planning to attend this workshop has been asked to prepare a blog post that outlines (a) our understanding of the most significant theoretical or empirical challenges in this space, and/or (b) where the frontier of knowledge is, what innovative things are being done at the frontier &#8212; or within reach of being done to solve the set of problems &#8212; and where targeted funding could yield the highest payoffs in getting to solutions. The purpose of this post is to offer some of my thoughts based on progress made by <b>linked open government data</b> initiatives in the US and around the world.</p>
<p><b>Background: The Tetherless World and Linked Open Government Data</b><br />
<a href="http://bitwacker.files.wordpress.com/2011/05/tw-logo-v2.png"><img src="http://bitwacker.files.wordpress.com/2011/05/tw-logo-v2.png?w=500" alt="" title="tw-logo-v2"   class="alignleft size-full wp-image-386" /></a>Since early 2010 the <a href="http://tw.rpi.edu">Tetherless World Constellation</a> (TWC) at Rensselaer Polytechnic Institute has collaborated with the White House <a href="http://data.gov">Data.gov</a> team to make thousands of open government datasets more accessible for consumption by web-based applications and services, including mashups leveraging Semantic Web technologies. TWC has created an infrastructure, embodied by the <a href="http://logd.tw.rpi.edu">TWC LOGD Portal</a>, for automatically converting to RDF and enhancing government data published in tabular (e.g. CSV) format; publishing these converted datasets as downloadable &#8220;dump files&#8221; and through SPARQL endpoints; demonstrating highly effective methodologies for using such linked open government data assets as the basis for the agile creation of lightweight, powerful visualizations and other mashups. In addition to providing a searchable interface to thousands of converted Data.gov datasets, the TWC LOGD Portal publishes a growing set of demos and tutorials for use by the LOGD community. </p>
<p><a href="http://bitwacker.files.wordpress.com/2011/05/datagov_logo.png"><img src="http://bitwacker.files.wordpress.com/2011/05/datagov_logo.png?w=150&#038;h=39" alt="" title="datagov_logo" width="150" height="39" class="alignleft size-thumbnail wp-image-389" /></a>The Data.gov/TWC LOGD partnership and similar international LOGD efforts, especially the UK&#8217;s <a href="http://data.gov.uk">Data.gov.uk</a> initiative, have demonstrated the value and potential for innovation achieved by exposing government data using linked data principles. Indeed, the effective application of the linked data approach to a multitude of data sharing and integration challenges in commerce, industry and eScience has shown its promise as a basis for a more efficient, agile <i>research information exchange infrastructure.</i> </p>
<p><b>Recommendation: Create a &#8220;DBPedia&#8221; for Patent Data</b><br />
<img src="http://bitwacker.files.wordpress.com/2011/05/dbpedia_links.png?w=150&#038;h=121" alt="" title="dbpedia_links" width="150" height="121" class="alignleft size-medium wp-image-382" />The <a href="http://richard.cyganiak.de/2007/10/lod/imagemap.html">Linked Open Data Cloud</a> diagram famously illustrates the growing number of providers of linked open data around the world. Careful examination of the LOD Cloud shows that most sources are sparsely linked, and a very few &#8212; most notably, <a href="http://dbpedia.org">DBPedia.org</a>, are extremely heavily linked. The reason is that the Web of Data has increasingly adopted DBPedia as a reliable source or hub for canonical entity URIs. This means that as providers put their datasets online, they enhance their datasets by providing sameAs links to DBPedia URIs for named entities within these datasets. This enables their datasets to be easily linked to other datasets and increases their utility and value as the basis for visualizations and linked data mashups. </p>
<p>Providers embrace DBPedia&#8217;s URI conventions as &#8220;canonical&#8221; in order to make their datasets more easily adopted. Our objective with patent and trademark reference data and research information in general must be to break down barriers to its widespread use, recognizing that <i>we may have no idea how it may be used.</i> Linked data principles and the Web of Data emerging from them have re-written what it means to make data integration easy. Whereas even a few short years ago it was useful to simply provide a searchable patent database through a proprietary UI, next-generation innovation infrastructures will be based on globally interlinked graphs drive by <i>concept</i> and <i>descriptive metadata</i> extracted from patent records, research publications, business publications and indeed data from social networks. Scholars of innovation will traverse these graphs and mash them with other graphs in ways we cannot anticipate, and thus make serendipitous discoveries about the process of innovation we cannot predict today. </p>
<p>My <b>DBPedia</b> reference comes from the idea of identifying concepts and specific manifestations of innovation in the patent corpus. Consider an arbitrary patent disclosure; it can be represented as a graph of concepts and related manifestations. The infrastructure I&#8217;m proposing will enable the interlinking of URI-named concepts, not only with other patent records but also scientific literature, the financial and news media, social networks, etc. From a research standpoint, this will enable the study of the emergence, spread and influence on innovation in many dimensions. </p>
<p><b>Conclusions</b><br />
The USPTO has already made great strides in improving access to and understanding of patent and trademark data; an excellent example is the <a href="http://www.uspto.gov/about/stratplan/dashboards.jsp">Data Visualization Center</a> and specific data visualization tools such as the <a href="http://www.uspto.gov/dashboards/patents/main.dashxml">Patent Dashboard</a> which provides graphic summaries of USPTO activities. These are &#8220;canned apps,&#8221; however; the next generation of open government will require finer grained access to this data, presented as enhanced linked data and using open licensing principles. As USPTO datasets are presented in this way, researchers will be able to interlink this data with datasets from other sources, resulting in a more effective study of the causes of innovation and indeed the outcomes of government programs intended to stimulate innovation. </p>
<p><b>References</b></p>
<ol>
<li><a href="http://1.usa.gov/l5AOKh">NSF Patent Data Workshop</a>. NSF Award Abstract #1102468 (31 Jan 2011).
<li>Julia Lane, <a href="http://www.ostina.org/content/view/4218/1187/">The Science of Science and Innovation Policy (SciSIP) Program at the US National Science Foundation</a>. OST Bridges vol. 22 (July 2009)
</ol>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bitwacker.wordpress.com/376/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bitwacker.wordpress.com/376/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bitwacker.wordpress.com/376/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bitwacker.wordpress.com/376/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/bitwacker.wordpress.com/376/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/bitwacker.wordpress.com/376/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/bitwacker.wordpress.com/376/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/bitwacker.wordpress.com/376/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bitwacker.wordpress.com/376/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bitwacker.wordpress.com/376/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bitwacker.wordpress.com/376/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bitwacker.wordpress.com/376/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bitwacker.wordpress.com/376/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bitwacker.wordpress.com/376/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=376&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://bitwacker.wordpress.com/2011/05/31/energizing-innovation-research-through-linked-open-patent-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/e2ab144cff31bc669eebb5de34f7bfc9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">olyerickson</media:title>
		</media:content>

		<media:content url="http://bitwacker.files.wordpress.com/2011/05/uspto_logo.jpg?w=150" medium="image">
			<media:title type="html">uspto_logo</media:title>
		</media:content>

		<media:content url="http://bitwacker.files.wordpress.com/2011/05/tw-logo-v2.png" medium="image">
			<media:title type="html">tw-logo-v2</media:title>
		</media:content>

		<media:content url="http://bitwacker.files.wordpress.com/2011/05/datagov_logo.png?w=150" medium="image">
			<media:title type="html">datagov_logo</media:title>
		</media:content>

		<media:content url="http://bitwacker.files.wordpress.com/2011/05/dbpedia_links.png?w=300" medium="image">
			<media:title type="html">dbpedia_links</media:title>
		</media:content>
	</item>
		<item>
		<title>TWC LOGD Million Dataset Challenge</title>
		<link>http://bitwacker.wordpress.com/2011/02/11/twc-logd-million-dataset-challenge/</link>
		<comments>http://bitwacker.wordpress.com/2011/02/11/twc-logd-million-dataset-challenge/#comments</comments>
		<pubDate>Fri, 11 Feb 2011 19:01:52 +0000</pubDate>
		<dc:creator>John Erickson</dc:creator>
				<category><![CDATA[Big Ideas]]></category>
		<category><![CDATA[Data.gov]]></category>
		<category><![CDATA[government transparency]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[data.gov]]></category>
		<category><![CDATA[gov 2.0]]></category>
		<category><![CDATA[linked open data]]></category>
		<category><![CDATA[open government]]></category>
		<category><![CDATA[open government data]]></category>

		<guid isPermaLink="false">http://bitwacker.wordpress.com/?p=342</guid>
		<description><![CDATA[Many quotes have been attributed to Steve Jobs, but my favorite is the following: Set totally outrageous goals! Well, one of the more &#8220;outrageous&#8221; goals for the Linking Open Government Data project at the Tetherless World Constellation at RPI this term is to create the most comprehensive and useful catalog of open government datasets in [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=342&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Many quotes have been attributed to Steve Jobs, but my favorite is the following: </p>
<blockquote><p>Set totally outrageous goals!</p></blockquote>
<p>Well, one of the more &#8220;outrageous&#8221; goals for the <a href="http://logd.tw.rpi.edu">Linking Open Government Data</a> project at the <a href="http://tw.rpi.edu">Tetherless World Constellation</a> at RPI this term is to create the most comprehensive and useful catalog of open government datasets in the world. </p>
<p>To this end, I am challenging our students &#8212; and indeed <i>everyone</i> within earshot &#8212; to participate in what I&#8217;ve dubbed the <a href="http://bit.ly/g0d3tY">TWC LOGD Million Dataset Challenge</a>: I&#8217;m challenging you to help us create a master catalog of more than <b>1 million</b> open government datasets from around the world! In return, we&#8217;ll make the catalog publicly available through our <a href="http://logd.tw.rpi.edu">TWC LOGD Portal</a>, as RDF dumps and via a SPARQL endpoint.</p>
<p>To get this thing started and to make it as easy as possible, I&#8217;ve created a <a href="http://bit.ly/g0d3tY">Google Form-based interface</a>. Follow the link, add metadata, move on&#8230;</p>
<p>I&#8217;ve structured the form to accept both <i>catalog</i> and <i>individual dataset</i> entries. Just chose the right options in the form&#8230;</p>
<p>To submit a dataset: <a href="http://bit.ly/g0d3tY">http://bit.ly/g0d3tY</a></p>
<p>To view the current status (spreadsheet): <a href="http://bit.ly/eANqSg">http://bit.ly/eANqSg</a> Total (18 Feb 2011): More than <b>331,345</b> datasets</p>
<p>A few resources to get started:</p>
<ol>
<li>Worldwide <a href="http://ckan.net/package">Search CKAN: The Data Hub</a> <b>Prime source!</b></li>
<li>Guardian.co.uk&#8217;s <a href="http://www.guardian.co.uk/world-government-data">catalog of over 12K world government datasets</a> <b>Prime source!</b></li>
<li><a href="http://www.quora.com/Where-can-I-get-large-datasets-open-to-the-public">Where can I get large datasets open to the public?</a> (Quora) <b>Prime source!</b></li>
<li>DataMarket.com&#8217;s <a href="http://datamarket.com/data/">International Dataset Search page</a> <b>Prime source!</b></li>
<li>USA <a href="http://data.gov/">Data.gov</a> <a href="http://www.data.gov/catalog/raw">raw data catalog</a> <b>Prime source!</b></li>
<li>USA <a href="http://data.gov/">Data.gov</a> <a href="http://www.data.gov/catalog/geodata">geodata catalog</a> <b>Prime source!</b></li>
<li>UK <a href="http://data.gov.uk/">Data.gov.uk project</a> <b>Prime source!</b></li>
<li>Africa <a href="http://www.africover.org/system/africover_data.php">Africover datasets download</a> (multiple countries) <b>Prime source!</b></li>
<li><a href="http://apoikola.wordpress.com/2010/01/23/open-government-data-catalogs/">Blog listing many catalogs</a> <b>Prime source!</b></li>
<li><a href="http://bit.ly/fKSnWz">EU Official National Data Catalogs</a> <b>Prime source!</b></li>
<li>ePSI <a href="http://bit.ly/eeN8vM">Public Sector Information (PSI) Data Catalogues (by governments) </a> <b>Prime source!</b></li>
<li>Open Data Euskadi <a href="http://bit.ly/ij607H">International Catalog</a> (Based on eOSI list) <b>Prime source!</b></li>
<li>Open Knowledge <a href="http://lod2.okfn.org/eu-data-catalogues/">List of European Open Data Catalogues</a> <b>Prime source!</b></li>
<li>Civic Commons <a href="http://wiki.civiccommons.org/#Open_Data_Initiatives">List of Open Data Initiatives</a> <b>Prime source!</b></li>
<li>Univ. of Colorado (Boulder) Libraries <a href="http://ucblibraries.colorado.edu/govpubs/for/foreigngovt.htm">Foreign Information by Country</a> <b>Prime source!</b></li>
<li>Factual.com&#8217;s <a href="http://www.factual.com/topic/government">&#8220;comprehensive&#8221; repository of government data</a></li>
<li><a href="http://www.nysgis.state.ny.us/">New York State GIS Clearinghouse</a></li>
<li><a href="http://www.health.state.ny.us/statistics/">New York State Dept. of Health Statistics</a></li>
<li><a href="http://www.cde.ca.gov/ds/dc/">California Dept of Education data collections</a></li>
<li><a href="http://opengovernmentdata.org/catalogues/">OpenGovernmentData.org catalogs</a></li>
<li>GovLoop.com&#8217;s <a href="http://data.govloop.com/Government/List-of-Open-Gov-Plans/x46u-4d2e">List of Open Government Plans</a> (US federal agencies)</li>
<li><a href="http://opendata.socrata.com/">Socrata open datasets</a></li>
<li><a href="http://digitaliser.dk/ressourcer?tabContainerResources=tabDatakildeResources">Danish data catalog</a></li>
<li><a href="http://suomi.fi/datakatalogi">Finnish data catalog</a></li>
<li><a href="http://data.australia.gov.au/">Australian data catalog</a></li>
<li><a href="http://www.spaghettiopendata.org/dati">spaghettiopendata.org</a> Citizen-driven Italian Open Data site </li>
<li><a href="http://www.istat.it/">Istat.it</a> Italian government statistical data </li>
<li><a href="http://www.dati.piemonte.it/dati.html">Datasets from the Piedmont region of Italy</a></li>
<li><a href="http://data.cnr.it/site/">data.CNR.it</a> The Italian National Research Council site </li>
<li><a href="http://aws.amazon.com/publicdatasets/">Public Datasets</a> on <a href="http://aws.amazon.com/publicdatasets/">Amazon Web Services</a></li>
<li><a href="http://dnr.alaska.gov/SpatialUtility/SUC?cmd=vmd&amp;layerid=37">Alaska State GeoSpatial Data Clearinghouse</a> (Alaska DNR)</li>
<li><a href="https://sdbs.adb.org/sdbs/index.jsp">Asian Development Bank Statistical Database Service</a></li>
</ol>
<p><b>Note:</b> As the dataset grows we&#8217;ll improve both the data entry and catalog interface. Our immediate goal is to grow the list&#8230;</p>
<p><b>Note:</b> Watch for answers to this Quora question: <a href="http://b.qr.ae/i60MXZ">What is the most comprehensive list of international open government datasets?</a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bitwacker.wordpress.com/342/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bitwacker.wordpress.com/342/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bitwacker.wordpress.com/342/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bitwacker.wordpress.com/342/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/bitwacker.wordpress.com/342/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/bitwacker.wordpress.com/342/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/bitwacker.wordpress.com/342/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/bitwacker.wordpress.com/342/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bitwacker.wordpress.com/342/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bitwacker.wordpress.com/342/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bitwacker.wordpress.com/342/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bitwacker.wordpress.com/342/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bitwacker.wordpress.com/342/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bitwacker.wordpress.com/342/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=342&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://bitwacker.wordpress.com/2011/02/11/twc-logd-million-dataset-challenge/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/e2ab144cff31bc669eebb5de34f7bfc9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">olyerickson</media:title>
		</media:content>
	</item>
		<item>
		<title>&#8220;Falling down is part of LIFE&#8230;Getting back up is LIVING&#8221;</title>
		<link>http://bitwacker.wordpress.com/2011/01/19/falling-down-is-part-of-life-getting-back-up-is-living/</link>
		<comments>http://bitwacker.wordpress.com/2011/01/19/falling-down-is-part-of-life-getting-back-up-is-living/#comments</comments>
		<pubDate>Wed, 19 Jan 2011 15:38:22 +0000</pubDate>
		<dc:creator>John Erickson</dc:creator>
				<category><![CDATA[Big Ideas]]></category>
		<category><![CDATA[web science]]></category>
		<category><![CDATA[inspirational quotes]]></category>
		<category><![CDATA[meme tracking]]></category>
		<category><![CDATA[memes]]></category>
		<category><![CDATA[social networks]]></category>

		<guid isPermaLink="false">http://bitwacker.wordpress.com/?p=336</guid>
		<description><![CDATA[This inspirational quote is being re-posted around my networks today: There comes a time in life, when you walk away from all the drama and people who create it. You surround yourself with people who make you laugh, forget the bad, and focus on the good. So, love the people who treat you right. Pray [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=336&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>This inspirational quote is being re-posted around my networks today:</p>
<blockquote><p>
There comes a time in life, when you walk away from all the drama and people who create it. You surround yourself with people who make you laugh, forget the bad, and focus on the good. So, love the people who treat you right. Pray for the ones who don&#8217;t. Life is too short to be anything but happy. Falling down is part of LIFE&#8230;Getting back up is LIVING&#8230;&#8230;&#8230;Re-post if you agree; I just did
</p></blockquote>
<p>I&#8217;ve been trying to trace the origins of this meme using Google and focusing on the quote, <b>Falling down is part of LIFE&#8230;Getting back up is LIVING</b>; it seems to have been active on the Web for about a year, especially in &#8220;mommy blogs&#8221; and on Facebook. </p>
<p><b>Update:</b> </p>
<ul>
<li>My journey has taken me to <a href="http://knowyourmeme.com/">KnowYourMeme</a>, a site dedicated to tracking trends in Internet culture. </li>
<li>An oft-cited story on the tracking of political memes, <a href="http://www.nytimes.com/2009/07/13/technology/internet/13influence.html">Study Measures the Chatter of the News Cycle</a> described the 2009 paper by Jure Leskovecy, Lars Backstrom and Jon Kleinberg, <a href="http://www.cs.cornell.edu/home/kleinber/kdd09-quotes.pdf">Meme-tracking and the Dynamics of the News Cycle</a> (PDF).</li>
</ul>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bitwacker.wordpress.com/336/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bitwacker.wordpress.com/336/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bitwacker.wordpress.com/336/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bitwacker.wordpress.com/336/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/bitwacker.wordpress.com/336/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/bitwacker.wordpress.com/336/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/bitwacker.wordpress.com/336/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/bitwacker.wordpress.com/336/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bitwacker.wordpress.com/336/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bitwacker.wordpress.com/336/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bitwacker.wordpress.com/336/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bitwacker.wordpress.com/336/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bitwacker.wordpress.com/336/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bitwacker.wordpress.com/336/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=336&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://bitwacker.wordpress.com/2011/01/19/falling-down-is-part-of-life-getting-back-up-is-living/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/e2ab144cff31bc669eebb5de34f7bfc9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">olyerickson</media:title>
		</media:content>
	</item>
		<item>
		<title>Fall 2010 TWC-RPI Undergraduate Research Summaries</title>
		<link>http://bitwacker.wordpress.com/2010/12/20/fall-2010-twc-rpi-undergraduate-research-summaries/</link>
		<comments>http://bitwacker.wordpress.com/2010/12/20/fall-2010-twc-rpi-undergraduate-research-summaries/#comments</comments>
		<pubDate>Mon, 20 Dec 2010 14:56:35 +0000</pubDate>
		<dc:creator>John Erickson</dc:creator>
				<category><![CDATA[computer science]]></category>
		<category><![CDATA[Data.gov]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[web science]]></category>
		<category><![CDATA[data.gov]]></category>
		<category><![CDATA[Jim Hendler]]></category>
		<category><![CDATA[Rensselaer]]></category>
		<category><![CDATA[RPI]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[Tetherless World Constellation]]></category>
		<category><![CDATA[undergraduate research]]></category>

		<guid isPermaLink="false">http://bitwacker.wordpress.com/?p=280</guid>
		<description><![CDATA[The Fall 2010 semester marked the beginning of the Tetherless World Constellation&#8217;s undergraduate research program at Rensselaer Polytechnic Institute (RPI). Although TWC has enjoyed significant contributions from RPI undergrads since its inception, this term we stepped up our game by more &#8220;formally&#8221; incorporating a group of undergrads into TWC&#8217;s research programs, established regular meetings for [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=280&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The Fall 2010 semester marked the beginning of the <a href="http://tw.rpi.edu">Tetherless World Constellation&#8217;s</a> undergraduate research program at Rensselaer Polytechnic Institute (RPI). Although TWC has enjoyed significant contributions from RPI undergrads since its inception, this term we stepped up our game by more &#8220;formally&#8221; incorporating a group of undergrads into TWC&#8217;s research programs, established regular meetings for the group, and with input from the students began outfitting their own space in RPI&#8217;s <a href="http://tw.rpi.edu/wiki/Winslow_Building">Winslow Building</a>. </p>
<p><a href="http://allone.weblog.com/">Patrick West,</a> my fellow TWC undergrad research coordinator and I asked the students to blog about their work throughout the semester; with the end of term, we asked them to post summary descriptions of their work and their thoughts about the fledgling TWC undergrad research program itself. We&#8217;ve provided short summaries and links to those blogs below&#8230;</p>
<ul>
<li><a href="http://helmc2.wordpress.com/">Cameron Helm</a> began the term coming up to speed on SPARQL and RDF, experimented with several of the public TWC endpoints, and then worked with <a href="http://ngp2.wordpress.com/">Philip</a> on basic visualizations. He then slashed his way through the tutorials on TWC&#8217;s <a href="http://logd.tw.rpi.edi">LOGD Portal</a>, eventually creating impressive visualizations such as this <a href="http://www.cs.rpi.edu/~helmc2/USQuakes.html">earthquake map.</a> Cameron is very interested in the subject of data visualization and looks to do more work in this area in the future.</li>
<li>After a short TWC learning period, <a href="http://souzada.wordpress.com/">Dan Souza</a> began helping doctoral candidate <a href="http://www.evanpatton.com/">Evan Patton</a> create an Android version of the <a href="http://kcap09.stanford.edu/share/posterDemos/170/paper170.pdf">Mobile Wine Agent</a> application, with all the amazing visualization and data integration required, including Twitter and Facebook integration. Mid-semester Dan also responded to the call to help with the crash&#8221; development of the Android/iPhone <b>TalkTracker</b> app, in time for <a href="http://iswc2010.semanticweb.org/">ISWC 2010</a> in early November. Dan continues to work with Evan and others for early 2011 releases of Android, iPhone/iPad Touch and iPad versions of the Mobile Wine Agent. </li>
<li><a href="http://blog.smeirc.com/">David Molik</a> reports that he learned web coding skills, ontology creation, server installation and administration. David contributed to the development and operation of a test site for the new, semantic web savvy website for the Biological and Chemical Oceanography Data Management Office <a href="http://www.bco-dmo.org/">BCO-DMO</a> of the <a href="http://www.whoi.edu/">Woods Hole Oceanographic Institute</a>. </li>
<li><a href="http://chambj2.wordpress.com/">Jay Chamberlin</a> spent much of his time working on the <a href="http://opendap.org/">OPeNDAP Project</a>, an open source server to distribute scientific data that is stored in various formats. His involvement included everything from learning his way around the OPeNAP server, to working with infrastructure such as TWC&#8217;s LDAP services, to helping migrate documentation from the previous Wiki to the new Drupal site, to actually implementing required changes to the OPeNDAP code base. </li>
<li><a href="http://ngp2.wordpress.com/">Philip Ng</a> worked on a wide variety of projects this fall, starting with basic visualizations, helping with ISWC applications, and including iPad development for the Mobile Wine Agent. Philip&#8217;s blog is fascinating to read as he works his way through the challenges of creating applications, including his multi-part series on implementing the social media features.</li>
<li><a href="https://bulaza.wordpress.com/">Alexei Bulazel</a> began working with <a href="http://difranzo.com/">Dominic DiFranzo</a> on a health-related mashup using Data.gov datasets and is now working on a research paper with <a href="http://blog.smeirc.com/">David</a> on &#8220;human flesh search engine&#8221; techniques, a topic that top thinkers including Tetherless World Senior Constellation Professor <a href="http://www.cs.rpi.edu/~hendler/">Jim Hendler</a> have explored in recent talks. <i>Note: For more background on this phenomena, see e.g. <a href="http://nyti.ms/eahdol">China’s Cyberposse</a>, NY Times (03 Mar 2010)</i></li>
</ul>
<p>Many of these students will be continuing on with these or other projects at TWC in 2011; we also expect several new students to be joining the group. The entire team at the Tetherless World Constellation thanks them for their efforts and many important contributions this fall, and looks forward to being amazed by their continued great work in the coming year!</p>
<p><i>John S. Erickson, Ph.D.</i></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bitwacker.wordpress.com/280/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bitwacker.wordpress.com/280/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bitwacker.wordpress.com/280/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bitwacker.wordpress.com/280/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/bitwacker.wordpress.com/280/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/bitwacker.wordpress.com/280/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/bitwacker.wordpress.com/280/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/bitwacker.wordpress.com/280/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bitwacker.wordpress.com/280/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bitwacker.wordpress.com/280/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bitwacker.wordpress.com/280/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bitwacker.wordpress.com/280/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bitwacker.wordpress.com/280/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bitwacker.wordpress.com/280/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=280&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://bitwacker.wordpress.com/2010/12/20/fall-2010-twc-rpi-undergraduate-research-summaries/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/e2ab144cff31bc669eebb5de34f7bfc9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">olyerickson</media:title>
		</media:content>
	</item>
		<item>
		<title>The TWC/Elsevier Data.gov Dataset Search App</title>
		<link>http://bitwacker.wordpress.com/2010/12/19/the-twcelsevier-data-gov-dataset-search-app/</link>
		<comments>http://bitwacker.wordpress.com/2010/12/19/the-twcelsevier-data-gov-dataset-search-app/#comments</comments>
		<pubDate>Sun, 19 Dec 2010 23:51:06 +0000</pubDate>
		<dc:creator>John Erickson</dc:creator>
				<category><![CDATA[computer science]]></category>
		<category><![CDATA[Data.gov]]></category>
		<category><![CDATA[government transparency]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[web science]]></category>
		<category><![CDATA[data.gov]]></category>
		<category><![CDATA[elsevier]]></category>
		<category><![CDATA[mashups]]></category>
		<category><![CDATA[open government data]]></category>
		<category><![CDATA[RPI]]></category>
		<category><![CDATA[sciverse]]></category>
		<category><![CDATA[Tetherless World Constellation]]></category>

		<guid isPermaLink="false">http://bitwacker.wordpress.com/?p=265</guid>
		<description><![CDATA[Since Summer 2010 I&#8217;ve had the privilege of working as a research engineer at the Tetherless World Constellation (TWC) at RPI, primarily helping the team in the execution of various projects related to their association with the Obama Administration&#8217;s Data.gov initiative. One of those projects is an applet for the Elsevier SciVerse Hub portal. The [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=265&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><em>Since Summer 2010 I&#8217;ve had the privilege of working as a research engineer at the <a href="http://tw.rpi.edu">Tetherless World Constellation</a> (TWC) at RPI, primarily helping the team in the execution of various projects related to their association with the Obama Administration&#8217;s <a href="http://data.gov">Data.gov</a> initiative. One of those projects is an applet for the Elsevier <a href="http://sciverse.com">SciVerse Hub</a> portal. The following is from the <a>description page</a> for our application.</em></p>
<div id="attachment_275" class="wp-caption alignleft" style="width: 289px"><a href="http://bitwacker.files.wordpress.com/2010/12/logd_profile_screenshot.png"><img class="size-full wp-image-275" title="logd_profile_Screenshot" src="http://bitwacker.files.wordpress.com/2010/12/logd_profile_screenshot.png?w=500" alt=""   /></a><p class="wp-caption-text">Data.gov Dataset Search (Profile View)</p></div>
<p>The <strong>US Government Dataset Search</strong> application is an easy way for SciVerse users and developers to search from among over 300,000 available US government datasets at <a href="http://data.gov/" target="_blank">http://data.gov</a> to automatically find matches to their queries. Based on the user&#8217;s SciVerse Hub query, searches are simultaneously made against all datasets published through <a href="http://data.gov/" target="_blank">Data.gov</a> as well as the RDF-converted data and related demos at the <a href="http://logd.tw.rpi.edu/" target="_blank">Linking Open Government Data (LOGD) portal</a>, created by the <a href="http://tw.rpi.edu/" target="_blank">Tetherless World Constellation (TWC)</a> at <a href="http://rpi.edu/" target="_blank">Rensselaer Polytechnnic Institute (RPI)</a>.</p>
<p>Any user with the ability to search SciVerse Hub can use the US Government Dataset Search application. The application and the government data it exposes are made available free of charge. The US Government Dataset Search application is targeted at both SciVerse end users (researchers) and application developers interested in applying government datasets to their applications. <em>Researchers</em> utilizing SciVerse Hub are able to discover and access contextually relevant data from the US Government. <em>Developers</em> may utilize SciVerse Hub to identify RDF-converted data sets based on the US Government data and access this data in their applications through SPARQL endpoints or retrieve the datasets themselves.</p>
<p><em>How the US Government Dataset Search application works:</em> For each SciVerse query the user makes, a keyword search across all current Data.gov datasets is made via a SPARQL endpoint at the TWC LOGD portal. A summary of these results is presented on the Hub search results page. Detailed results are presented in tabular form in the &#8216;Canvas&#8217; (larger) view by clicking on any link. On the canvas view links are provided directly to the Data.gov dataset description pages as well as RDF-converted versions of these datasets at the TWC LOGD portal. Note that faceted search is not available with the application and only the original query in Hub willbe submitted.</p>
<p>All queries are made against the LOGD SPARQL endpoint at <a href="http://logd.tw.rpi.edu/sparql" target="_blank">http://logd.tw.rpi.edu/sparql</a> The application also makes use of the <a href="http://code.google.com/apis/visualization/interactive_charts.html" target="_blank">Google Visualization toolkit.</a></p>
<p>This application is optimized for Firefox, Chrome and Internet Explorer 8.</p>
<p>For more information about creating mashups using Data.gov datasets, please check out RPI&#8217;s <em>Linking Open Goverment Data (LOGD) Portal</em> at <a href="http://logd.tw.rpi.edu/" target="_blank">http://logd.tw.rpi.edu</a></p>
<p><strong>About the TWC Linking Open Government Data project:</strong> The TWC LOGD team investigates opening and linking government data using Semantic Web technologies. TWC LOGD actively develops tools for the large-scale translation of government-related datasets into RDF, linking them into the &#8216;Web of Data&#8217; and providing demos and tutorials on various means for consuming linked government data, including creating mashups, applications and data visualizations. The TWC LOGD Portal was awarded second place (open division) at the <a href="http://challenge.semanticweb.org/" target="_blank">2010 Semantic Web Challenge</a>, held during the 2010 International Semantic Web Conference<a href="http://iswc2010.semanticweb.org/" target="_blank">ISWC2010</a>.</p>
<p><strong>About the Tetherless World Constellation at RPI:</strong> The Tetherless World Constellation addresses the emerging area of <a href="http://webscience.org/" target="_blank">Web Science,</a> focusing on the World Wide Web and its future use. <a href="http://tw.rpi.edu/wiki/People" target="_blank">Faculty in the constellation</a> lead explorations into the principles that underlie the Web; enhance the Web&#8217;s reach beyond the desktop and laptop computer; and develop new technologies and languages that expand the capabilities of the Web. TWC researchers use powerful scientific and mathematical techniques from many disciplines to explore the modeling of the Web from network- and information- centric views. TWC&#8217;s objectives include making the next generation web natural to use while being responsive to the growing variety of policy and social needs, whether in the area of privacy, intellectual property, general compliance, or provenance. The Tetherless World Constellation is designing new techniques to explore social, scientific, and legal impacts of the evolving technologies deployed on the Web.</p>
<p><strong>News about the TWC/Elsevier US Government Dataset Search Application</strong></p>
<ol>
<li>Featured in <a href="http://news.rpi.edu/update.do?artcenterkey=2808">Looking Back at 2010 at Rensselar</a> RPI News &amp; Events (20 Dec 2010)</li>
<li><a href="http://bit.ly/huyUhe">SciVerse Hub Application Connects Researchers with U.S. Government Datasets</a> Information Today (20 Dec 2010) </li>
<li><a href="http://www.data.gov/communities/node/116/story/215">U.S. Government Dataset Search Opens Data.gov to Scientists</a> Data.gov website (14 Dec 2010)</li>
<li><a href="http://news.rpi.edu/update.do?artcenterkey=2804">New Application Allows Scientists Easy Access to Important Government Data</a> RPI News &amp; Events (10 Dec 2010)</li>
<li><a href="http://www.labmanager.com/news.asp?ID=1237">New Application Allows Scientists Easy Access to Important Government Data</a> Lab Manager Magazine (13 Dec 2010)</li>
<li><a href="http://www.eurekalert.org/pub_releases/2010-12/rpi-naa121010.php">New Application Allows Scientists Easy Access to Important Government Data</a> EurekAlert (10 Dec 2010)</li>
<li><a href="http://www.physorg.com/news/2010-12-application-scientists-easy-access-important.html">New Application Allows Scientists Easy Access to Important Government Data</a> Physorg.com (10 Dec 2010)</li>
<li><a href="http://bit.ly/hCI0fx">New Application Allows Scientists Easy Access to Important Government Data</a> FirstScience.com (10 Dec 2010)</li>
<li><a href="http://bit.ly/eRnYAj">New Application Allows Scientists Easy Access to Important Government Data</a> NewsoDrone.com (10 Dec 2010)</li>
</ol>
<p><b>UPDATE:</b> I&#8217;m currently developing an iGoogle Gadget version of the SciVerse app, based on the same core queries. A screen shot of the &#8220;profile&#8221; view of that app appears below. In addition to enabling me to monitor the health of our systems from my desktop, it also enables me to test out possible features for the SciVerse app itself.<br />
<div id="attachment_287" class="wp-caption alignleft" style="width: 384px"><a href="http://bitwacker.files.wordpress.com/2010/12/logd_google_20dec10.png"><img src="http://bitwacker.files.wordpress.com/2010/12/logd_google_20dec10.png?w=500" alt="" title="logd_google_20dec10"   class="size-full wp-image-287" /></a><p class="wp-caption-text">iGoogle Gadget version of the US Government Dataset Search app</p></div></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bitwacker.wordpress.com/265/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bitwacker.wordpress.com/265/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bitwacker.wordpress.com/265/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bitwacker.wordpress.com/265/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/bitwacker.wordpress.com/265/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/bitwacker.wordpress.com/265/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/bitwacker.wordpress.com/265/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/bitwacker.wordpress.com/265/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bitwacker.wordpress.com/265/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bitwacker.wordpress.com/265/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bitwacker.wordpress.com/265/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bitwacker.wordpress.com/265/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bitwacker.wordpress.com/265/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bitwacker.wordpress.com/265/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=265&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://bitwacker.wordpress.com/2010/12/19/the-twcelsevier-data-gov-dataset-search-app/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/e2ab144cff31bc669eebb5de34f7bfc9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">olyerickson</media:title>
		</media:content>

		<media:content url="http://bitwacker.files.wordpress.com/2010/12/logd_profile_screenshot.png" medium="image">
			<media:title type="html">logd_profile_Screenshot</media:title>
		</media:content>

		<media:content url="http://bitwacker.files.wordpress.com/2010/12/logd_google_20dec10.png" medium="image">
			<media:title type="html">logd_google_20dec10</media:title>
		</media:content>
	</item>
		<item>
		<title>What I Want in a Software Developer(tm)</title>
		<link>http://bitwacker.wordpress.com/2010/10/28/what-i-want-in-a-developertm/</link>
		<comments>http://bitwacker.wordpress.com/2010/10/28/what-i-want-in-a-developertm/#comments</comments>
		<pubDate>Thu, 28 Oct 2010 12:18:08 +0000</pubDate>
		<dc:creator>John Erickson</dc:creator>
				<category><![CDATA[Big Ideas]]></category>
		<category><![CDATA[computer science]]></category>
		<category><![CDATA[management]]></category>
		<category><![CDATA[software development]]></category>
		<category><![CDATA[software engineering]]></category>
		<category><![CDATA[cs curricula]]></category>
		<category><![CDATA[Erlang]]></category>
		<category><![CDATA[numerical relativity]]></category>
		<category><![CDATA[PKI]]></category>
		<category><![CDATA[SOTON]]></category>

		<guid isPermaLink="false">http://bitwacker.wordpress.com/?p=249</guid>
		<description><![CDATA[Professors and students in a nearby research group have been brainstorming a syllabus for a new, low-level computer science course. Normally I only &#8220;lurk&#8221; in such discussions, but this time I couldn&#8217;t hold my tongue. The following is my contribution, from my perspective as one who has interacted with &#8220;computer scientists&#8221; as a fellow team [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=249&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Professors and students in a <a href="http://www.dartmouth.edu/~pkilab/index.html">nearby research group</a> have been brainstorming a syllabus for a new, low-level computer science course. Normally I only &#8220;lurk&#8221; in such discussions, but this time I couldn&#8217;t hold my tongue. The following is my contribution, from <a href="http://bitwacker.wordpress.com/john-s-erickson-phd/">my perspective</a> as one who has interacted with &#8220;computer scientists&#8221; as a fellow team member, project leader, hiring manager, business partner and even corporate recruiter (interviewing mostly for other hiring managers). </p>
<p>This version has been edited slightly to make it better suited for a blog&#8230;</p>
<p><i>
<p>As an &#8220;old guy&#8221; who has interviewed his share of CS, CE and EE&#8217;s over the years (and <a href="http://bitwacker.wordpress.com/john-s-erickson-phd/">hire and/or managed</a> more than a few of them), here are my thoughts from an &#8220;outcomes&#8221; perspective&#8230;</p>
<ul>
<li>
<p>It&#8217;s really exciting to work with a developer who groks the concepts to such a degree that <b>specific languages and language boundaries simply don&#8217;t matter.</b> Seeing a prototype done in <a href="http://www.erlang.org/">Erlang</a> because it was perfectly suited is SO much better than listening to whining over how it is hard to do it in Java or C# or Visual Basic N. They are usually curious about everything; the dude that coded a prototype <a href="http://nosql-database.org/">NoSQL</a>-style data store for our team in Erlang had been playing with it for a few months, &#8220;just because&#8230;&#8221;</p>
</li>
<li>
<p><b>Methodical problem solving matters.</b> Which some would equate to Engineering(tm). But really it&#8217;s about gaining a ton of experience attacking problems. The number one thing I&#8217;ve looked for over the years is actual experience &#8212; through project work, interesting course projects, and esp. internships &#8212; in completing cool projects. And please, <b>don&#8217;t wait to be assigned;</b> always look for problems, and just do them.</p>
</li>
<li>
<p><b>Join the software ecosystem.</b> The most impressive developers I&#8217;ve met over the years &#8212; some are currently undergrads at the <a href="http://twrpi.edu">Tetherless World Constellation</a> at <a href="http://rpi.edu">RPI</a> &#8212; understand how to contribute to software ecosystem(s); usually this is through the open source community. They understand the tools, they understand how to engage with other developers, they understand how to analyze and improve other people&#8217;s code.</p>
<p>Here&#8217;s one way to think about it: if you aspire to be a professional musician (or artist), chances are you&#8217;ve participated in the &#8220;music ecosystem&#8221; in a wide variety of ways for many years, even before entering college. The best developers I&#8217;ve met &#8212; and those &#8220;computer scientists&#8221; who are developers at heart &#8212; have done the same (one guy I know built his first Linux kernel when he was in middle school).</p>
</li>
<li>
<p><b>Understand systems end-to-end.</b> Now we&#8217;re back to the topic at hand <img src='http://s1.wp.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' />  The best contributors over the years have been those who had hands-on experience with absolutely every aspect of the &#8220;system.&#8221; This doesn&#8217;t mean going <b>From Relays to Twitter in 10 Weeks</b>, but it does mean understanding the relationships between all system elements.</p>
</li>
</ul>
<p>I doubt very much that this is a problem for anyone on this list, because the very nature of <a href="http://www.dartmouth.edu/~deploypki/overview.html">PKI</a> work requires one to have just this sort of broad and deep knowledge; plus, your professor and I have had a few conversations about this over the years&#8230;BTW, my daughter&#8217;s now at <a href="http://www.soton.ac.uk/">Southampton</a> working on her Ph.D in <a href="http://bit.ly/c7bBkO">numerical relativity</a> and writing code on a <a href="http://bit.ly/adKUFb">supercomputer cluster<a> <img src='http://s1.wp.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' />
</p>
<p></i></p>
<p><b>UPDATE (29 Oct 2010):</b> <a href="http://www.nature.com/">Nature</a> recently published this interesting article, <a href="http://www.nature.com/news/2010/101013/full/467775a.html"> Computational science: &#8230;Error …why scientific programming does not compute,</a> (13 Oct 2010) on the increasing need for scientists to have hard-core software engineering skills to do their science.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bitwacker.wordpress.com/249/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bitwacker.wordpress.com/249/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bitwacker.wordpress.com/249/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bitwacker.wordpress.com/249/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/bitwacker.wordpress.com/249/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/bitwacker.wordpress.com/249/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/bitwacker.wordpress.com/249/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/bitwacker.wordpress.com/249/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bitwacker.wordpress.com/249/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bitwacker.wordpress.com/249/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bitwacker.wordpress.com/249/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bitwacker.wordpress.com/249/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bitwacker.wordpress.com/249/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bitwacker.wordpress.com/249/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=249&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://bitwacker.wordpress.com/2010/10/28/what-i-want-in-a-developertm/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/e2ab144cff31bc669eebb5de34f7bfc9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">olyerickson</media:title>
		</media:content>
	</item>
		<item>
		<title>Data Quality is in its Fitness to the Beholder</title>
		<link>http://bitwacker.wordpress.com/2010/07/12/data-beauty-is-in-the-eye-of-the-beholder/</link>
		<comments>http://bitwacker.wordpress.com/2010/07/12/data-beauty-is-in-the-eye-of-the-beholder/#comments</comments>
		<pubDate>Mon, 12 Jul 2010 14:30:22 +0000</pubDate>
		<dc:creator>John Erickson</dc:creator>
				<category><![CDATA[data quality]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[ranking]]></category>

		<guid isPermaLink="false">http://bitwacker.wordpress.com/?p=234</guid>
		<description><![CDATA[A few weeks ago Leigh Dodds began a thoughtful discussion on SemanticOverflow with the question: There&#8217;s an increasing variety of data available as Linked Data coming from a range of different sources. I&#8217;m wondering what indicators we might use to judge the &#8220;quality&#8221; of a dataset&#8230;Clearly quality is a subjective thing, but I&#8217;d be interested [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=234&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A few weeks ago <a href="http://www.ldodds.com/blog/">Leigh Dodds</a> began a thoughtful discussion on <a href="http://www.semanticoverflow.com/questions/1072/quality-indicators-for-linked-data-datasets">SemanticOverflow</a> with the question:</p>
<blockquote><p>There&#8217;s an increasing variety of data available as <a href="http://linkeddata.org">Linked Data</a> coming from a range of different sources. I&#8217;m wondering what indicators we might use to judge the &#8220;quality&#8221; of a dataset&#8230;Clearly quality is a subjective thing, but I&#8217;d be interested to know what factors people might use to indicate whether a dataset was trustworthy, well modelled, sustainable, etc.</p></blockquote>
<p>For starters, I think we can all agree at the highest level that the measure of data quality is <i>subjective</i> and that &#8220;beauty is in the eye of the beholder&#8221;: the quality of a dataset is measured by its <i>fitness for use</i> in specific applications. <i>This question of determining and disseminating &#8220;fitness&#8221; scores is the rub!</i></p>
<p>In his answer to Leigh&#8217;s question, <a href="http://www.cs.umbc.edu/~finin/">Tim Finin</a> proposes adopting a <a href="http://en.wikipedia.org/wiki/PageRank">PageRank</a>-like mechanism, &#8220;LODrank&#8221; based on measured usage</p>
<blockquote><p>We could define LODrank as a PageRank-like measure that was a function of the number of links to/from other LOD datasets weighted by their LODrank. Alternatively, it might divided by the number of linkable instances in the collection, so that large datasets did not have an advantage&#8230;</p></blockquote>
<p>This approach scores data quality based on <i>observed fitness</i> as evidenced by discovered use and has the advantage of automation.</p>
<p>My replies went in a different direction, focusing instead on the subjective nature of data quality and the need to aggregate consumer-space rankings of datasets across a set of dimensions. In his 2005 white paper <a href="http://bit.ly/yAfbE5">Principles of Data Quality</a> [1] Arthur D. Chapman writes,</p>
<blockquote><p>Data quality is multidimensional, and involves data management, modelling and analysis, quality control and assurance, storage and presentation. As independently stated by Chrisman [2] and Strong et al. [3], <i>data quality is related to use and cannot be assessed independently of the user.</i> In a database, the data have no actual quality or value [4]; they only have potential value that is realized only when someone uses the data to do something useful. Information quality relates to its ability to satisfy its customers and to meet customers’ needs [5]</p></blockquote>
<p>Chapman goes on to enumerate a set of factors that contribute to fitness-for-use, citing Redman [6]:</p>
<ul>
<li>Accessibility</li>
<li>Accuracy</li>
<li>Timeliness</li>
<li>Completeness</li>
<li>Consistency with other sources</li>
<li>Relevance</li>
<li>Comprehensiveness</li>
<li>Providing a proper level of detail</li>
<li>Easy to &#8220;read&#8221;</li>
<li>Easy to &#8220;interpret&#8221;</li>
</ul>
<p>Each of these factors is fundamentally subjective, even if mechanisms exist within particular domains to take their measure &#8220;objectively.&#8221; Indeed, in some domains such ratings might only be done by humans, either through voting mechanisms or by individual reviewers.</p>
<p>I believe the greater linked data community needs to develop vocabulary terms for expressing metrics for data quality &#8212; consider the ten points above &#8212; and then within individual communities develop agreed-upon means to determine those values. Arguably this is a &#8220;Dublin Core&#8221; approach to the problem, in the sense that terms like <i>completeness</i> or <i>consistency</i> would be reused across domains with inherently different domain-specific meanings, but such reuse would facilitate consumers from other communities choosing datasets from outside their expertise. A non-physicist might then say, &#8220;The physics community says this dataset is <i>accurate</i>, by their measures.&#8221;</p>
<p>Some of these factors are even more deeply subjective and must be evaluated dynamically, based on the consumer&#8217;s immediate context. An example of this is <i>relevance</i>, which could be interpreted as equivalent to a recommendation.</p>
<p><i>If you have thoughts on data quality as it applies to linked data, consider answering <a href="http://www.semanticoverflow.com/questions/1072/quality-indicators-for-linked-data-datasets">Leigh&#8217;s question</a> at <a href="http://www.semanticoverflow.com/questions/1072/quality-indicators-for-linked-data-datasets">SemanticOverflow!</a></i></p>
<p><b>References:</b> (as cited by Chapman)</p>
<ol>
<li>Chapman, A. D. 2005. <a href="http://bit.ly/yAfbE5">Principles of Data Quality</a>, version 1.0. Report for the Global Biodiversity Information Facility, Copenhagen.</li>
<li>Chrisman, N.R., 1991. The Error Component in Spatial Data. pp. 165-174 in: Maguire D.J., Goodchild M.F. and Rhind D.W. (eds)</li>
<li>Geographical Information Systems Vol. 1, Principals: Longman Scientific and Technical.</li>
<li>Strong, D.M., Lee, Y.W.and Wang, R.W. 1997. Data quality in context. Communications of ACM 40(5): 103-110.</li>
<li>Dalcin, E.C. 2004. Data Quality Concepts and Techniques Applied to Taxonomic Databases. Thesis for the degree of Doctor of Philosophy,
<li>School of Biological Sciences, Faculty of Medicine, Health and Life Sciences, University of Southampton. November 2004. 266 pp.</li>
<li>English, L.P. 1999. Improving Data Warehouse and Business Information Quality: Methods for Reducing Costs and Increasing Profits. New York: John Wiley &amp; Sons, Inc. 518pp.</li>
<li>Redman, T.C. 2001. Data Quality: The Field Guide. Boston, MA: Digital Press.</li>
</ol>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bitwacker.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bitwacker.wordpress.com/234/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bitwacker.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bitwacker.wordpress.com/234/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/bitwacker.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/bitwacker.wordpress.com/234/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/bitwacker.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/bitwacker.wordpress.com/234/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bitwacker.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bitwacker.wordpress.com/234/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bitwacker.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bitwacker.wordpress.com/234/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bitwacker.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bitwacker.wordpress.com/234/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=234&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://bitwacker.wordpress.com/2010/07/12/data-beauty-is-in-the-eye-of-the-beholder/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/e2ab144cff31bc669eebb5de34f7bfc9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">olyerickson</media:title>
		</media:content>
	</item>
		<item>
		<title>Regarding the Singularity</title>
		<link>http://bitwacker.wordpress.com/2010/06/16/regarding-the-singularity/</link>
		<comments>http://bitwacker.wordpress.com/2010/06/16/regarding-the-singularity/#comments</comments>
		<pubDate>Wed, 16 Jun 2010 14:02:41 +0000</pubDate>
		<dc:creator>John Erickson</dc:creator>
				<category><![CDATA[Big Ideas]]></category>
		<category><![CDATA[singularity movement]]></category>

		<guid isPermaLink="false">http://bitwacker.wordpress.com/?p=216</guid>
		<description><![CDATA[A recent set of articles in the New York Times and elsewhere, including the Kurzweil book, prompted a friend to ask me for my thoughts on the Singularity Movement. Here is an excerpt of the email I wrote: Regarding the Singularity Movement, I think economic arguments such as that presented by Robin Hanson in IEEE [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=216&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><em>A recent set of articles in the <strong>New York Times</strong> and elsewhere, including the Kurzweil book, prompted a friend to ask me for my thoughts on the <strong>Singularity Movement</strong>. Here is an excerpt of the email I wrote:</em></p>
<p><a href="http://bitwacker.files.wordpress.com/2010/06/singularity.jpg"><img class="alignleft size-thumbnail wp-image-221" title="Singularity" src="http://bitwacker.files.wordpress.com/2010/06/singularity.jpg?w=150&#038;h=109" alt="" width="150" height="109" /></a>Regarding the <a href="http://en.wikipedia.org/wiki/Technological_singularity">Singularity Movement</a>, I think economic arguments such as that presented by Robin Hanson in <a href="http://bit.ly/9yrjf3">IEEE Spectrum</a> (2008) carry more weight than the gushing futurist predictions from the likes of <a href="http://en.wikipedia.org/wiki/Ray_Kurzweil">Ray Kurzweil.</a> In the Spectrum article Hanson cites two previous singularities &#8212; the agricultural and industrial revolutions &#8212; and suggests that a revolution in machine intelligence is leading to a third that will take shape over the next half-century.</p>
<p>I tend to take most of what futurists say with a grain of salt, because they rely on a belief/assumption/confidence that the introduction of disruptive technologies into a society yields predictable results &#8212; for good or bad &#8212; which never happens. The combination of factors including technologies being human constructions, the fact that we as humans never make completely rational decisions, and the fact that all of this takes place within a <i>fundamentally chaotic</i>, only approximately predictable context, means that we simply cannot know what will happen in the future!</p>
<p><em><a href="http://bitwacker.files.wordpress.com/2010/06/pagerank-byfml.gif"><img class="alignleft size-thumbnail wp-image-224" title="pagerank-byfml" src="http://bitwacker.files.wordpress.com/2010/06/pagerank-byfml.gif?w=150&#038;h=100" alt="" width="150" height="100" /></a>Here&#8217;s what I know:</em> We humans are wired to build and use tools and, to the extent possible, adapt to the environments we build &#8212; or die trying. Google, while amazing, is still a tool; an engineered system that (given enough time) I can explain to you. Ironically enough, the reason Google works so well is because it&#8217;s actually based on simpler, but more fundamental principals than the systems which preceded it, closer to how naturally-occurring networks emerge and function. But the way Google has been adopted and applied in the &#8220;ecosystem,&#8221; while making sense in hindsight, could not have been predicted.</p>
<p><a href="http://bitwacker.files.wordpress.com/2010/06/lehrer_how_1.png"><img class="alignleft size-thumbnail wp-image-223" title="Lehrer_How_1" src="http://bitwacker.files.wordpress.com/2010/06/lehrer_how_1.png?w=118&#038;h=150" alt="" width="118" height="150" /></a>I&#8217;m currently reading Jonah Lehrer&#8217;s <a href="http://www.jonahlehrer.com/books">How We Decide</a>, a wonderful exploration of the biochemistry of how we make decisions. Any such discussion naturally much touch on how various imbalances (e.g. dopamine, etc) effect that process, and how well-intentioned efforts by doctors to counteract certain imbalances leads to very unexpected and usually undesired results. </p>
<p>Lehrer&#8217;s book makes it profoundly clear that <i>we never know for certain what will happen when we diddle with the decision-making processes in our brain,</i> whether it involves extending the lower levels of the nervous system (the sensory level) or the higher level processes. Researchers <em>do</em> know that we seem to adapt well to lower-level, e.g. neural prosthetics, but each higher-level process involves a synaptic algorithm that we don&#8217;t completely understand &#8212; mostly because our brain is a distributed system, not a single &#8220;algorithm,&#8221; whose &#8220;result&#8221; is emergent.</p>
<p>That ultimately is my point: our brains are distributed systems that exhibit adaptive and unpredictable behaviors, and we can&#8217;t begin to understand what will happen when we explore higher-level prosthetics based on &#8220;intelligent machines.&#8221; <em>Something</em> will happen, but there is no reason to believe it will lead to either a Utopian <em>or</em> Dystopian existence any more than the agricultural or industrial revolutions resulted in one or the other. Indeed, the introduction of those practices to certain natural and economic ecosystems led to both regional successes and catastrophes.</p>
<p><b>For Further Information:</b></p>
<ul>
<li><a href="http://singularityu.org/">The Singularity University</a></li>
<li><a href="http://www.nytimes.com/2010/06/13/business/13sing.html">Merely Human? That’s So Yesterday.</a> New York Times (11 June 2010).</li>
<li><a href="http://spectrum.ieee.org/robotics/robotics-software/economics-of-the-singularity/0">Economics Of The Singularity</a> IEEE Spectrum (May 2008)</li>
<li><a href="http://www.kurzweilai.net/meme/frame.html?m=1">Singularity articles</a> at <a href="http://www.kurzweilai.net">http://www.kurzweilai.net</a></li>
<li><a href="http://www.onintelligence.org/">On Intelligence,</a> the companion site to Jeff Hawkin&#8217;s provocative book by the same name. The book introduces the concept of <a href="http://www.numenta.com/about-numenta/numenta-technology.php">Hierarchical Temporal Memory</a> (HTM) based on a layered hierarchical model of how the neocortex functions.</li>
</ul>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bitwacker.wordpress.com/216/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bitwacker.wordpress.com/216/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bitwacker.wordpress.com/216/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bitwacker.wordpress.com/216/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/bitwacker.wordpress.com/216/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/bitwacker.wordpress.com/216/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/bitwacker.wordpress.com/216/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/bitwacker.wordpress.com/216/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bitwacker.wordpress.com/216/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bitwacker.wordpress.com/216/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bitwacker.wordpress.com/216/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bitwacker.wordpress.com/216/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bitwacker.wordpress.com/216/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bitwacker.wordpress.com/216/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=216&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://bitwacker.wordpress.com/2010/06/16/regarding-the-singularity/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/e2ab144cff31bc669eebb5de34f7bfc9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">olyerickson</media:title>
		</media:content>

		<media:content url="http://bitwacker.files.wordpress.com/2010/06/singularity.jpg?w=150" medium="image">
			<media:title type="html">Singularity</media:title>
		</media:content>

		<media:content url="http://bitwacker.files.wordpress.com/2010/06/pagerank-byfml.gif?w=150" medium="image">
			<media:title type="html">pagerank-byfml</media:title>
		</media:content>

		<media:content url="http://bitwacker.files.wordpress.com/2010/06/lehrer_how_1.png?w=118" medium="image">
			<media:title type="html">Lehrer_How_1</media:title>
		</media:content>
	</item>
		<item>
		<title>Concerning the King Arthur Flour Expansion</title>
		<link>http://bitwacker.wordpress.com/2010/05/28/concerning-the-king-arthur-flour-expansion/</link>
		<comments>http://bitwacker.wordpress.com/2010/05/28/concerning-the-king-arthur-flour-expansion/#comments</comments>
		<pubDate>Fri, 28 May 2010 19:29:46 +0000</pubDate>
		<dc:creator>John Erickson</dc:creator>
				<category><![CDATA[Big Ideas]]></category>
		<category><![CDATA[politics]]></category>

		<guid isPermaLink="false">http://bitwacker.wordpress.com/?p=208</guid>
		<description><![CDATA[Recently the King Arthur Flour Company, a global provider of quality baking supplies based in my home town of Norwich, Vermont, proposed an expansion that would include a sewer extension. This issue is being debated locally, and I thought would provide good fodder for my blog&#8230;John Erickson Since Jill and I moved to Norwich some [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=208&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><i>Recently the <a href="http://www.kingarthurflour.com/">King Arthur Flour Company</a>, a global provider of quality baking supplies based in my home town of Norwich, Vermont, proposed an expansion that would include a sewer extension. This issue is being <a href="http://norwichnavel.blogspot.com/2010/05/king-arthur-qualified.html">debated locally</a>, and I thought would provide good fodder for my blog&#8230;John Erickson</i></p>
<p>Since Jill and I moved to Norwich some 18 years ago, I&#8217;ve been troubled by what seems like a lack of support for sustainable economic development within our town. I&#8217;m proud that Norwich has a high-quality global company &#8220;like&#8221; King Arthur based here, a company that is employ-owned, successful and growing. At the same time I&#8217;m embarrassed that Norwich isn&#8217;t doing more to sustain the economic well being of the Upper Valley.</p>
<p>15 years ago this month partners and I began the process of launching a company called NetRghts. Loving Norwich and Vermont, I had a vision of starting a sustainable high-tech company that would be based here and would create local jobs. The inevitable question of where to base our company arose; being the Vermonter in the mix and drinking from the KoolAid of iconic successes like Green Mountain Gringo, I argued for us to set up offices in Norwich, Wilder or WRJ. My co-founders thought this was <i>ludicrous</i>; not only did they envision the (obvious to them) negative tax implications, but they also perceived no end of difficulty with infrastructure, etc. Since they had been successful with a previous Lebanon-based software startup, I went along for the ride and we set up shop in downtown Lebanon.</p>
<p><i>But I wouldn&#8217;t give up that easily.</i> At one point Vermont eTV &#8212; remember them? &#8212; had a call-in with Gov Dean&#8217;s youthful, energetic director of economic development. Vermont had recently provided incentives for ETI&#8217;s expansion, and my direct question to &#8220;Slick&#8221; was: what can Vermont do to keep companies like ours <i>in Vermont?</i> Or, were my co-founders right, there (weren&#8217;t) any incentives to lure us to Vermont. His answer: regrettably, yes, my co-founders were right. If we needed money for bricks-n-mortar expansion to grow a widget-building business, yes, but since we were &#8220;knowledge-based,&#8221; <i>nothing.</i> Frankly, I was shocked, since this was during the same period that Gov Dean (who I&#8217;m a fan of!) was roaming the state advocating green high-tech businesses in cabins on mountaintops&#8230;</p>
<p>I&#8217;ve bored you with this ancient history in order to provide some context as to why I believe the citizens of Norwich should greet initiatives such as King Arthur&#8217;s with the question, <i>what can we as neighbors do to help?</i> Their opening proposal may or may not be ideal &#8212; I&#8217;m not saying &#8220;Roll over, little Norwich!&#8221; &#8212; but I <i>do</i> believe it is our responsibility to do what we can to foster economic development in this town, and this includes hearing their plans with an open mind.</p>
<p>I&#8217;m tired of Norwich not merely depending on, but <i>assuming</i> that other towns in the region will feed our hungry, host our homeless, pay our salaries, sell us our auto parts. Instead, we should be asking how we can help those among us with the initiative to bring it on home to Norwich&#8230;</p>
<p><i>Disclaimer: I am not affiliated with King Arthur Flour, but I do confess to loving their products and have been known to roam their <a href="http://www.kingarthurflour.com/jobs/">jobs portal</a> from time to time&#8230;</i></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bitwacker.wordpress.com/208/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bitwacker.wordpress.com/208/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bitwacker.wordpress.com/208/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bitwacker.wordpress.com/208/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/bitwacker.wordpress.com/208/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/bitwacker.wordpress.com/208/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/bitwacker.wordpress.com/208/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/bitwacker.wordpress.com/208/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bitwacker.wordpress.com/208/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bitwacker.wordpress.com/208/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bitwacker.wordpress.com/208/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bitwacker.wordpress.com/208/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bitwacker.wordpress.com/208/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bitwacker.wordpress.com/208/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bitwacker.wordpress.com&amp;blog=11115446&amp;post=208&amp;subd=bitwacker&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://bitwacker.wordpress.com/2010/05/28/concerning-the-king-arthur-flour-expansion/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/e2ab144cff31bc669eebb5de34f7bfc9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">olyerickson</media:title>
		</media:content>
	</item>
	</channel>
</rss>
