<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Who will filter the stream first?</title>
	<atom:link href="http://www.craigekerstiens.com/who-will-filter-the-stream-first/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.craigekerstiens.com/who-will-filter-the-stream-first/</link>
	<description>My thoughts and predictions on technology and business, and sometimes strong ones at that</description>
	<lastBuildDate>Tue, 26 Jan 2010 23:01:25 -0800</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
	<item>
		<title>By: Craig</title>
		<link>http://www.craigekerstiens.com/who-will-filter-the-stream-first/comment-page-1/#comment-25532</link>
		<dc:creator>Craig</dc:creator>
		<pubDate>Tue, 26 Jan 2010 23:01:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.craigekerstiens.com/?p=160#comment-25532</guid>
		<description>I agree it COULD result in mis-classification, but there&#039;s a lot being done quite well with unsupervised clustering that&#039;s proving effective. Users on at least some services such as twitter are already providing tagging of other users, and validating which items fall in a category via tagging. By expanding the classification to just beyond California, to California and Traveling you could clearly identify that as an area of interest, thereby eliminating things only pertaining to California and only pertaining to Traveling. Essentially any filtering you could do would be some improvement, and how effective you could be would be more restricted by the amount you wanted to see in your feed as much as it would on how effective your filtering was. Only for the users that greatly limited what they wanted in a feed would you miss out. Since you&#039;re already opting in to relevant users, instead of on the entire corpus of twitter you have to worry less about the overload of information of what may occur for conflicting classifications. 

I think your initial question really becomes the driver for then being the input to filtering the stream. You begin with unsupervised clustering via voluntary tagging, like a hashtag on twitter, and couple that with geolocation. From there you have the classification of information and can track that to how the user interacts.</description>
		<content:encoded><![CDATA[<p>I agree it COULD result in mis-classification, but there&#8217;s a lot being done quite well with unsupervised clustering that&#8217;s proving effective. Users on at least some services such as twitter are already providing tagging of other users, and validating which items fall in a category via tagging. By expanding the classification to just beyond California, to California and Traveling you could clearly identify that as an area of interest, thereby eliminating things only pertaining to California and only pertaining to Traveling. Essentially any filtering you could do would be some improvement, and how effective you could be would be more restricted by the amount you wanted to see in your feed as much as it would on how effective your filtering was. Only for the users that greatly limited what they wanted in a feed would you miss out. Since you&#8217;re already opting in to relevant users, instead of on the entire corpus of twitter you have to worry less about the overload of information of what may occur for conflicting classifications. </p>
<p>I think your initial question really becomes the driver for then being the input to filtering the stream. You begin with unsupervised clustering via voluntary tagging, like a hashtag on twitter, and couple that with geolocation. From there you have the classification of information and can track that to how the user interacts.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Matt</title>
		<link>http://www.craigekerstiens.com/who-will-filter-the-stream-first/comment-page-1/#comment-25530</link>
		<dc:creator>Matt</dc:creator>
		<pubDate>Tue, 26 Jan 2010 20:26:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.craigekerstiens.com/?p=160#comment-25530</guid>
		<description>To answer your last question: because products have a defined set of meta-data and context in which that data is useful.  They have years upon years worth of behavioral patterns stored to determine similar interest among items.

The problem with the information on Facebook and other social sites is that the information isn&#039;t nearly as easily placed into context using artificial intelligence.  The amount of AI work to do something like the example you listed with &quot;California&quot; could easily result into you seeing someone is currently enjoying a Tupac and Dr. Dre track - something you aren&#039;t very likely interested in.

I think the better question becomes: how do you convert a user&#039;s data into actual information?  The first obvious step to me is meta-data, even if it means having the user voluntarily qualify and add extra information to their shared data.  Figuring out this meta-data on a 140 character sentence seems impossible without some type of user input.  I think we already have a lot of tools to help us make a more educated guess: geolocation, &quot;tagging&quot; of other users, etc., but we still have a long way to go.</description>
		<content:encoded><![CDATA[<p>To answer your last question: because products have a defined set of meta-data and context in which that data is useful.  They have years upon years worth of behavioral patterns stored to determine similar interest among items.</p>
<p>The problem with the information on Facebook and other social sites is that the information isn&#8217;t nearly as easily placed into context using artificial intelligence.  The amount of AI work to do something like the example you listed with &#8220;California&#8221; could easily result into you seeing someone is currently enjoying a Tupac and Dr. Dre track &#8211; something you aren&#8217;t very likely interested in.</p>
<p>I think the better question becomes: how do you convert a user&#8217;s data into actual information?  The first obvious step to me is meta-data, even if it means having the user voluntarily qualify and add extra information to their shared data.  Figuring out this meta-data on a 140 character sentence seems impossible without some type of user input.  I think we already have a lot of tools to help us make a more educated guess: geolocation, &#8220;tagging&#8221; of other users, etc., but we still have a long way to go.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic page generated in 0.247 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2010-09-10 11:32:13 -->
