<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Structured Search Is On The Table</title>
	<atom:link href="http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/feed/" rel="self" type="application/rss+xml" />
	<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/</link>
	<description></description>
	<lastBuildDate>Wed, 17 Mar 2010 11:17:41 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Daniel Tunkelang</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4912</link>
		<dc:creator>Daniel Tunkelang</dc:creator>
		<pubDate>Tue, 24 Nov 2009 01:41:42 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4912</guid>
		<description>Not that Google Squared is fully baked, but I get lot more than two answers when query &lt;a href=&quot;http://www.google.com/squared/search?q=oracle+acquisitions&quot; rel=&quot;nofollow&quot;&gt;oracle acquisitions&lt;/a&gt;.

BEA Systems, Demantra, Retek, TimesTen, Portal Software, Oblix, LODESTAR Corporation, Stellent, and 360Commerce look right to me. Network computer and Oracle Corporations are false positives--and Sun Microsystems, which actually isn&#039;t an Oracle acquisition yet, triggered the latter. So there&#039;s work to do, for sure. But, as the &lt;a href=&quot;http://www.google.com/support/faqs/bin/answer.py?answer=139211&quot; rel=&quot;nofollow&quot;&gt;FAQ&lt;/a&gt; says, &quot;Google Squared is still in an experimental phase at this point, so you might encounter some hiccups while using it.&quot;

There are paid services in this domain, e.g., those listed Dow Jones&#039;s &lt;a href=&quot;http://www.taxonomywarehouse.com/&quot; rel=&quot;nofollow&quot;&gt;Taxonomy Warehouse&lt;/a&gt;. But it seems that a number of folks are competing in the world of free. That means a lot of price pressure--if you&#039;re going to charge, you really have to knock it out of the park, and to hope that free doesn&#039;t catch up on quality.</description>
		<content:encoded><![CDATA[<p>Not that Google Squared is fully baked, but I get lot more than two answers when query <a href="http://www.google.com/squared/search?q=oracle+acquisitions" rel="nofollow">oracle acquisitions</a>.</p>
<p>BEA Systems, Demantra, Retek, TimesTen, Portal Software, Oblix, LODESTAR Corporation, Stellent, and 360Commerce look right to me. Network computer and Oracle Corporations are false positives&#8211;and Sun Microsystems, which actually isn&#8217;t an Oracle acquisition yet, triggered the latter. So there&#8217;s work to do, for sure. But, as the <a href="http://www.google.com/support/faqs/bin/answer.py?answer=139211" rel="nofollow">FAQ</a> says, &#8220;Google Squared is still in an experimental phase at this point, so you might encounter some hiccups while using it.&#8221;</p>
<p>There are paid services in this domain, e.g., those listed Dow Jones&#8217;s <a href="http://www.taxonomywarehouse.com/" rel="nofollow">Taxonomy Warehouse</a>. But it seems that a number of folks are competing in the world of free. That means a lot of price pressure&#8211;if you&#8217;re going to charge, you really have to knock it out of the park, and to hope that free doesn&#8217;t catch up on quality.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pankaj Mehra</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4911</link>
		<dc:creator>Pankaj Mehra</dc:creator>
		<pubDate>Tue, 24 Nov 2009 00:21:04 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4911</guid>
		<description>The query Oracle acquisitions on Google Squared brings back a whopping two answers! Some things are worth paying for so you can get good qality answers. Room for a half-decent paid service here.</description>
		<content:encoded><![CDATA[<p>The query Oracle acquisitions on Google Squared brings back a whopping two answers! Some things are worth paying for so you can get good qality answers. Room for a half-decent paid service here.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: The latest search and social news for geeks - Ass hats, Crap hats, SEO Sucks and more &#124; Search Engine Journal</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4708</link>
		<dc:creator>The latest search and social news for geeks - Ass hats, Crap hats, SEO Sucks and more &#124; Search Engine Journal</dc:creator>
		<pubDate>Tue, 20 Oct 2009 14:20:29 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4708</guid>
		<description>[...] Structured Search Is On The Table – the Noisy Channel [...]</description>
		<content:encoded><![CDATA[<p>[...] Structured Search Is On The Table – the Noisy Channel [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Christopher</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4665</link>
		<dc:creator>Christopher</dc:creator>
		<pubDate>Thu, 15 Oct 2009 02:05:34 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4665</guid>
		<description>@Laurent,

This is an example of where using more syntactic/semantic data as context rather than just c0-located n-gams can really improve recall.

I also agree with Daniel that using corpus analysis to improve on document-level named entity detection is a great tool in the arsenal.</description>
		<content:encoded><![CDATA[<p>@Laurent,</p>
<p>This is an example of where using more syntactic/semantic data as context rather than just c0-located n-gams can really improve recall.</p>
<p>I also agree with Daniel that using corpus analysis to improve on document-level named entity detection is a great tool in the arsenal.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daniel Tunkelang</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4664</link>
		<dc:creator>Daniel Tunkelang</dc:creator>
		<pubDate>Thu, 15 Oct 2009 00:50:46 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4664</guid>
		<description>Laurent, my experience is comparable, and others I&#039;ve talked to say similar things. The way I&#039;ve improved on document-level annotation has been by leveraging corpus analysis, e.g., in the &quot;Supporting Exploratory Search for the ACM Digital Library&quot; work presented at &lt;a href=&quot;http://research.microsoft.com/en-us/um/people/ryenw/hcir2008/&quot; rel=&quot;nofollow&quot;&gt;HCIR 2008&lt;/a&gt;. We had similar success applying corpus analysis to improve on document-level named entity detection, which, in my experience, offers a similar challenge of low recall.</description>
		<content:encoded><![CDATA[<p>Laurent, my experience is comparable, and others I&#8217;ve talked to say similar things. The way I&#8217;ve improved on document-level annotation has been by leveraging corpus analysis, e.g., in the &#8220;Supporting Exploratory Search for the ACM Digital Library&#8221; work presented at <a href="http://research.microsoft.com/en-us/um/people/ryenw/hcir2008/" rel="nofollow">HCIR 2008</a>. We had similar success applying corpus analysis to improve on document-level named entity detection, which, in my experience, offers a similar challenge of low recall.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Laurent</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4662</link>
		<dc:creator>Laurent</dc:creator>
		<pubDate>Thu, 15 Oct 2009 00:05:44 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4662</guid>
		<description>I&#039;ve tried their online demo at &lt;a href=&quot;http://viewer.opencalais.com/&quot; rel=&quot;nofollow&quot;&gt;http://viewer.opencalais.com/&lt;/a&gt; but it doesn&#039;t pick up the locations (most often just state level, not city) or dates and times. However, it&#039;s good at identifying people and sometimes main topics (e.g. environment).
For &lt;a href=&quot;http://www.twillage.com&quot; rel=&quot;nofollow&quot;&gt;twillage.com&lt;/a&gt;, I&#039;d like to be able to query &quot;conferences in Europe in December&quot; or &quot;happy hours near Palo Alto this friday&quot;.
What about you?</description>
		<content:encoded><![CDATA[<p>I&#8217;ve tried their online demo at <a href="http://viewer.opencalais.com/" rel="nofollow">http://viewer.opencalais.com/</a> but it doesn&#8217;t pick up the locations (most often just state level, not city) or dates and times. However, it&#8217;s good at identifying people and sometimes main topics (e.g. environment).<br />
For <a href="http://www.twillage.com" rel="nofollow">twillage.com</a>, I&#8217;d like to be able to query &#8220;conferences in Europe in December&#8221; or &#8220;happy hours near Palo Alto this friday&#8221;.<br />
What about you?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daniel Tunkelang</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4652</link>
		<dc:creator>Daniel Tunkelang</dc:creator>
		<pubDate>Wed, 14 Oct 2009 19:43:39 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4652</guid>
		<description>Have you tried experimenting with &lt;a href=&quot;http://opencalais.com/&quot; rel=&quot;nofollow&quot;&gt;Open Calais&lt;/a&gt;? It&#039;s not perfect, but it&#039;s free if you don&#039;t bang on it too hard.</description>
		<content:encoded><![CDATA[<p>Have you tried experimenting with <a href="http://opencalais.com/" rel="nofollow">Open Calais</a>? It&#8217;s not perfect, but it&#8217;s free if you don&#8217;t bang on it too hard.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Laurent</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4651</link>
		<dc:creator>Laurent</dc:creator>
		<pubDate>Wed, 14 Oct 2009 19:29:49 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4651</guid>
		<description>If the tweet doesn&#039;t contain neither a date, time, or placename, but a link, we could follow it and try to figure out if the landing page describes an event and get the place, date and time from there. Maybe you could help ;)</description>
		<content:encoded><![CDATA[<p>If the tweet doesn&#8217;t contain neither a date, time, or placename, but a link, we could follow it and try to figure out if the landing page describes an event and get the place, date and time from there. Maybe you could help <img src='http://thenoisychannel.com/wordpress/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daniel Tunkelang</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4650</link>
		<dc:creator>Daniel Tunkelang</dc:creator>
		<pubDate>Wed, 14 Oct 2009 19:22:02 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4650</guid>
		<description>Nicely done. Not clear that you can do much more with 140-character tweets.</description>
		<content:encoded><![CDATA[<p>Nicely done. Not clear that you can do much more with 140-character tweets.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Laurent</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4649</link>
		<dc:creator>Laurent</dc:creator>
		<pubDate>Wed, 14 Oct 2009 18:12:53 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4649</guid>
		<description>automatically, but it&#039;s simply keyword-based. We currently monitor about 100 keywords using twitter&#039;s streaming API (track). And use our own regular expressions to detect dates/times and then use another service for placename extraction.</description>
		<content:encoded><![CDATA[<p>automatically, but it&#8217;s simply keyword-based. We currently monitor about 100 keywords using twitter&#8217;s streaming API (track). And use our own regular expressions to detect dates/times and then use another service for placename extraction.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daniel Tunkelang</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4648</link>
		<dc:creator>Daniel Tunkelang</dc:creator>
		<pubDate>Wed, 14 Oct 2009 17:58:50 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4648</guid>
		<description>Interesting--definitely better precision than &lt;a href=&quot;http://search.twitter.com/search?q=&amp;ands=concert&amp;near=San+Francisco&amp;within=15&amp;units=mi&quot; rel=&quot;nofollow&quot;&gt;http://search.twitter.com/search?q=&amp;ands=concert&amp;near=San+Francisco&amp;within=15&amp;units=mi&lt;/a&gt;.  Do you implement event detection through &lt;a href=&quot;http://www.twillage.com/post&quot; rel=&quot;nofollow&quot;&gt;humans&lt;/a&gt; or analyzing the text of tweets?</description>
		<content:encoded><![CDATA[<p>Interesting&#8211;definitely better precision than <a href="http://search.twitter.com/search?q=&#038;ands=concert&#038;near=San+Francisco&#038;within=15&#038;units=mi" rel="nofollow">http://search.twitter.com/search?q=&#038;ands=concert&#038;near=San+Francisco&#038;within=15&#038;units=mi</a>.  Do you implement event detection through <a href="http://www.twillage.com/post" rel="nofollow">humans</a> or analyzing the text of tweets?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Laurent</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4647</link>
		<dc:creator>Laurent</dc:creator>
		<pubDate>Wed, 14 Oct 2009 17:40:07 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4647</guid>
		<description>At &lt;a href=&quot;http://www.twillage.com&quot; rel=&quot;nofollow&quot;&gt;http://www.twillage.com&lt;/a&gt; we also do structured search. We try to detect event-related tweets from twitter, extract placenames, dates and then let people search for what&#039;s happening in a specific town.
e.g. &lt;a href=&quot;http://www.twillage.com/search?q=concert+san+francisco&quot; rel=&quot;nofollow&quot;&gt;http://www.twillage.com/search?q=concert+san+francisco&lt;/a&gt;</description>
		<content:encoded><![CDATA[<p>At <a href="http://www.twillage.com" rel="nofollow">http://www.twillage.com</a> we also do structured search. We try to detect event-related tweets from twitter, extract placenames, dates and then let people search for what&#8217;s happening in a specific town.<br />
e.g. <a href="http://www.twillage.com/search?q=concert+san+francisco" rel="nofollow">http://www.twillage.com/search?q=concert+san+francisco</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Christopher</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4643</link>
		<dc:creator>Christopher</dc:creator>
		<pubDate>Wed, 14 Oct 2009 02:41:45 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4643</guid>
		<description>LOL,

Yes, many ways to say the same thing. I&#039;m following the InfoChimps closely, they and Freebase seem to be a significant step ahead of the others.</description>
		<content:encoded><![CDATA[<p>LOL,</p>
<p>Yes, many ways to say the same thing. I&#8217;m following the InfoChimps closely, they and Freebase seem to be a significant step ahead of the others.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daniel Tunkelang</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4642</link>
		<dc:creator>Daniel Tunkelang</dc:creator>
		<pubDate>Wed, 14 Oct 2009 02:35:18 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4642</guid>
		<description>Infochimps is &quot;the world&#039;s largest open platform for data&quot;. Freebase is &quot;an open database of the world’s information&quot;. Factual is &quot;a platform where anyone can share and mash open data on any subject&quot;. Will the real Slim Shady please stand up? :-)

&lt;object width=&quot;425&quot; height=&quot;344&quot;&gt;&lt;param name=&quot;movie&quot; value=&quot;http://www.youtube.com/v/1QRKxDuZPJs&amp;hl=en&amp;fs=1&amp;rel=0&quot;&gt;&lt;/param&gt;&lt;param name=&quot;allowFullScreen&quot; value=&quot;true&quot;&gt;&lt;/param&gt;&lt;param name=&quot;allowscriptaccess&quot; value=&quot;always&quot;&gt;&lt;/param&gt;&lt;embed src=&quot;http://www.youtube.com/v/1QRKxDuZPJs&amp;hl=en&amp;fs=1&amp;rel=0&quot; type=&quot;application/x-shockwave-flash&quot; allowscriptaccess=&quot;always&quot; allowfullscreen=&quot;true&quot; width=&quot;425&quot; height=&quot;344&quot;&gt;&lt;/embed&gt;&lt;/object&gt;

But point taken--I&#039;ll check it out.</description>
		<content:encoded><![CDATA[<p>Infochimps is &#8220;the world&#8217;s largest open platform for data&#8221;. Freebase is &#8220;an open database of the world’s information&#8221;. Factual is &#8220;a platform where anyone can share and mash open data on any subject&#8221;. Will the real Slim Shady please stand up? <img src='http://thenoisychannel.com/wordpress/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p><object width="425" height="344"><param name="movie" value="http://www.youtube.com/v/1QRKxDuZPJs&#038;hl=en&#038;fs=1&#038;rel=0"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/1QRKxDuZPJs&#038;hl=en&#038;fs=1&#038;rel=0" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="344"></embed></object></p>
<p>But point taken&#8211;I&#8217;ll check it out.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: christopher</title>
		<link>http://thenoisychannel.com/2009/10/13/structured-search-is-on-the-table/comment-page-1/#comment-4641</link>
		<dc:creator>christopher</dc:creator>
		<pubDate>Wed, 14 Oct 2009 02:23:01 +0000</pubDate>
		<guid isPermaLink="false">http://thenoisychannel.com/?p=2689#comment-4641</guid>
		<description>Hi Daniel,

Take a look at what the @infochimps are doing. They have a super open data repository and a monetization strategy.</description>
		<content:encoded><![CDATA[<p>Hi Daniel,</p>
<p>Take a look at what the @infochimps are doing. They have a super open data repository and a monetization strategy.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
