<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	
	xmlns:georss="http://www.georss.org/georss"
	xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#"
	
	>
<channel>
	<title>
	Comments on: Software Needed: Enhanced web search	</title>
	<atom:link href="https://gregladen.com/blog/2009/11/02/software-needed-enhanced-web-s/feed/" rel="self" type="application/rss+xml" />
	<link>https://gregladen.com/blog/2009/11/02/software-needed-enhanced-web-s/</link>
	<description></description>
	<lastBuildDate>Tue, 03 Nov 2009 07:22:34 +0000</lastBuildDate>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.4.8</generator>
	<item>
		<title>
		By: DrMcCoy		</title>
		<link>https://gregladen.com/blog/2009/11/02/software-needed-enhanced-web-s/#comment-550018</link>

		<dc:creator><![CDATA[DrMcCoy]]></dc:creator>
		<pubDate>Tue, 03 Nov 2009 07:22:34 +0000</pubDate>
		<guid isPermaLink="false">http://scienceblogs.com/gregladen/2009/11/02/software-needed-enhanced-web-s/#comment-550018</guid>

					<description><![CDATA[Hmm, or alternatively, I bet a Greasemonkey script doing that could be hacked together...]]></description>
			<content:encoded><![CDATA[<p>Hmm, or alternatively, I bet a Greasemonkey script doing that could be hacked together&#8230;</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: Googleverse		</title>
		<link>https://gregladen.com/blog/2009/11/02/software-needed-enhanced-web-s/#comment-550017</link>

		<dc:creator><![CDATA[Googleverse]]></dc:creator>
		<pubDate>Tue, 03 Nov 2009 07:16:16 +0000</pubDate>
		<guid isPermaLink="false">http://scienceblogs.com/gregladen/2009/11/02/software-needed-enhanced-web-s/#comment-550017</guid>

					<description><![CDATA[Interesting Idea.]]></description>
			<content:encoded><![CDATA[<p>Interesting Idea.</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: Gray Gaffer		</title>
		<link>https://gregladen.com/blog/2009/11/02/software-needed-enhanced-web-s/#comment-550016</link>

		<dc:creator><![CDATA[Gray Gaffer]]></dc:creator>
		<pubDate>Mon, 02 Nov 2009 22:12:42 +0000</pubDate>
		<guid isPermaLink="false">http://scienceblogs.com/gregladen/2009/11/02/software-needed-enhanced-web-s/#comment-550016</guid>

					<description><![CDATA[The Google API can return the results formatted as XML, but their terms mean that you have to use their API, get a special API key, and make the web site that uses it publicly accessible without restrictions. Without their library modules and a valid key, all you get back is encrypted binary.

So back to page scraping.]]></description>
			<content:encoded><![CDATA[<p>The Google API can return the results formatted as XML, but their terms mean that you have to use their API, get a special API key, and make the web site that uses it publicly accessible without restrictions. Without their library modules and a valid key, all you get back is encrypted binary.</p>
<p>So back to page scraping.</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: Dan J		</title>
		<link>https://gregladen.com/blog/2009/11/02/software-needed-enhanced-web-s/#comment-550015</link>

		<dc:creator><![CDATA[Dan J]]></dc:creator>
		<pubDate>Mon, 02 Nov 2009 17:40:57 +0000</pubDate>
		<guid isPermaLink="false">http://scienceblogs.com/gregladen/2009/11/02/software-needed-enhanced-web-s/#comment-550015</guid>

					<description><![CDATA[I&#039;ve written some Perl code that performs web page text analysis for SEO, and some of it seems related to what you&#039;re interested in doing. Stripping out all but the actual &lt;em&gt;content&lt;/em&gt; of the pages is essential.

I can see how &lt;code&gt;googlereg&lt;/code&gt; would be helpful for some genealogical searching that I do once in a while.]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve written some Perl code that performs web page text analysis for SEO, and some of it seems related to what you&#8217;re interested in doing. Stripping out all but the actual <em>content</em> of the pages is essential.</p>
<p>I can see how <code>googlereg</code> would be helpful for some genealogical searching that I do once in a while.</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: Greg Laden		</title>
		<link>https://gregladen.com/blog/2009/11/02/software-needed-enhanced-web-s/#comment-550014</link>

		<dc:creator><![CDATA[Greg Laden]]></dc:creator>
		<pubDate>Mon, 02 Nov 2009 14:51:47 +0000</pubDate>
		<guid isPermaLink="false">http://scienceblogs.com/gregladen/2009/11/02/software-needed-enhanced-web-s/#comment-550014</guid>

					<description><![CDATA[The pipes look interesting. I&#039;ve seen them before but forgot about them.

As far as dealing with HTML, that&#039;s fairly easy with the proper text-based web readers and sed, but there should be something in the Google API that will work.

The problem with the Google API might be that they change it now and then.]]></description>
			<content:encoded><![CDATA[<p>The pipes look interesting. I&#8217;ve seen them before but forgot about them.</p>
<p>As far as dealing with HTML, that&#8217;s fairly easy with the proper text-based web readers and sed, but there should be something in the Google API that will work.</p>
<p>The problem with the Google API might be that they change it now and then.</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: Gray Gaffer		</title>
		<link>https://gregladen.com/blog/2009/11/02/software-needed-enhanced-web-s/#comment-550013</link>

		<dc:creator><![CDATA[Gray Gaffer]]></dc:creator>
		<pubDate>Mon, 02 Nov 2009 14:39:46 +0000</pubDate>
		<guid isPermaLink="false">http://scienceblogs.com/gregladen/2009/11/02/software-needed-enhanced-web-s/#comment-550013</guid>

					<description><![CDATA[Yes, I know it is not a command line tool. That would take some more research, like - are Google search results available in a pure XML feed format instead of being wrapped in crippled, human-only visual HTML? If so, then there are CPAN XML modules that can be used along with the regex post-results filter. But if your desired end result is a web page, then Yahoo Pipes may do the trick for you.]]></description>
			<content:encoded><![CDATA[<p>Yes, I know it is not a command line tool. That would take some more research, like &#8211; are Google search results available in a pure XML feed format instead of being wrapped in crippled, human-only visual HTML? If so, then there are CPAN XML modules that can be used along with the regex post-results filter. But if your desired end result is a web page, then Yahoo Pipes may do the trick for you.</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: Gray Gaffer		</title>
		<link>https://gregladen.com/blog/2009/11/02/software-needed-enhanced-web-s/#comment-550012</link>

		<dc:creator><![CDATA[Gray Gaffer]]></dc:creator>
		<pubDate>Mon, 02 Nov 2009 14:36:19 +0000</pubDate>
		<guid isPermaLink="false">http://scienceblogs.com/gregladen/2009/11/02/software-needed-enhanced-web-s/#comment-550012</guid>

					<description><![CDATA[Interesting. But not a Perl one-liner because the results are not quite simple enough.

HOWEVER

I found this interesting tool you might want to check out:

Yahoo Pipes

http://pipes.yahoo.com/pipes/

It includes, among many other result filters, a regex tool, with examples that filter Google results.]]></description>
			<content:encoded><![CDATA[<p>Interesting. But not a Perl one-liner because the results are not quite simple enough.</p>
<p>HOWEVER</p>
<p>I found this interesting tool you might want to check out:</p>
<p>Yahoo Pipes</p>
<p><a href="http://pipes.yahoo.com/pipes/" rel="nofollow ugc">http://pipes.yahoo.com/pipes/</a></p>
<p>It includes, among many other result filters, a regex tool, with examples that filter Google results.</p>
]]></content:encoded>
		
			</item>
	</channel>
</rss>
