<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: ChemRefer And Robots.txt</title>
	<atom:link href="http://www.chemspider.com/open-chemistry-web/chemrefer-and-robotstxt.html/feed" rel="self" type="application/rss+xml" />
	<link>http://www.chemspider.com/open-chemistry-web/chemrefer-and-robotstxt.html</link>
	<description>The Project: Open and Integrated Structure and Text Search</description>
	<lastBuildDate>Sun, 19 Apr 2009 16:13:14 +0100</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: ChemSpider Blog &#187; Blog Archive &#187; Support from the Publishing Community for Text Indexing Scientific Articles</title>
		<link>http://www.chemspider.com/open-chemistry-web/chemrefer-and-robotstxt.html/comment-page-1#comment-129</link>
		<dc:creator>ChemSpider Blog &#187; Blog Archive &#187; Support from the Publishing Community for Text Indexing Scientific Articles</dc:creator>
		<pubDate>Thu, 06 Dec 2007 02:05:31 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/open-chemistry-web/?p=3#comment-129</guid>
		<description>[...] few weeks ago Will Griffiths blogged about robots.txt files for informing on indexing policies for a website and then later on our discussions with RSC. As a result of our experiences in the [...]</description>
		<content:encoded><![CDATA[<p>[...] few weeks ago Will Griffiths blogged about robots.txt files for informing on indexing policies for a website and then later on our discussions with RSC. As a result of our experiences in the [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Bradley</title>
		<link>http://www.chemspider.com/open-chemistry-web/chemrefer-and-robotstxt.html/comment-page-1#comment-8</link>
		<dc:creator>David Bradley</dc:creator>
		<pubDate>Tue, 16 Oct 2007 11:22:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/open-chemistry-web/?p=3#comment-8</guid>
		<description>As I understand it, robots.txt is taken by search engines to be nothing more than a set of suggestions. Several people in the SEO community report that SEs sometimes ignore robots.txt altogether and some spiders don&#039;t even have the ability to read it in the first place let alone respond to it. robots.txt is certainly not a legal document. If you post information on the web you must accept that it will be spidered. The only full-on way to stop a spider in its tracks is to ban its IP address from your server. But, that opens a minefield as some spiders have multiple IPs and some are dynamic, so what could be the IP address of a spider you wish to ban one day could be a potential visitor the next.</description>
		<content:encoded><![CDATA[<p>As I understand it, robots.txt is taken by search engines to be nothing more than a set of suggestions. Several people in the SEO community report that SEs sometimes ignore robots.txt altogether and some spiders don&#8217;t even have the ability to read it in the first place let alone respond to it. robots.txt is certainly not a legal document. If you post information on the web you must accept that it will be spidered. The only full-on way to stop a spider in its tracks is to ban its IP address from your server. But, that opens a minefield as some spiders have multiple IPs and some are dynamic, so what could be the IP address of a spider you wish to ban one day could be a potential visitor the next.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
