<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The Entire ChemSpider Database is On Its Way to PubChem!</title>
	<atom:link href="http://www.chemspider.com/blog/the-entire-chemspider-database-is-on-its-way-to-pubchem.html/feed" rel="self" type="application/rss+xml" />
	<link>http://www.chemspider.com/blog/the-entire-chemspider-database-is-on-its-way-to-pubchem.html</link>
	<description>Building Community for Chemists</description>
	<lastBuildDate>Tue, 21 May 2013 16:16:29 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5</generator>
	<item>
		<title>By: M Karthikeyan</title>
		<link>http://www.chemspider.com/blog/the-entire-chemspider-database-is-on-its-way-to-pubchem.html/comment-page-1#comment-8742</link>
		<dc:creator>M Karthikeyan</dc:creator>
		<pubDate>Tue, 11 Dec 2007 01:43:07 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=267#comment-8742</guid>
		<description><![CDATA[Last friday I did update the Pubchem data (FTP download of compounds) locally in a oracle database server using Jchem (chemaxon tools). my statistics (excluding the error structures deducted by Jchem) is reproduced below (total: 11,623,278) . Each row represent one million counts. Please note that 13-16 has low count of actual entries for every one million molecules expected. To my surprise today the total list of molecules available for download at Pubchem (compounds) increased to 23 millions. I am in the process of updating the local database for actual counts.

x10+6   Entries
1	863132
2	653990
3	755562
4	624029
5	961141
6	609013
7	601475
8	937742
9	985608
10	882349
11	998475
12	996339
13	11865
14	3499
15	2582
16	59521
17	920222
18	756734


any update on pubchem is welcome.
M.Karthikeyan]]></description>
		<content:encoded><![CDATA[<p>Last friday I did update the Pubchem data (FTP download of compounds) locally in a oracle database server using Jchem (chemaxon tools). my statistics (excluding the error structures deducted by Jchem) is reproduced below (total: 11,623,278) . Each row represent one million counts. Please note that 13-16 has low count of actual entries for every one million molecules expected. To my surprise today the total list of molecules available for download at Pubchem (compounds) increased to 23 millions. I am in the process of updating the local database for actual counts.</p>
<p>x10+6   Entries<br />
1	863132<br />
2	653990<br />
3	755562<br />
4	624029<br />
5	961141<br />
6	609013<br />
7	601475<br />
8	937742<br />
9	985608<br />
10	882349<br />
11	998475<br />
12	996339<br />
13	11865<br />
14	3499<br />
15	2582<br />
16	59521<br />
17	920222<br />
18	756734</p>
<p>any update on pubchem is welcome.<br />
M.Karthikeyan</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Unilever Centre for Molecular Informatics, Cambridge - petermr&#8217;s blog &#187; Blog Archive &#187; Chemspider and Pubchem - open data</title>
		<link>http://www.chemspider.com/blog/the-entire-chemspider-database-is-on-its-way-to-pubchem.html/comment-page-1#comment-7673</link>
		<dc:creator>Unilever Centre for Molecular Informatics, Cambridge - petermr&#8217;s blog &#187; Blog Archive &#187; Chemspider and Pubchem - open data</dc:creator>
		<pubDate>Fri, 30 Nov 2007 09:30:45 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=267#comment-7673</guid>
		<description><![CDATA[[...] ChemSpider Blog » Blog Archive » The Entire ChemSpider Database is On Its Way to PubChem! [...]]]></description>
		<content:encoded><![CDATA[<p>[...] ChemSpider Blog » Blog Archive » The Entire ChemSpider Database is On Its Way to PubChem! [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Antony Williams</title>
		<link>http://www.chemspider.com/blog/the-entire-chemspider-database-is-on-its-way-to-pubchem.html/comment-page-1#comment-7659</link>
		<dc:creator>Antony Williams</dc:creator>
		<pubDate>Fri, 30 Nov 2007 03:08:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=267#comment-7659</guid>
		<description><![CDATA[Rich,

I believe PubChem update on a fairly regular basis but they&#039;ve never dealt with over 17.5 M structures so I don&#039;t know how ling it will take them.

Regarding the linkage to patents - PubChem will link back to ChemSpider via the ChemSpider ID and Data Source connection. Since the SureChem patent structures are on ChemSpider users will land from PubChem onto ChemSpider and only determine the link to patents at that point. The link from PubChem to SureChem patents directly would come if SureChem chose to deposit their own structure collection directly.

Regarding &quot;This sets the bar very high for Open data in chemistry. I’m not sure what to call it, but it’s a game-changer.&quot; You&#039;ve likely seen over the past few months we&#039;ve been challenged on Open Access and Open Data. We&#039;re declaring neither. We&#039;re declaring Free Access. Our Web APIs have opened up access to certain data only. If people choose to scrape other data such as predicted data they are violating rights of other groups who have allowed us to use their algorithms. While SureChem remains free access then mashups of some form might be possible but would likely take a lot of work. 

Thanks for the compliments Rich..they are always welcome.]]></description>
		<content:encoded><![CDATA[<p>Rich,</p>
<p>I believe PubChem update on a fairly regular basis but they&#8217;ve never dealt with over 17.5 M structures so I don&#8217;t know how ling it will take them.</p>
<p>Regarding the linkage to patents &#8211; PubChem will link back to ChemSpider via the ChemSpider ID and Data Source connection. Since the SureChem patent structures are on ChemSpider users will land from PubChem onto ChemSpider and only determine the link to patents at that point. The link from PubChem to SureChem patents directly would come if SureChem chose to deposit their own structure collection directly.</p>
<p>Regarding &#8220;This sets the bar very high for Open data in chemistry. I’m not sure what to call it, but it’s a game-changer.&#8221; You&#8217;ve likely seen over the past few months we&#8217;ve been challenged on Open Access and Open Data. We&#8217;re declaring neither. We&#8217;re declaring Free Access. Our Web APIs have opened up access to certain data only. If people choose to scrape other data such as predicted data they are violating rights of other groups who have allowed us to use their algorithms. While SureChem remains free access then mashups of some form might be possible but would likely take a lot of work. </p>
<p>Thanks for the compliments Rich..they are always welcome.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rich Apodaca</title>
		<link>http://www.chemspider.com/blog/the-entire-chemspider-database-is-on-its-way-to-pubchem.html/comment-page-1#comment-7612</link>
		<dc:creator>Rich Apodaca</dc:creator>
		<pubDate>Thu, 29 Nov 2007 16:10:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=267#comment-7612</guid>
		<description><![CDATA[Antony,

This is great news. Any word on when the first structures will be appearing? How frequently will PubChem be updated with new ChemSpider content?

Also, through what mechanism will individual PubChem compounds be linked into patents and the primary literature?

For example, I&#039;m familiar with PubChem Substance records. Two fields that allow URLS are:

PUBCHEM_EXT_DATASOURCE_URL
PUBCHEM_EXT_SUBSTANCE_URL

So one way would be for Substance records to link back to the appropriate ChemSpider compound summary page. Another would be for ChemSpider compound IDs to be added to the PUBCHEM_EXT_DATASOURCE_REGID field. Or will another mechanism be used?

Regardless of how exactly linkage occurs, the end result would be that any third party could, independently of ChemSpider, reconstruct the entire ChemSpider compound database. By using the ChemSpider Web APIs, they could develop a parallel service that re-processes the ChemSpider analytical data and patent/primary literature data, possibly mashing up the data from other sources as well.

This sets the bar very high for Open data in chemistry. I&#039;m not sure what to call it, but it&#039;s a game-changer.

Great work!]]></description>
		<content:encoded><![CDATA[<p>Antony,</p>
<p>This is great news. Any word on when the first structures will be appearing? How frequently will PubChem be updated with new ChemSpider content?</p>
<p>Also, through what mechanism will individual PubChem compounds be linked into patents and the primary literature?</p>
<p>For example, I&#8217;m familiar with PubChem Substance records. Two fields that allow URLS are:</p>
<p>PUBCHEM_EXT_DATASOURCE_URL<br />
PUBCHEM_EXT_SUBSTANCE_URL</p>
<p>So one way would be for Substance records to link back to the appropriate ChemSpider compound summary page. Another would be for ChemSpider compound IDs to be added to the PUBCHEM_EXT_DATASOURCE_REGID field. Or will another mechanism be used?</p>
<p>Regardless of how exactly linkage occurs, the end result would be that any third party could, independently of ChemSpider, reconstruct the entire ChemSpider compound database. By using the ChemSpider Web APIs, they could develop a parallel service that re-processes the ChemSpider analytical data and patent/primary literature data, possibly mashing up the data from other sources as well.</p>
<p>This sets the bar very high for Open data in chemistry. I&#8217;m not sure what to call it, but it&#8217;s a game-changer.</p>
<p>Great work!</p>
]]></content:encoded>
	</item>
</channel>
</rss>
