<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>ChemSpider Blog &#187; Quality and Content</title>
	<atom:link href="http://www.chemspider.com/blog/category/improving-the-quality-and-content-on-chemspider/feed" rel="self" type="application/rss+xml" />
	<link>http://www.chemspider.com/blog</link>
	<description>Building Community for Chemists</description>
	<lastBuildDate>Thu, 18 Mar 2010 11:08:41 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Integrated RSC Publishing New ChemSpider Functionality at ACS Spring 2010 Part 5</title>
		<link>http://www.chemspider.com/blog/integrated-rsc-publishing-new-chemspider-functionality-at-acs-spring-2010-part-5.html</link>
		<comments>http://www.chemspider.com/blog/integrated-rsc-publishing-new-chemspider-functionality-at-acs-spring-2010-part-5.html#comments</comments>
		<pubDate>Wed, 17 Mar 2010 03:10:30 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[How ChemSpider Runs]]></category>
		<category><![CDATA[Quality and Content]]></category>
		<category><![CDATA[RSC Publishing]]></category>
		<category><![CDATA[ChemSpider]]></category>
		<category><![CDATA[Google Books]]></category>
		<category><![CDATA[Google Scholar]]></category>
		<category><![CDATA[Microsoft Academic Search]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1665</guid>
		<description><![CDATA[The functionality discussed below will be released at the ACS   Spring Meeting during the week of March 21st 2010
Following on from the last post regarding integrating to RSC Databases via the RSC Publishing Beta web services layer this post expands on the nature of the integration that we have been able to introduce. [...]]]></description>
			<content:encoded><![CDATA[<p><strong>The functionality discussed below will be released at the ACS   Spring Meeting during the week of March 21st 2010</strong></p>
<p>Following on from the <a href="http://www.chemspider.com/blog/integrated-rsc-databases-new-chemspider-functionality-at-acs-spring-2010-part-4.html">last post regarding integrating to RSC Databases</a> via the RSC Publishing Beta web services layer this post expands on the nature of the integration that we have been able to introduce. The RSC publishing beta gives us access to over 500,000 journal articles, book chapters and database records through  one simple search interface. Using a similar approach to that outlined for the RSC database searches, that of using validated synonyms as the basis of the search for chemicals, we are able to search across the entire ePlatform of articles and retrieve hits as shown below. The hits are under the RSC journals tab.</p>
<p>Since the RSC publishing platform segregates the journals from the books the same search will return results from RSC books also. Our tests show that this is incredibly fast and highly accurate. This is our first venture into tapping into the chemical compounds sitting inside the RSC archive. More work is coming&#8230;</p>
<p>If you look at the tabs below you will also see that we have integrated to Google Books, Google Scholar and the Microsoft Academic Search. We are truly integrating to available internet resources to bring together the benefits of all of the primary search engines available.</p>
<p><img class="aligncenter size-full wp-image-1667" title="eplatform" src="http://www.chemspider.com/blog/wp-content/uploads/2010/03/eplatform.png" alt="eplatform" width="644" height="445" /></p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fintegrated-rsc-publishing-new-chemspider-functionality-at-acs-spring-2010-part-5.html';
  addthis_title  = 'Integrated+RSC+Publishing+New+ChemSpider+Functionality+at+ACS+Spring+2010+Part+5';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for Integrated RSC Publishing New ChemSpider Functionality at ACS Spring 2010 Part 5 " /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+Integrated+RSC+Publishing+New+ChemSpider+Functionality+at+ACS+Spring+2010+Part+5+" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/integrated-rsc-publishing-new-chemspider-functionality-at-acs-spring-2010-part-5.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Integrated RSC Databases New ChemSpider Functionality at ACS Spring 2010 Part 4</title>
		<link>http://www.chemspider.com/blog/integrated-rsc-databases-new-chemspider-functionality-at-acs-spring-2010-part-4.html</link>
		<comments>http://www.chemspider.com/blog/integrated-rsc-databases-new-chemspider-functionality-at-acs-spring-2010-part-4.html#comments</comments>
		<pubDate>Tue, 16 Mar 2010 19:07:45 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[ChemSpider Syntheses]]></category>
		<category><![CDATA[ChemSpider SyntheticPages]]></category>
		<category><![CDATA[How ChemSpider Runs]]></category>
		<category><![CDATA[Quality and Content]]></category>
		<category><![CDATA[RSC Publishing]]></category>
		<category><![CDATA[ChemSpider]]></category>
		<category><![CDATA[Lab Hazards Bulletin]]></category>
		<category><![CDATA[Mass Spectrometry Bulletin]]></category>
		<category><![CDATA[Methods in Organic Synthesis]]></category>
		<category><![CDATA[Natural Products Update]]></category>
		<category><![CDATA[Royal Society of Chemistry]]></category>
		<category><![CDATA[RSC]]></category>
		<category><![CDATA[RSC databases]]></category>
		<category><![CDATA[RSC Publishing Beta]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1660</guid>
		<description><![CDATA[The functionality discussed below will be released at the ACS  Spring Meeting during the week of March 21st 2010
The Royal Society of Chemistry has a whole series of databases. None of them have been structure searchable&#8230;until now. As with our PubMed integration and our Google Patents integration rolling out shortly, just because a database [...]]]></description>
			<content:encoded><![CDATA[<p><strong>The functionality discussed below will be released at the ACS  Spring Meeting during the week of March 21st 2010</strong></p>
<p>The Royal Society of Chemistry has a whole series of databases. None of them have been structure searchable&#8230;until now. As with our <a href="http://www.chemspider.com/blog/new-functionality-in-the-world-of-chemspider-part-2-improved-pubmed-integration.html">PubMed integration</a> and our <a href="http://www.chemspider.com/blog/google-patents-new-chemspider-functionality-at-acs-spring-2010-part-3.html">Google Patents integration</a> rolling out shortly, just because a database hasn&#8217;t had the chemical structures extracted and indexed doesn&#8217;t mean that those resources cannot be made &#8220;structure searchable&#8221;. It&#8217;s not a subtle distinction however, as discussed in the <a href="http://www.chemspider.com/blog/google-patents-new-chemspider-functionality-at-acs-spring-2010-part-3.html">Google Patents blog post</a>. These types of integrations depend on the correct association between chemical names and structures, access to an API allowing facile and flexible searching and, something that is purely serendipitous in nature, the<strong> absence</strong> of overlaps between chemical names and common language.</p>
<p>We have used the recently announced <a href="http://pubs.rsc.org/blog/prospect/post/2010/03/09/RSC-Publishing-beta-platform-is-now-live.aspx">RSC Publishing beta platform</a> and the API made available to us to enable the searching. As my colleague Graham McCann announced recently &#8220;(the) platform gives  access to over 500,000 journal articles, book chapters and database  records through one simple search interface. The new platform delivers  faster browsing, intelligent searching and more intuitive navigation and  is open for beta testing now.&#8221;</p>
<p>Our approach has been to search the title and the abstract for each of the databases for all of the validated identifiers. It works. It is FAST and it provides &#8220;structure-related&#8221; access to all six RSC databases. An example screen shot is below where a search on chlorobenzene retrieves data on each of the following databases: <a href="http://www.rsc.org/Publishing/CurrentAwareness/msb/index.asp">Mass Spectrometry Bulletin</a>, <a href="http://www.rsc.org/publishing/currentawareness/lhb/index.asp">Laboratory Hazards Bulletin</a>, <a href="http://www.rsc.org/publishing/currentawareness/mos/index.asp">Methods in Organic Synthesis</a>, <a href="http://www.rsc.org/publishing/currentawareness/CCR/">Catalysts and Catalysed Reactions</a>, <a href="http://www.rsc.org/publishing/currentawareness/npu/">Natural Product Updates</a> and <a href="http://www.rsc.org/publishing/currentawareness/aa/index.asp">Analytical Abstracts</a>. The screen shot below shows the analytical abstracts linked by the term chlorobenzene in the title or abstract itself. 284 hits..in a fraction of a second. The abstract is linked out to the original article via DOI, where possible.</p>
<p><img class="aligncenter size-full wp-image-1662" title="databases" src="http://www.chemspider.com/blog/wp-content/uploads/2010/03/databases.png" alt="databases" width="680" height="334" /></p>
<p>My personal favorites in the set of databases are the <a href="http://www.rsc.org/publishing/currentawareness/npu/">Natural Product Updates</a> (NPU) and the <a href="http://www.rsc.org/publishing/currentawareness/mos/index.asp">Methods in Organic Synthesis</a> (MOS) databases. The NPU database contains tens of thousands of natural product chemical structures, together with chemical names, references and some physical properties. Rich resources for ChemSpider. MOS includes includes reaction schemes, title and bibliographic details. Rich resources to connect to ChemSpider SyntheticPages in the future.</p>
<p>We have only just started to tap into the riches contained within the RSC archive. It&#8217;s like stumbling across a roomful of rubies to pick up diamonds. There is content all around us waiting for us to connect. We will connect this up to ChemSpider and make it available. Access to the databases will be shown at the ACS Meeting in San Francisco.</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fintegrated-rsc-databases-new-chemspider-functionality-at-acs-spring-2010-part-4.html';
  addthis_title  = 'Integrated+RSC+Databases+New+ChemSpider+Functionality+at+ACS+Spring+2010+Part+4';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for Integrated RSC Databases New ChemSpider Functionality at ACS Spring 2010 Part 4 " /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+Integrated+RSC+Databases+New+ChemSpider+Functionality+at+ACS+Spring+2010+Part+4+" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/integrated-rsc-databases-new-chemspider-functionality-at-acs-spring-2010-part-4.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New ChemSpider Functionality at ACS Spring 2010 Part 2 NMR Prediction</title>
		<link>http://www.chemspider.com/blog/new-chemspider-functionality-at-acs-spring-2010-part-2-nmr-prediction.html</link>
		<comments>http://www.chemspider.com/blog/new-chemspider-functionality-at-acs-spring-2010-part-2-nmr-prediction.html#comments</comments>
		<pubDate>Tue, 16 Mar 2010 16:41:03 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[Community Building]]></category>
		<category><![CDATA[How ChemSpider Runs]]></category>
		<category><![CDATA[Quality and Content]]></category>
		<category><![CDATA[ChemSpider]]></category>
		<category><![CDATA[NMR databases]]></category>
		<category><![CDATA[NMR Prediction]]></category>
		<category><![CDATA[NMRShiftDB]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1641</guid>
		<description><![CDATA[The functionality discussed below will be released at the ACS Spring Meeting during the week of March 21st 2010
We had previously released NMR prediction on ChemSpider as announced here. Based on community feedback we later removed that connection and had never reconnected, despite reported improvements. I am an NMR spectroscopist by training &#8230;if you check [...]]]></description>
			<content:encoded><![CDATA[<p><strong>The functionality discussed below will be released at the ACS Spring Meeting during the week of March 21st 2010</strong></p>
<p>We had previously released NMR prediction on ChemSpider as announced <a href="http://www.chemspider.com/blog/nmr-prediction-now-available-via-chemspider.html">here</a>. Based on community <a href="http://www.chemspider.com/blog/feedback-on-nmr-prediction-on-chemspider.html">feedback</a> we later <a href="http://www.chemspider.com/blog/removal-of-nmr-prediction-from-chemspider.html">removed that connection</a> and had never reconnected, despite reported <a href="http://www.chemspider.com/blog/nmrdborg-nmr-predictor-already-improved.html">improvements</a>. I am an NMR spectroscopist by training &#8230;if you check out my <a href="http://www.mendeley.com/profiles/antony-williams/">Mendeley profile</a> you&#8217;ll see that the majority of my papers are NMR-based. Because I am an NMR jock, and despite working in cheminformatics I do keep my hands in NMR research (NMR prediction and computer assisted structure elucidation) I really wanted to make sure that we deliver NMR prediction via ChemSpider. I was involved with the development of the <a href="http://www.acdlabs.com/products/adh/nmr/nmr_pred/">ACD/Labs NMR prediction</a> tools for H1, C13, N15, F19 and P31 nuclei. There are a number of other NMR prediction modules on the market including those of Bio-Rad (in the <a href="http://www.knowitall.com/academic/welcome.asp">Know-It-All package</a>), <a href="http://www.modgraph.co.uk/product_nmr.htm">Modgraph</a> and certainly the work of <a href="http://homepage.univie.ac.at/Wolfgang.Robien/index.html">Wolfgang Robien</a>, one of the founding fathers of NMR prediction. These are primarily commercial packages.</p>
<p>In the background we have been working on the introduction of NMR prediction to ChemSpider in time for the ACS. We were looking for a platform that we could integrate that involved community deposition of data to ensure there was a growing database to enhance the prediction algorithms. We also wanted to know that the underlying data quality was good. We wanted to integrate to an Open system that had support from both an active community of participants as well as at least one developer who could provide support if we needed it. All of these criteria point to only one resource, <a href="http://www.ebi.ac.uk/nmrshiftdb/">NMRShiftDB</a>. There have been some heated discussions,<a href="http://www.chemspider.com/blog/further-comments-on-the-quality-of-nmrshiftdb-and-nmr-prediction-algorithm-validation.html"> including on this blog, regarding data quality</a>, especially in NMRShiftDB. However, I <a href="http://pubs.acs.org/doi/abs/10.1021/ci700363r">co-authored a paper</a> with Chris Steinbeck and colleagues from ACD/Labs validating the dataset as well as ACD/Labs&#8217; NMR prediction approaches.</p>
<p>NMRShiftDB is a high quality data set and certainly contains enough data to provide a training set for NMR prediction algorithms. The NMR predictions provided by NMRShiftDB are used by many people and overall feedback seems to be very positive.  Based on our previous knowledge of the data in NMRShiftDB, and the availability of a well defined programming interface to connect ChemSpider, we have worked with Stefan Kuhn at the EBI to produce a first level integration.</p>
<p>As a result at the ACS meeting in San Francisco next week we will roll out NMR prediction integration. In keeping with the new layout model we have adopted for ChemSpider using <a href="http://www.chemspider.com/blog/new-chemspider-functionality-at-acs-spring-2010-part-1-tabbed-infoboxes.html">tabbed approaches</a> for display of data, we have bundled together all predictions. The first ACD/Labs tab provides access to ACD/Labs PhysChem properties, the EPI Summary provides access to the EPISuite and the NMRShiftDB provides access to the predicted NMR spectra. The left spectrum shows the Proton NMR spectrum and the right spectrum shows the C13 NMR spectrum.</p>
<p><img class="aligncenter size-full wp-image-1642" title="NMRshiftDB" src="http://www.chemspider.com/blog/wp-content/uploads/2010/03/NMRshiftDB.png" alt="NMRshiftDB" width="471" height="251" /></p>
<p>When the system is<strong> fully </strong>integrated the process will work as follows. Since NMRShiftDB already contains many thousands of assigned spectra we will retrieve the experimentally assigned spectra directly and display them. When we cannot retrieve the experimental spectra then we will predict the NMR spectra and display them.</p>
<p>In the future we <em>might </em>pre-predict and store the NMR spectra for all structures on the NMR database. I am a little leery of doing this at present as we need to gather some basic feedback from the ChemSpider users regarding the performance of the NMR prediction algorithms and our existing implementation. In terms of predicting NMR spectra across a database of this size then a lot of consideration has to be given to domain applicability..i.e, what subset of structures should be excluded from having NMR predictions performed? For example, organometallic complexes, free radicals etc. CAS likely had to take this type of issue into account when they <a href="http://www.cas.org/newsevents/releases/protonnmr122908.html">applied NMR predictions to their CAS registry</a>.</p>
<p>If there are other NMR prediction algorithms or databases that you would be interested in integrating into ChemSpider please contact me. If you are a cheminformatics vendor selling NMR predictions/databases we would be VERY interested in receiving JUST the structures from your NMR databases. We will deposit them and link directly to your product page as an indicator that you have NMR data available.</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fnew-chemspider-functionality-at-acs-spring-2010-part-2-nmr-prediction.html';
  addthis_title  = 'New+ChemSpider+Functionality+at+ACS+Spring+2010+Part+2+NMR+Prediction';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for New ChemSpider Functionality at ACS Spring 2010 Part 2 NMR Prediction" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+New+ChemSpider+Functionality+at+ACS+Spring+2010+Part+2+NMR+Prediction" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/new-chemspider-functionality-at-acs-spring-2010-part-2-nmr-prediction.html/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>How Much New Chemistry Is There In New RSC Articles</title>
		<link>http://www.chemspider.com/blog/how-much-new-chemistry-is-there-in-new-rsc-articles.html</link>
		<comments>http://www.chemspider.com/blog/how-much-new-chemistry-is-there-in-new-rsc-articles.html#comments</comments>
		<pubDate>Thu, 11 Mar 2010 19:36:55 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[Quality and Content]]></category>
		<category><![CDATA[RSC Publishing]]></category>
		<category><![CDATA[ChemSpider]]></category>
		<category><![CDATA[Project Prospect]]></category>
		<category><![CDATA[RSC]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1635</guid>
		<description><![CDATA[From the early days of the acquisition of ChemSpider by the RSC we have been focused on accessing the rich content that the RSC has contained in its databases and in its rich archive. We have been working hard for a number of months now to integrate systems, projects and processes into ChemSpider so that [...]]]></description>
			<content:encoded><![CDATA[<p>From the early days of the acquisition of ChemSpider by the RSC we have been focused on accessing the rich content that the RSC has contained in its databases and in its rich archive. We have been working hard for a number of months now to integrate systems, projects and processes into ChemSpider so that RSC chemistry is more discoverable. What we will be unveiling in the next few days we believe is big. We&#8217;ll roll it out one piece at a time. The<a href="http://www.chemspider.com/blog/first-deposited-compounds-from-rsc-prospect-in-chemspider.html"> last blog post</a> discussed the deposition of new compounds from RSC prospected articles into ChemSpider. The email below results from the deposition of compounds from one article. One set of 10 structures from one article that are directly deposited into ChemSpider when the article goes live. These are compounds that are deposited and live immediately, not abstracted later. Imagine when we are doing this for all RSC articles, database and books&#8230;.</p>
<p>ALL of the compounds below are NEW to the ChemSpider database&#8230;everyone of them. While not all RSC articles are only about novel compounds clearly there are new compounds moving into the database from the RSC publications.</p>
<p>Dear RSC Prospect,</p>
<p>This email is to notify that your deposition (#3427) has been published. Below please find a list of links to the structures that belong to your deposition:</p>
<p><a href="../../Chemical-Structure.23558982.html">http://www.chemspider.com/Chemical-Structure.23558982.html</a></p>
<p><a href="../../Chemical-Structure.23558983.html">http://www.chemspider.com/Chemical-Structure.23558983.html</a></p>
<p><a href="../../Chemical-Structure.23558984.html">http://www.chemspider.com/Chemical-Structure.23558984.html</a></p>
<p><a href="../../Chemical-Structure.23558985.html">http://www.chemspider.com/Chemical-Structure.23558985.html</a></p>
<p><a href="../../Chemical-Structure.23558986.html">http://www.chemspider.com/Chemical-Structure.23558986.html</a></p>
<p><a href="../../Chemical-Structure.23558987.html">http://www.chemspider.com/Chemical-Structure.23558987.html</a></p>
<p><a href="../../Chemical-Structure.23558988.html">http://www.chemspider.com/Chemical-Structure.23558988.html</a></p>
<p><a href="../../Chemical-Structure.23558989.html">http://www.chemspider.com/Chemical-Structure.23558989.html</a></p>
<p><a href="../../Chemical-Structure.23558990.html">http://www.chemspider.com/Chemical-Structure.23558990.html</a></p>
<p><a href="../../Chemical-Structure.23558991.html">http://www.chemspider.com/Chemical-Structure.23558991.html</a></p>
<p>Cheers,</p>
<p>ChemSpider</p>
<p>The structures link back directly to the RSC article via DOI as shown  below.</p>
<p><img class="aligncenter size-full wp-image-1636" title="Prospectedarticle" src="http://www.chemspider.com/blog/wp-content/uploads/2010/03/Prospectedarticle.png" alt="Prospectedarticle" width="536" height="146" /></p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fhow-much-new-chemistry-is-there-in-new-rsc-articles.html';
  addthis_title  = 'How+Much+New+Chemistry+Is+There+In+New+RSC+Articles';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for How Much New Chemistry Is There In New RSC Articles" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+How+Much+New+Chemistry+Is+There+In+New+RSC+Articles" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/how-much-new-chemistry-is-there-in-new-rsc-articles.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>First deposited compounds from RSC Prospect in ChemSpider</title>
		<link>http://www.chemspider.com/blog/first-deposited-compounds-from-rsc-prospect-in-chemspider.html</link>
		<comments>http://www.chemspider.com/blog/first-deposited-compounds-from-rsc-prospect-in-chemspider.html#comments</comments>
		<pubDate>Thu, 11 Mar 2010 18:16:53 +0000</pubDate>
		<dc:creator>Aileen Day</dc:creator>
				<category><![CDATA[How ChemSpider Runs]]></category>
		<category><![CDATA[Quality and Content]]></category>
		<category><![CDATA[RSC Publishing]]></category>
		<category><![CDATA[ChemSpider]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1632</guid>
		<description><![CDATA[We&#8217;ve taken the first step towards user being able to seamlessly bounce back and forth between finding compounds of interest using the ChemSpider search and selection tools and finding more information about them in RSC journals&#8230;
I&#8217;m pleased to announce that we&#8217;ve just switched on a deposition system which will take compounds from the prospected version of RSC articles as they [...]]]></description>
			<content:encoded><![CDATA[<p>We&#8217;ve taken the first step towards user being able to seamlessly bounce back and forth between finding compounds of interest using the ChemSpider search and selection tools and finding more information about them in RSC journals&#8230;</p>
<p>I&#8217;m pleased to announce that we&#8217;ve just switched on a deposition system which will take compounds from the prospected version of RSC articles as they are published and automatically deposit them into ChemSpider, making a link back to the original article from the new compound page. An example of a new compound is <a href="http://www.chemspider.com/Chemical-Structure.23558980.html">here</a> which was generated when <a href="http://dx.doi.org/10.1039/b926824c">this article</a> was prospected. The same deposition process is used to make links from existing ChemSpider compounds to new RSC articles, for example <a href="http://www.chemspider.com/Chemical-Structure.5236.html">here</a> was generated when <a href="http://dx.doi.org/10.1039/b922955h">this article</a> was published.</p>
<p>This is basically a way to stick our toe in the water to investigate how much intervention and cleaning is necessary to deposit compounds when all the information that we have been storing for them is the InChI without any 2D layout information (which is an issue that other potential data sources may also face too).  To do this we&#8217;ve been making use of the ChemSpider webservices <a href="http://www.chemspider.com/InChI.asmx">http://www.chemspider.com/InChI.asmx</a> to download the mol files of InChIs already in ChemSpider, or using the InChItoMol webservice to generate new mol files where they don&#8217;t exist already.  Tracking and fixing problems as they crop up at this manageable rate will help us when we face the larger task of importing all of the compounds that have been prospected in the past into ChemSpider.</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Ffirst-deposited-compounds-from-rsc-prospect-in-chemspider.html';
  addthis_title  = 'First+deposited+compounds+from+RSC+Prospect+in+ChemSpider';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for First deposited compounds from RSC Prospect in ChemSpider" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+First+deposited+compounds+from+RSC+Prospect+in+ChemSpider" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/first-deposited-compounds-from-rsc-prospect-in-chemspider.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>New Functionality in the World of ChemSpider: Part 5 Statistics of Deposition</title>
		<link>http://www.chemspider.com/blog/new-functionality-in-the-world-of-chemspider-part-5-statistics-of-deposition.html</link>
		<comments>http://www.chemspider.com/blog/new-functionality-in-the-world-of-chemspider-part-5-statistics-of-deposition.html#comments</comments>
		<pubDate>Wed, 23 Dec 2009 04:45:52 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[How ChemSpider Runs]]></category>
		<category><![CDATA[Quality and Content]]></category>
		<category><![CDATA[ChemSpider]]></category>
		<category><![CDATA[Online spectral data]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1530</guid>
		<description><![CDATA[In the recent rollout of functionality we added to the home page statistics regarding the number of various types of spectra that have been added to ChemSpider as well as updates of new data associated with data sources. We will likely optimize these displayed further in the future but this is an initial display for [...]]]></description>
			<content:encoded><![CDATA[<p>In the recent rollout of functionality we added to the home page statistics regarding the number of various types of spectra that have been added to ChemSpider as well as updates of new data associated with data sources. We will likely optimize these displayed further in the future but this is an initial display for the time being. It&#8217;s rather impressive how many different types of 2D NMR data are being uploaded to the database.</p>
<p><img class="aligncenter size-full wp-image-1531" title="statistics" src="http://www.chemspider.com/blog/wp-content/uploads/2009/12/statistics.png" alt="statistics" width="669" height="447" /></p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fnew-functionality-in-the-world-of-chemspider-part-5-statistics-of-deposition.html';
  addthis_title  = 'New+Functionality+in+the+World+of+ChemSpider%3A+Part+5+Statistics+of+Deposition';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for New Functionality in the World of ChemSpider: Part 5 Statistics of Deposition" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+New+Functionality+in+the+World+of+ChemSpider:+Part+5+Statistics+of+Deposition" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/new-functionality-in-the-world-of-chemspider-part-5-statistics-of-deposition.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Integration to QSAR World and Aggregating Data Linking</title>
		<link>http://www.chemspider.com/blog/integration-to-qsar-world-and-aggregating-data-linking.html</link>
		<comments>http://www.chemspider.com/blog/integration-to-qsar-world-and-aggregating-data-linking.html#comments</comments>
		<pubDate>Tue, 27 Oct 2009 19:18:33 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[Community Building]]></category>
		<category><![CDATA[Quality and Content]]></category>
		<category><![CDATA[ChemSpider]]></category>
		<category><![CDATA[QSAR data]]></category>
		<category><![CDATA[QSAR World]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1483</guid>
		<description><![CDATA[Following on from my earlier post regarding our interest in aggregating physicochemical data for other groups to use in building their models and algorithms we announce that we are now depositing the data from QSAR world into ChemSpider and pointing back to the original sources on QSAR World. We harvest the SDF files, deposit onto [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-full wp-image-1484" title="QSAR world" src="http://www.chemspider.com/blog/wp-content/uploads/2009/10/QSAR-world.png" alt="QSAR world" width="173" height="78" />Following on from my earlier post regarding our interest in aggregating physicochemical data for other groups to use in building their models and algorithms we announce that we are now depositing the data from QSAR world into ChemSpider and pointing back to the original sources on QSAR World. We harvest the SDF files, deposit onto ChemSpider and provide direct links into the original SDF file, with the appropriate titles, so that our users can proceed to gather the data for re-analysis if they find it of interest. An example record is here for <a href="http://www.chemspider.com/Chemical-Structure.10482034.html">Atovaquone</a> where we list the links to data residing on QSAR world for download. The links can be seen under the supplemental information section as shown below where you can see links to seven different types of data. We have chosen, for the time being, to not deposit the values associated with these data onto ChemSpider as the data are very heterogeneous in representation even though they are all delivered as SDF files.</p>
<p><img class="aligncenter size-full wp-image-1485" title="supplemental information" src="http://www.chemspider.com/blog/wp-content/uploads/2009/10/supplemental-information.png" alt="supplemental information" width="357" height="229" /></p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fintegration-to-qsar-world-and-aggregating-data-linking.html';
  addthis_title  = 'Integration+to+QSAR+World+and+Aggregating+Data+Linking';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for Integration to QSAR World and Aggregating Data Linking " /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+Integration+to+QSAR+World+and+Aggregating+Data+Linking+" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/integration-to-qsar-world-and-aggregating-data-linking.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Enormous Jump in Number of Integrated Links from Wikipedia to ChemSpider</title>
		<link>http://www.chemspider.com/blog/enormous-jump-in-number-of-integrated-links-from-wikipedia-to-chemspider.html</link>
		<comments>http://www.chemspider.com/blog/enormous-jump-in-number-of-integrated-links-from-wikipedia-to-chemspider.html#comments</comments>
		<pubDate>Tue, 27 Oct 2009 17:12:47 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[ChemSpider Chemistry]]></category>
		<category><![CDATA[Community Building]]></category>
		<category><![CDATA[Quality and Content]]></category>
		<category><![CDATA[WiChempedia]]></category>
		<category><![CDATA[Wikipedia Services]]></category>
		<category><![CDATA[ChemSpider]]></category>
		<category><![CDATA[Linked data]]></category>
		<category><![CDATA[Quality in Chemistry]]></category>
		<category><![CDATA[Semantic Chemistry]]></category>
		<category><![CDATA[Wikipedia Chemistry]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1474</guid>
		<description><![CDATA[As an active member of the Wikipedia Chemistry team I continue to be impressed with the dedication and commitment that the members have to improving the quality AND quantity of information available on Wikipedia for chemists. The number of lost hours of sleep freely given to the benefit of Wikipedia, and in this specific case [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-full wp-image-1475" title="roadrunner" src="http://www.chemspider.com/blog/wp-content/uploads/2009/10/roadrunner.png" alt="roadrunner" width="180" height="136" />As an active member of the Wikipedia Chemistry team I continue to be impressed with the dedication and commitment that the members have to improving the quality AND quantity of information available on Wikipedia for chemists. The number of lost hours of sleep freely given to the benefit of Wikipedia, and in this specific case to the chemistry community, is immense. The number of &#8220;Compound Pages&#8221; on Wikipedia dedicated to drugs/chemicals has continued to grow and, despite a sincere effort on our part to keep everything linked up from ChemSpider to Wikipedia it&#8217;s a little like chasing the Road Runner&#8230;.we&#8217;re always behind!</p>
<p>We have been working with the WikiChem team of late to embed links from Wikipedia back to ChemSpider. I am humbled to know that our hard work to establish ChemSpider as a source of quality information has reached a level of trust such that Wikipedia now links from the ChemBoxes out to ChemSpider. The links are being updated on an on going basis at present with hundreds of new links already established and more being generated on an ongoing basis. <a href="http://en.wikipedia.org/wiki/User:Beetstra">Wikipedia User: Beetstra</a> has written a &#8216;bot that is inserting ChemSpiderIDs across the database (see below) and we ARE doing rigorous checking of all of the links.This was using a file that we generated on our side showing links to Wikipedia from ChemSpider.</p>
<p><img class="aligncenter size-full wp-image-1476" title="beetstra" src="http://www.chemspider.com/blog/wp-content/uploads/2009/10/beetstra.png" alt="beetstra" width="484" height="258" /></p>
<p>We will then be able to generate a list of all ChemBoxes/DrugBoxes without links from Wikipedia to ChemSpider and we will then make the links on our side, manually curating the structures, and then hand back a file to finish all linking. At this point we will have the backfile under control and we can perform ongoing updates as new compound pages are created on ChemSpider and, if we curate and find errors on Wikipedia or ChemSpider making a few manual edits is easy.</p>
<p>There are very dedicated teams on Wikipedia and ChemSpider carefully poring over data with their robots and eyeballs to create a linked data set of quality chemistry. It&#8217;s long, tedious AND important work. When its done we will have an expanded set of data to semantically link from RSC articles when we do markup.</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fenormous-jump-in-number-of-integrated-links-from-wikipedia-to-chemspider.html';
  addthis_title  = 'Enormous+Jump+in+Number+of+Integrated+Links+from+Wikipedia+to+ChemSpider';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for Enormous Jump in Number of Integrated Links from Wikipedia to ChemSpider" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+Enormous+Jump+in+Number+of+Integrated+Links+from+Wikipedia+to+ChemSpider" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/enormous-jump-in-number-of-integrated-links-from-wikipedia-to-chemspider.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Thieme Announce Access to DOI&#8217;ed Primary Analytical Data Available</title>
		<link>http://www.chemspider.com/blog/thieme-announce-access-to-doied-primary-analytical-data-available.html</link>
		<comments>http://www.chemspider.com/blog/thieme-announce-access-to-doied-primary-analytical-data-available.html#comments</comments>
		<pubDate>Tue, 27 Oct 2009 01:26:37 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[Community Building]]></category>
		<category><![CDATA[Quality and Content]]></category>
		<category><![CDATA[Analytical Data]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Thieme]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1466</guid>
		<description><![CDATA[Today I received an email via the CHMINF list server pointing to the following Press Release. Part of the press release is shown here:
&#8220;In collaboration with the German National Library of Science and Technology (TIB) Thieme is the first publisher to make primary chemistry data accessible worldwide. Analytical data, from various experiments, is the foundation [...]]]></description>
			<content:encoded><![CDATA[<p>Today I received an email via the CHMINF list server pointing to the following <a href="http://www.thieme.de/connect/en/services/press-release/tc_oct_06_09.html">Press Release</a>. Part of the press release is shown here:</p>
<p>&#8220;In collaboration with the German National Library of Science and Technology (TIB) Thieme is the first publisher to make primary chemistry data accessible worldwide. Analytical data, from various experiments, is the foundation of research work and scientific papers. From now on, primary data will be registered and made available online via the Thieme eJournals website (<a href="http://www.thieme-connect.com/ejournals">www.thieme-connect.com/ejournals</a>) using digital object recognition in the form of Digital Object Identifiers (DOI). This will enable scientists to easily locate research articles, including accompanying data, and make enhanced use of the scientific content.&#8221;</p>
<p>There has been a lot of discussion over the years regarding making available &#8220;primary data&#8221;. We offered to do this on the ChemSpider Journal of Chemistry : if people wanted to submit analytical data with an article that we published then we would post them as spectra associated with the article. Unfortunately the general consensus based on a few conversations that I had is that it is a lot of work to prepare data and deposit it. This is one of the reasons that, until now, publishers have generally made the spectral data available as plots and printouts of the data. These data are generally made available as electronic supplementary data. These data ARE valuable even in that form but, and I believe that the majority of scientists would agree, they would be of more valuable if they were available in a format that would allow display in online applets, downloadable for processing and expansions etc. The RSC would certainly welcome the availability of spectral data associated with publications especially since they can now be hosted on ChemSpider.</p>
<p>Thieme have actually managed to pull off quite a coup and I commend them for their efforts. The first example datasets <a href="http://www.thieme-connect.de/DOI/DOI?10.1055/s-2008-1067226">are available here</a>. The listing includes &#8220;FIDs and associated files for the <sup>1</sup>H, <sup>13</sup>C and DEPT NMR spectra for compounds <em>14</em>, (<em>S</em><sub>S</sub>)-<em>23</em>, (<em>S</em><sub>S</sub>)-<em>25</em>, (<em>R</em><sub>S</sub>)-<em>26</em>, <em>27</em>, (<em>S</em><sub>S</sub>)-<em>28</em>, (<em>R</em><sub>S</sub>,<em>S</em><sub>S</sub>)-<em>29</em>, <em>30</em>, (<em>R</em><sub>S</sub>)-<em>36</em>, (<em>S</em><sub>S</sub>)-<em>36</em>, (<em>S</em><sub>S</sub>)-<em>37</em>, <em>38</em>, (<em>R</em><sub>S</sub>)-<em>39</em>, (<em>S</em><sub>S</sub>)-<em>39</em>, (<em>S</em><sub>S</sub>)-<em>44</em>, (<em>R</em><sub>S</sub>)-<em>46</em>, (<em>S</em><sub>S</sub>)-<em>46</em>, (<em>R</em><sub>S</sub>)-<em>48</em>, (<em>S</em><sub>S</sub>)-<em>48</em>, (<em>S</em><sub>S</sub>)-<em>49</em>, <em>52</em>, (<em>R</em><sub>S</sub>)-<em>53</em>, (<em>R</em><sub>S</sub>)-<em>55</em>, (<em>R</em><sub>S</sub>)-<em>57</em>, (<em>S</em><sub>S</sub>)-<em>57</em>, (<em>S</em><sub>S</sub>)-<em>58</em>, (<em>R</em><sub>S</sub>)-<em>61</em>, (<em>S</em><sub>S</sub>)-<em>61</em>, (<em>R</em><sub>S</sub>)-<em>62</em>, (<em>S</em><sub>S</sub>)-<em>62</em>, (<em>R</em><sub>S</sub>)-<em>65</em> and (<em>S</em><sub>S</sub>)-<em>65</em> are summarized.&#8221; That&#8217;s a lot of data.</p>
<p>Since these are primary data they cannot be copyrighted so I chose to download the data, take a look and insert a couple into ChemSpider as an example of what can be done with these data. The associated PDF for the data says &#8220;The files can be processed using the following programs: MestReC, Bruker’s WINNMR and XWINNMR.&#8221; The files came as binary Bruker files so needed to be reprocessed and, in order to be deposited, had to be converted to JCAMP-DX format, the format supported by the JSpecView applet used on ChemSpider to display spectra. In order to this I am fortunate to have access to <a href="http://www.acdlabs.com/products/spec_lab/exp_spectra/nmr_proc/">ACD/NMR Processor</a>, a product I managed for a few years while working at ACD/Labs. This product also supports the Bruker format so I imported the data, processed and exported as JCAMP and imported to ChemSpider.  For compound <strong>14</strong> I have attached the H1 and C13 spectra and <a href="http://www.chemspider.com/Chemical-Structure.23076230.html#spectra">they can be seen here</a>. I didn&#8217;t attach the &#8220;DEPT spectrum&#8221; yet. In order for me to download the spectra, redraw the structure, process the spectra, export as JCAMP and deposit to ChemSpider took about 15 minutes. However, there are a lot of spectra and it will take me a while. There are 32 compounds, I assume 3 spectra per compound (HNMR, CNMR and DEPT) so that&#8217;s a total of 96 spectra. It&#8217;ll take me about 10-12 hours just to deposit this collection so that&#8217;s a lot of work to do in my spare time. If anyone wants to help out and can process the spectra to deposit please do!</p>
<p>One of the spectra are shown below using the <a href="http://www.chemspider.com/blog/why-are-spectra-like-youtube-videos.html">Spectral Embed</a> function we introduced previously:<br />
<script src="http://www.chemspider.com/csjsapi.ashx?op=spec&amp;tk=7283856c-a2d2-4634-8a11-1ea873f84244&amp;bid=2147&amp;w=750&amp;h=400" type="text/javascript"></script></p>
<p>This is a rich collection of data&#8230;it can feed <a href="http://www.jcheminf.com/content/pdf/1758-2946-1-9.pdf">the Spectral Game described in this article</a>. I look forward to getting the data onto ChemSpider and will be following up with Thieme to see if we can work together to host the data in a more generic format for the future. It&#8217;s a shame that the data are locked into a binary file format that needs reprocessing to view and I believe display through the JSpecView applet is advantageous for all. I encourage Thieme to consider also making the structure collection available in molfile, SMILES, InChI and InChIKey format &#8211; the InChIs will make the article discoverable via internet searches and through the InChI Resolver while the download of molfiles will speed up the loading process to ChemSpider and other systems.</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fthieme-announce-access-to-doied-primary-analytical-data-available.html';
  addthis_title  = 'Thieme+Announce+Access+to+DOI%26%238217%3Bed+Primary+Analytical+Data+Available';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for Thieme Announce Access to DOI'ed Primary Analytical Data Available" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+Thieme+Announce+Access+to+DOI'ed+Primary+Analytical+Data+Available" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/thieme-announce-access-to-doied-primary-analytical-data-available.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Does ChemSpider Have Millions of Errors?</title>
		<link>http://www.chemspider.com/blog/does-chemspider-have-millions-of-errors.html</link>
		<comments>http://www.chemspider.com/blog/does-chemspider-have-millions-of-errors.html#comments</comments>
		<pubDate>Mon, 19 Oct 2009 14:31:10 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[Presentations]]></category>
		<category><![CDATA[Quality and Content]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1459</guid>
		<description><![CDATA[I gave a talk today at the ICIC 2009 meeting here in Sitges, Spain. It is an interesting meeting and I will report on some of the presentations later. I&#8217;m glad I am here. The presentation is here on Slideshare and is a modified version of a presentation I gave on Saturday at the Microsoft [...]]]></description>
			<content:encoded><![CDATA[<p>I gave a talk today at the <a href="http://www.infonortics.com/chemical/">ICIC 2009 meeting here in Sitges, Spain</a>. It is an interesting meeting and I will report on some of the presentations later. I&#8217;m glad I am here. The presentation is <a href="http://www.slideshare.net/AntonyWilliams/how-internet-resources-are-providing-a-collaborative-community-for-chemistry">here on Slideshare</a> and is a modified version of a <a href="http://www.slideshare.net/AntonyWilliams/navigating-the-complex-web-of-chemistry-using-chemspider-2264825">presentation I gave on Saturday at the Microsoft eScience conference</a> in Pittsburg. One of the questions that followed the presentation was in regards to whether ChemSpider could be used as a measuring stick for quality (I am paraphrasing). My response was that there are millions of errors on ChemSpider and that seemed to raise a giggle and other people since then seemed surprised.</p>
<div style="width:425px;text-align:left" id="__ss_2277151"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/AntonyWilliams/how-internet-resources-are-providing-a-collaborative-community-for-chemistry" title="How Internet Resources Are Providing a Collaborative Community for Chemistry ">How Internet Resources Are Providing a Collaborative Community for Chemistry </a><object style="margin:0px" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=howinternetresourcesareprovidingacollaborativecommunityforchemistry101909final-091019090956-phpapp02&#038;stripped_title=how-internet-resources-are-providing-a-collaborative-community-for-chemistry" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=howinternetresourcesareprovidingacollaborativecommunityforchemistry101909final-091019090956-phpapp02&#038;stripped_title=how-internet-resources-are-providing-a-collaborative-community-for-chemistry" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object>
<div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">presentations</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/AntonyWilliams">Antony Williams</a>.</div>
</div>
<p>In my opinion, as shocking as it sounds, it must be true. Why?</p>
<p>There are almost 23 million unique chemical entities on the database. Many of them have multiple names associated, experimental properties, many have 10s of links to external databases. The structural layout has been created using algorithms. Algorithms have been used to generate systematic names. There are spectra submitted by the public and they can be mis-referenced, as an example, or declared to run in one solvent and ACTUALLY run in another. There are sometimes multiple registry numbers associated with a compound&#8230;a CAS number for a salt associated with with the neutral compound for example. The multiple links out to external resources number in the 10s of millions and these are changing daily as other websites and databases curate and edit their data. Errors are inevitable and, I judge, there must be millions of errors on ChemSPider. Just as there must be millions on Wikipedia and in the search results you get back from Google. The question is what counts as an error? I&#8217;m using a broad stroke brush for an error&#8230;a structure with a poor depiction is an error. A misspelling is an error. A dead link to a database is an error. So&#8230;definitely millions. But we continue our work to whittle down the number, with the assistance of the community, everyday. But we&#8217;re doing it while we are depositing new compounds onto the database so it&#8217;s an interesting challenge. Millions of errors doesn&#8217;t make ChemSpider less useful&#8230;we&#8217;re just realistic about the magnitude of the challenge!</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fdoes-chemspider-have-millions-of-errors.html';
  addthis_title  = 'Does+ChemSpider+Have+Millions+of+Errors%3F';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for Does ChemSpider Have Millions of Errors?" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+Does+ChemSpider+Have+Millions+of+Errors?" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/does-chemspider-have-millions-of-errors.html/feed</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>
