<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: An Invitation to Collaborate on Open Notebook Science for an NMR Study</title>
	<atom:link href="http://www.chemspider.com/blog/an-invitation-to-collaborate-on-open-notebook-science-for-an-nmr-study.html/feed" rel="self" type="application/rss+xml" />
	<link>http://www.chemspider.com/blog/an-invitation-to-collaborate-on-open-notebook-science-for-an-nmr-study.html</link>
	<description>Building Community for Chemists</description>
	<lastBuildDate>Fri, 10 Feb 2012 11:07:05 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.2</generator>
	<item>
		<title>By: Unilever Centre for Molecular Informatics, Cambridge - petermr&#8217;s blog &#187; Blog Archive &#187; Open NMR - again. Why we do it</title>
		<link>http://www.chemspider.com/blog/an-invitation-to-collaborate-on-open-notebook-science-for-an-nmr-study.html/comment-page-1#comment-8237</link>
		<dc:creator>Unilever Centre for Molecular Informatics, Cambridge - petermr&#8217;s blog &#187; Blog Archive &#187; Open NMR - again. Why we do it</dc:creator>
		<pubDate>Fri, 07 Dec 2007 09:25:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=213#comment-8237</guid>
		<description>[...] read my highly opinionated views of what was originally entitled “Open Notebook Science NMR” (1,2). My views around that work were very strong…in fact I didn’t really “get it”. I didn’t [...]</description>
		<content:encoded><![CDATA[<p>[...] read my highly opinionated views of what was originally entitled “Open Notebook Science NMR” (1,2). My views around that work were very strong…in fact I didn’t really “get it”. I didn’t [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Wolfgang Robien</title>
		<link>http://www.chemspider.com/blog/an-invitation-to-collaborate-on-open-notebook-science-for-an-nmr-study.html/comment-page-1#comment-3870</link>
		<dc:creator>Wolfgang Robien</dc:creator>
		<pubDate>Wed, 24 Oct 2007 11:30:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=213#comment-3870</guid>
		<description>Dear Tony;

there is a searchable database of 16.4 millions of calculated C13-NMR spectra available since approx. 1 year on http://nmrpredict.orc.univie.ac.at/identify
(Peter: it&#039;s free of charge !  ;-)) )

The spectra have been calculated for 16,4 millions of the PUBCHEM-structures using the CSEARCH NN-approach. The search technology used, is a modified SAHO-approach as implemented in CSEARCH.

If there is more interest in using this, no problem to upgrade the data file to the actual size of the PUBCHEM-collection. The calculation of approx. 40 millions of spectra can be done in less than one week on a 4-processor box.

Best regards, Wolfgang Robien</description>
		<content:encoded><![CDATA[<p>Dear Tony;</p>
<p>there is a searchable database of 16.4 millions of calculated C13-NMR spectra available since approx. 1 year on <a href="http://nmrpredict.orc.univie.ac.at/identify" rel="nofollow">http://nmrpredict.orc.univie.ac.at/identify</a><br />
(Peter: it&#8217;s free of charge !  <img src='http://www.chemspider.com/blog/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> ) )</p>
<p>The spectra have been calculated for 16,4 millions of the PUBCHEM-structures using the CSEARCH NN-approach. The search technology used, is a modified SAHO-approach as implemented in CSEARCH.</p>
<p>If there is more interest in using this, no problem to upgrade the data file to the actual size of the PUBCHEM-collection. The calculation of approx. 40 millions of spectra can be done in less than one week on a 4-processor box.</p>
<p>Best regards, Wolfgang Robien</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Antony Williams</title>
		<link>http://www.chemspider.com/blog/an-invitation-to-collaborate-on-open-notebook-science-for-an-nmr-study.html/comment-page-1#comment-3857</link>
		<dc:creator>Antony Williams</dc:creator>
		<pubDate>Wed, 24 Oct 2007 00:45:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=213#comment-3857</guid>
		<description>Response to blog posting by Peter Murray-Rust posted (http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=715) here for completeness

Peter, I&#039;ll comment in more detail on this after your readers have a chance to comment. But, for clarity, I point your readers to http://www.chemspider.com/blog/?p=213 where I took your &quot;George Whitesides approach to writing papers&quot; and added to the conclusion. Here&#039;s the piece I added.

&quot;The results from the GIAO calculations were compared with three other prediction approaches provided by Advanced Chemistry Development. These algorithms were not limited in the number of heavy atoms that could be handled by the algorithm, The algorithms were a HOSE-code based approach, a neural network approach and an â€œincrement approachâ€. A distinct advantage of these approaches is the time for prediction relative to the quantum-mechanical calculations. The QM calculation took a number of weeks to perform on the dataset of 23475 structures on a cluster of computers. However, a standard PC enabled the HOSE code based predictions to be performed in a few hours, the Neural Net predictions in about 4 minutes and the Increment based predictions in less than 3 minutes.
A comparison of the approaches gave statistics for the non-QM approaches superior to those of the QM approach. Outliers were observed in both cases and were traced to misassignments. QM approaches were generally more capable of predicting exotic structures while for the majority of the NMRShiftDB made up of general organic chemicals non-QM approaches were superior.&quot;

There should be no surprise to you that ACD/Labs stepped forward to participate. I declared it explicitly in my blog posting.

I also posted in that blog the following statement &quot;I believe this project offers the ability to help build a bridge between the Open Data community, the academic community and the commercial software community for the benefit of science. There has never been a study of the magnitude being discussed here comparing quantum-mechanical NMR prediction methods with the methods represented by commercial software products. I look forward to it!&quot;

I fully acknowledge your stance on commercial software companies. Also on publishers. And many other areas. You&#039;re not shy with your judgments. Having worked in academia, a Fortune 500 company and in a commercial software company I can comment that all three have good science going on, some excellent people in their organizations and certainly people committed to their roles and to science. I beg the question why not help build the bridge rather than maintain the distance.

Further clarification..ChemSpider does not have access to any NMR prediction algorithms. However, they would be willing to work on this project for the science. The hypothesis under question is whether HOSE, Neural Net or Increment based algorithms can outperform GIAO predictions. It is already known that they are faster ...these are real numbers generated already &quot;on the dataset of 23475 structures on a cluster of computers a standard PC enabled the HOSE code based predictions to be performed in a few hours, the Neural Net predictions in about 4 minutes and the Increment based predictions in less than 3 minutes.&quot; What is the statement on accuracy? I believe it&#039;s a valid scientific question to be answered. 

We have just submitted a publication regarding one aspect of this validated on NMRShiftDB with your collaborator, Christoph Steinbeck, as our collaborator. The title and authors are below..should be in JCIM shortly. I have already sent you a copy I believe.

The Performance Validation of Neural Network Based 13C NMR Prediction
Using a Publicly Available Data Source.
K.A. BlinovÂ§, Y.D. SmurnyyÂ§, M.E. ElyashbergÂ§, T.S. ChuranovaÂ§, M. KvashaÂ§, C. Steinbeck#, B.A. Lefebvreâ€  and A.J Williamsâ€¡
Â§ Advanced Chemistry Development, Moscow Department, 6 Akademik Bakulev Street, Moscow 117513, Russian Federation
â€  Advanced Chemistry Development, Inc., 110 Yonge Street, 14th floor, Toronto, Ontario, Canada, M5C 1T4
# Steinbeck Molecular Informatics, Franz-John-Str. 10, 77855 Achern, Germany.
â€¡ ChemZoo Inc., 904 Tamaras Circle, Wake Forest, NC-27587</description>
		<content:encoded><![CDATA[<p>Response to blog posting by Peter Murray-Rust posted (<a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=715" rel="nofollow">http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=715</a>) here for completeness</p>
<p>Peter, I&#8217;ll comment in more detail on this after your readers have a chance to comment. But, for clarity, I point your readers to <a href="http://www.chemspider.com/blog/?p=213" rel="nofollow">http://www.chemspider.com/blog/?p=213</a> where I took your &#8220;George Whitesides approach to writing papers&#8221; and added to the conclusion. Here&#8217;s the piece I added.</p>
<p>&#8220;The results from the GIAO calculations were compared with three other prediction approaches provided by Advanced Chemistry Development. These algorithms were not limited in the number of heavy atoms that could be handled by the algorithm, The algorithms were a HOSE-code based approach, a neural network approach and an â€œincrement approachâ€. A distinct advantage of these approaches is the time for prediction relative to the quantum-mechanical calculations. The QM calculation took a number of weeks to perform on the dataset of 23475 structures on a cluster of computers. However, a standard PC enabled the HOSE code based predictions to be performed in a few hours, the Neural Net predictions in about 4 minutes and the Increment based predictions in less than 3 minutes.<br />
A comparison of the approaches gave statistics for the non-QM approaches superior to those of the QM approach. Outliers were observed in both cases and were traced to misassignments. QM approaches were generally more capable of predicting exotic structures while for the majority of the NMRShiftDB made up of general organic chemicals non-QM approaches were superior.&#8221;</p>
<p>There should be no surprise to you that ACD/Labs stepped forward to participate. I declared it explicitly in my blog posting.</p>
<p>I also posted in that blog the following statement &#8220;I believe this project offers the ability to help build a bridge between the Open Data community, the academic community and the commercial software community for the benefit of science. There has never been a study of the magnitude being discussed here comparing quantum-mechanical NMR prediction methods with the methods represented by commercial software products. I look forward to it!&#8221;</p>
<p>I fully acknowledge your stance on commercial software companies. Also on publishers. And many other areas. You&#8217;re not shy with your judgments. Having worked in academia, a Fortune 500 company and in a commercial software company I can comment that all three have good science going on, some excellent people in their organizations and certainly people committed to their roles and to science. I beg the question why not help build the bridge rather than maintain the distance.</p>
<p>Further clarification..ChemSpider does not have access to any NMR prediction algorithms. However, they would be willing to work on this project for the science. The hypothesis under question is whether HOSE, Neural Net or Increment based algorithms can outperform GIAO predictions. It is already known that they are faster &#8230;these are real numbers generated already &#8220;on the dataset of 23475 structures on a cluster of computers a standard PC enabled the HOSE code based predictions to be performed in a few hours, the Neural Net predictions in about 4 minutes and the Increment based predictions in less than 3 minutes.&#8221; What is the statement on accuracy? I believe it&#8217;s a valid scientific question to be answered. </p>
<p>We have just submitted a publication regarding one aspect of this validated on NMRShiftDB with your collaborator, Christoph Steinbeck, as our collaborator. The title and authors are below..should be in JCIM shortly. I have already sent you a copy I believe.</p>
<p>The Performance Validation of Neural Network Based 13C NMR Prediction<br />
Using a Publicly Available Data Source.<br />
K.A. BlinovÂ§, Y.D. SmurnyyÂ§, M.E. ElyashbergÂ§, T.S. ChuranovaÂ§, M. KvashaÂ§, C. Steinbeck#, B.A. Lefebvreâ€  and A.J Williamsâ€¡<br />
Â§ Advanced Chemistry Development, Moscow Department, 6 Akademik Bakulev Street, Moscow 117513, Russian Federation<br />
â€  Advanced Chemistry Development, Inc., 110 Yonge Street, 14th floor, Toronto, Ontario, Canada, M5C 1T4<br />
# Steinbeck Molecular Informatics, Franz-John-Str. 10, 77855 Achern, Germany.<br />
â€¡ ChemZoo Inc., 904 Tamaras Circle, Wake Forest, NC-27587</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Unilever Centre for Molecular Informatics, Cambridge - petermr&#8217;s blog &#187; Blog Archive &#187; Open Notebook NMR: Commercial re-use of data?</title>
		<link>http://www.chemspider.com/blog/an-invitation-to-collaborate-on-open-notebook-science-for-an-nmr-study.html/comment-page-1#comment-3856</link>
		<dc:creator>Unilever Centre for Molecular Informatics, Cambridge - petermr&#8217;s blog &#187; Blog Archive &#187; Open Notebook NMR: Commercial re-use of data?</dc:creator>
		<pubDate>Tue, 23 Oct 2007 22:46:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=213#comment-3856</guid>
		<description>[...] Antony Williams of Chemspider has offered to participate in our Open Notebook NMR experiment. Now this offer has been joined by ACDLabs - I am not sure of the formal relation between the companies but they have clear common interests. I had originally thought this was one individual making a personal offer - now there is a company that is requesting our data for them to work on.I have some genuine concerns about how we should proceed so am clearing my thoughts online. Readers will recall that I have been strongly critical of companies or nonprofits which use closed source and protect closed data. I have roughly equal numbers of correspondents who think I have been too hard on these organizations and those who think I need to be tougher. So I am taking a measured tone here. Peter - FYI ACD/Labs are ready to participate in the work as discussed: http://www.chemspider.com/blog/?p=213#comment-3735 [...]</description>
		<content:encoded><![CDATA[<p>[...] Antony Williams of Chemspider has offered to participate in our Open Notebook NMR experiment. Now this offer has been joined by ACDLabs &#8211; I am not sure of the formal relation between the companies but they have clear common interests. I had originally thought this was one individual making a personal offer &#8211; now there is a company that is requesting our data for them to work on.I have some genuine concerns about how we should proceed so am clearing my thoughts online. Readers will recall that I have been strongly critical of companies or nonprofits which use closed source and protect closed data. I have roughly equal numbers of correspondents who think I have been too hard on these organizations and those who think I need to be tougher. So I am taking a measured tone here. Peter &#8211; FYI ACD/Labs are ready to participate in the work as discussed: <a href="http://www.chemspider.com/blog/?p=213#comment-3735" rel="nofollow">http://www.chemspider.com/blog/?p=213#comment-3735</a> [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Unilever Centre for Molecular Informatics, Cambridge - petermr&#8217;s blog &#187; Blog Archive &#187; Open Notebook NMR - cont</title>
		<link>http://www.chemspider.com/blog/an-invitation-to-collaborate-on-open-notebook-science-for-an-nmr-study.html/comment-page-1#comment-3824</link>
		<dc:creator>Unilever Centre for Molecular Informatics, Cambridge - petermr&#8217;s blog &#187; Blog Archive &#187; Open Notebook NMR - cont</dc:creator>
		<pubDate>Mon, 22 Oct 2007 17:15:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=213#comment-3824</guid>
		<description>[...] Peter - FYI ACD/Labs are ready to participate in the work as discussed: http://www.chemspider.com/blog/?p=213#comment-3735 [...]</description>
		<content:encoded><![CDATA[<p>[...] Peter &#8211; FYI ACD/Labs are ready to participate in the work as discussed: <a href="http://www.chemspider.com/blog/?p=213#comment-3735" rel="nofollow">http://www.chemspider.com/blog/?p=213#comment-3735</a> [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brent Lefebvre</title>
		<link>http://www.chemspider.com/blog/an-invitation-to-collaborate-on-open-notebook-science-for-an-nmr-study.html/comment-page-1#comment-3735</link>
		<dc:creator>Brent Lefebvre</dc:creator>
		<pubDate>Fri, 19 Oct 2007 14:35:55 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=213#comment-3735</guid>
		<description>Peter and Tony,

I think this is a fantastic project and am very keen to see how accurate the QM techniques prove to be for the subset of structures that you choose from the NMRShiftDB, and then how helpful they can be in improving the accuracy of experimental shifts in this wonderful resource. 

For the purposes of this work, we would be willing to provide the chemical shift predictions from the ACD/Labs software if you would like to use them in your comparison.  If, for instance, they prove to be accurate enough to find many of these problems without the need for time consuming QM calculations, it may be preferrable to use the faster calculation algorithms that are available in our software.  It may turn out that the ACD/Labs predictions could serve as a pre-filter to define which structures need the QM calculations and which don&#039;t.  Many variations on this theme come to mind, but we won&#039;t know which are useful until we do the work.

Sincerely,

Brent Lefebvre
NMR Product Manager
Advanced Chemistry Development, Inc.</description>
		<content:encoded><![CDATA[<p>Peter and Tony,</p>
<p>I think this is a fantastic project and am very keen to see how accurate the QM techniques prove to be for the subset of structures that you choose from the NMRShiftDB, and then how helpful they can be in improving the accuracy of experimental shifts in this wonderful resource. </p>
<p>For the purposes of this work, we would be willing to provide the chemical shift predictions from the ACD/Labs software if you would like to use them in your comparison.  If, for instance, they prove to be accurate enough to find many of these problems without the need for time consuming QM calculations, it may be preferrable to use the faster calculation algorithms that are available in our software.  It may turn out that the ACD/Labs predictions could serve as a pre-filter to define which structures need the QM calculations and which don&#8217;t.  Many variations on this theme come to mind, but we won&#8217;t know which are useful until we do the work.</p>
<p>Sincerely,</p>
<p>Brent Lefebvre<br />
NMR Product Manager<br />
Advanced Chemistry Development, Inc.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

