<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: An Invitation to Open Access Publishers to Develop a Deposition API with ChemSpider</title>
	<atom:link href="http://www.chemspider.com/blog/an-invitation-to-open-access-publishers-to-develop-a-deposition-api-with-chemspider.html/feed" rel="self" type="application/rss+xml" />
	<link>http://www.chemspider.com/blog/an-invitation-to-open-access-publishers-to-develop-a-deposition-api-with-chemspider.html</link>
	<description>Building Community for Chemists</description>
	<lastBuildDate>Fri, 10 Feb 2012 11:07:05 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.2</generator>
	<item>
		<title>By: Eric Milgram</title>
		<link>http://www.chemspider.com/blog/an-invitation-to-open-access-publishers-to-develop-a-deposition-api-with-chemspider.html/comment-page-1#comment-28576</link>
		<dc:creator>Eric Milgram</dc:creator>
		<pubDate>Fri, 04 Apr 2008 19:46:42 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/blog/an-invitation-to-open-access-publishers-to-develop-a-deposition-api-with-chemspider.html#comment-28576</guid>
		<description>Tony,

The NIH&#039;s public access policy &quot;requires scientists to submit journal articles that arise from NIH funds to the digital archive PubMed Central (http://www.pubmedcentral.nih.gov/)...&quot;  The NIH has published a list of journals that by default, will upload their publications to pubmed: http://publicaccess.nih.gov/submit_process_journals.htm.  Perhaps some of these same journals would also require that manuscript authors submit their data electronically to a source such as ChemSpider.

One of the key foundations of science is the exchange of ideas, results, and data.  Some prestigous journals, such as Nature or Cell, require authors to make rare or precious reagents available to others for purposes of validating novel or controversial results.

Many people are aware of a number of cases where high profile scientific results  turned out to be fraudulent, as reported in the popular media.  There is no reason why authors should not be required to submit electronic copies of their data along with their manuscripts.  Before the advent of the WWW, publishers did not have a good mechanism for accepting this data and making it widely available.

However, with sites such as Wikipedia or Chemspider, the publishers do not need to incur the cost of maintaining a facility for data deposition and dissemination.  Also, if the process for uploading and searching this data is well-constructed, then authors wouldn&#039;t feel burdened by using it, and the scientific community benefits immensely by having access to it.

As scientific experiments become more complex, especially in fields like systems biology, where a small number of samples can generate a tremendous amount of data, being able to access the raw data will become more important than ever for a number of reasons.  For one thing, these fields are evolving rapidly, but there are no widely accepted best practices or QA/QC processes, even though a number of proposals are being discussed.  Thus, today, if one were to take the same data set (e.g. proteomics) and give it to 10 different groups to process, the results from those groups, depending upon how each group decides to process and interpret the results, very different conclusions can be drawn from the same data set.

A great example demonstrating this situation can be seen in the Jan 2008 journal, Statistical Applications in Genetics and Molecular Biology (http://www.bepress.com/sagmb/vol7/iss2/#competition).  In this issue, a clinical proteomics data set was made available to participants to analyze and submit their conclusions.  The journal selected some of the submissions and published the results, which were very interesting.

Making the raw data available has so many advantages.  Other researchers can use the data to verify the original conclusions, as well as to draw new conclusions.  The likelihood of fraud is decreased.  Also, so long as the sites where the data is being submitted have an adequate storage and retention policy, data archival for posterity becomes more facile.

I&#039;m glad to see that you&#039;re making available a facility for data storage.  The scientific community is slowly making the transition from the Gutenberg printing press era to the modern digital era.  At some point, having electronic data be available will certainly be required.</description>
		<content:encoded><![CDATA[<p>Tony,</p>
<p>The NIH&#8217;s public access policy &#8220;requires scientists to submit journal articles that arise from NIH funds to the digital archive PubMed Central (<a href="http://www.pubmedcentral.nih.gov/" rel="nofollow">http://www.pubmedcentral.nih.gov/</a>)&#8230;&#8221;  The NIH has published a list of journals that by default, will upload their publications to pubmed: <a href="http://publicaccess.nih.gov/submit_process_journals.htm" rel="nofollow">http://publicaccess.nih.gov/submit_process_journals.htm</a>.  Perhaps some of these same journals would also require that manuscript authors submit their data electronically to a source such as ChemSpider.</p>
<p>One of the key foundations of science is the exchange of ideas, results, and data.  Some prestigous journals, such as Nature or Cell, require authors to make rare or precious reagents available to others for purposes of validating novel or controversial results.</p>
<p>Many people are aware of a number of cases where high profile scientific results  turned out to be fraudulent, as reported in the popular media.  There is no reason why authors should not be required to submit electronic copies of their data along with their manuscripts.  Before the advent of the WWW, publishers did not have a good mechanism for accepting this data and making it widely available.</p>
<p>However, with sites such as Wikipedia or Chemspider, the publishers do not need to incur the cost of maintaining a facility for data deposition and dissemination.  Also, if the process for uploading and searching this data is well-constructed, then authors wouldn&#8217;t feel burdened by using it, and the scientific community benefits immensely by having access to it.</p>
<p>As scientific experiments become more complex, especially in fields like systems biology, where a small number of samples can generate a tremendous amount of data, being able to access the raw data will become more important than ever for a number of reasons.  For one thing, these fields are evolving rapidly, but there are no widely accepted best practices or QA/QC processes, even though a number of proposals are being discussed.  Thus, today, if one were to take the same data set (e.g. proteomics) and give it to 10 different groups to process, the results from those groups, depending upon how each group decides to process and interpret the results, very different conclusions can be drawn from the same data set.</p>
<p>A great example demonstrating this situation can be seen in the Jan 2008 journal, Statistical Applications in Genetics and Molecular Biology (<a href="http://www.bepress.com/sagmb/vol7/iss2/#competition" rel="nofollow">http://www.bepress.com/sagmb/vol7/iss2/#competition</a>).  In this issue, a clinical proteomics data set was made available to participants to analyze and submit their conclusions.  The journal selected some of the submissions and published the results, which were very interesting.</p>
<p>Making the raw data available has so many advantages.  Other researchers can use the data to verify the original conclusions, as well as to draw new conclusions.  The likelihood of fraud is decreased.  Also, so long as the sites where the data is being submitted have an adequate storage and retention policy, data archival for posterity becomes more facile.</p>
<p>I&#8217;m glad to see that you&#8217;re making available a facility for data storage.  The scientific community is slowly making the transition from the Gutenberg printing press era to the modern digital era.  At some point, having electronic data be available will certainly be required.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Antony Williams</title>
		<link>http://www.chemspider.com/blog/an-invitation-to-open-access-publishers-to-develop-a-deposition-api-with-chemspider.html/comment-page-1#comment-28561</link>
		<dc:creator>Antony Williams</dc:creator>
		<pubDate>Fri, 04 Apr 2008 18:12:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/blog/an-invitation-to-open-access-publishers-to-develop-a-deposition-api-with-chemspider.html#comment-28561</guid>
		<description>Yes..you are of course correct. One would hope that some validation of the data is done by the reviewers but how often do they have access to raw data! So, following deposition some curation and checking would need to be done. That coule be manual by ChemSpider curators or could be done by integrating to other tools/systems. Maybe an opportunity to send structures and spectra through an automated checking system ,....sound like something of interest to setup?</description>
		<content:encoded><![CDATA[<p>Yes..you are of course correct. One would hope that some validation of the data is done by the reviewers but how often do they have access to raw data! So, following deposition some curation and checking would need to be done. That coule be manual by ChemSpider curators or could be done by integrating to other tools/systems. Maybe an opportunity to send structures and spectra through an automated checking system ,&#8230;.sound like something of interest to setup?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Wolfgang Robien</title>
		<link>http://www.chemspider.com/blog/an-invitation-to-open-access-publishers-to-develop-a-deposition-api-with-chemspider.html/comment-page-1#comment-28533</link>
		<dc:creator>Wolfgang Robien</dc:creator>
		<pubDate>Fri, 04 Apr 2008 16:06:10 +0000</pubDate>
		<guid isPermaLink="false">http://www.chemspider.com/blog/an-invitation-to-open-access-publishers-to-develop-a-deposition-api-with-chemspider.html#comment-28533</guid>
		<description>ad C-NMR:

Deposition of spectra saves the first 10% of the &#039;job&#039; - the hard work starts when checking and curating !

For details see:
http://nmrpredict.orc.univie.ac.at/csearchlite/NMR_misinterpretation.html
I have approx. 6,000 examples more in my queue and I am definitely NOT talking about assignment problems below 5ppm !</description>
		<content:encoded><![CDATA[<p>ad C-NMR:</p>
<p>Deposition of spectra saves the first 10% of the &#8216;job&#8217; &#8211; the hard work starts when checking and curating !</p>
<p>For details see:<br />
<a href="http://nmrpredict.orc.univie.ac.at/csearchlite/NMR_misinterpretation.html" rel="nofollow">http://nmrpredict.orc.univie.ac.at/csearchlite/NMR_misinterpretation.html</a><br />
I have approx. 6,000 examples more in my queue and I am definitely NOT talking about assignment problems below 5ppm !</p>
]]></content:encoded>
	</item>
</channel>
</rss>

