Copyright©2008 Antony Williams
ChemSpider IS polluted with interesting identifiers associated with chemical structures and I have blogged many times about our efforts to clean it up. I’ve also suggested that systems such as ChemSpider, and their are many, needs an easy way to provide feedback and we have done this as discussed here. All of us hosting such large data collections deal with these issues. Today I found a classic though. A search on a CAS Number brought me to this page:
The information seems fair enough but the list of names is quite amusing:
These might be a new form of “International Name”. We have had disasters just like this on our own site. At the weekend I was informed by a user of one of our structures having over 70,000 identifiers! We looked at it. It was the ONLY structure on the database with more than 300 identifiers and this one user found it. We’ve cleaned it out now. Hosting services like this is a lot of funStumble it!