Simpler URLS For Better Indexing and Easier Searching
Posted by: Antony Williams in ChemSpider ServicesCopyright©2007 Antony Williams
We have noticed that despite the millions of InChIStrings and InChIKeys on ChemSpider they are not being indexed very well. Thanks to some constructive feedback from an interested party from Google (Thanks Simon!) we have revamped our URLs and improved the nature of how structures can be identified by various identifiers as shown below. These are now all valid URLs to identify a molecule and, as people mbed such links in their sites, indexing is more likely to be catalyzed.
http://www.chemspider.com/InChIKey=DEIYFTQMQPDXOT-RERXVCSDCZ
http://www.chemspider.com/Molecular-Formula=C28H38N6O11S
http://www.chemspider.com/Substance-Suppliers.10482071.html
http://www.chemspider.com/1234
http://www.chemspider.com/q=benzene
http://www.chemspider.com/InChI=1/C6H6/c1-2-4-6-5-3-1/h1-6H
We will also submit an updated version of our SiteMap to Google and see if we can catalyze the indexing of the InChIKeys and InChIStrings at least. This will definitely help in making the web more structure searchable.
Entries (RSS)
September 29th, 2007 at 10:39 am
That is very good news – getting efficiently indexed on Google is key to having ChemSpider become ubiquitous.
September 29th, 2007 at 1:09 pm
Great move Tony!
Also create a urllist.txt file (it’s like a Google sitemap) and lodge that with Yahoo!
db
September 30th, 2007 at 2:49 am
Excellent! I particularly like those which include the InChI(Key), which are rather close to those I use for rdf.openmolecules.net.
Now, I would love to start talking about a rdf.chemspider.com/InChI=1/bla, which would spit Resource Description Format. Together with that urllist.txt David proposed, you got the largest real world triple list ever composed! That will keep all those KM, and triple store developers busy for a while
October 1st, 2007 at 12:59 am
David…the sitemap equivalents will be submitted to all appropriate engines..
October 1st, 2007 at 1:00 am
Egon, We’re ready to start chatting about this and will do it offline. Sorry for the delay in discussing RDF’ing..you can see we’ve been distracted