Copyright©2008 Antony Williams
That is not a misspelling in the title…I do mean Word Docmination and not world domination. It’s a rather tongue in cheek comment based on some recent discussions where a friend working in the domain of cheminformatics laughingly joked that continuing to develop ChemSpider at our present rate and linking up document markup capabilities could lead to “world domination”. Hardly. Anyhow, we’re more for collaboration and integration rather than domination. However, taking the comment with it’s intent I did find it funny considering what we are working on at present…”Word docmination for Chemistry ”
Over the past couple of weeks you will have noticed that the blog has gone quite quiet. There have been a number of reasons for this…all positive. Collectively our team us involved in a number of projects and these have timelines and deliverables. My own personal time that can be dedicated to blogging is much diminished, for right now. That said there are some exciting things to report. We have progressed a long way with our document markup system since the presentation at ACS-Philadelphia. What we showed at the conference was very much proof of concept. Since then we have been improving our workflows for markup, have been validating the identification of chemical names versus “other text”, have been working on easy ways to build “dictionaries” of good and bad names into the interface, have been improving our structure layout and depiction and have started to compare with other document markup approaches. Our intention with our document markup approach has been to take advantage of much of the work we have done over the past 18 months. Specifically, we have created a large foundation of chemical entities with associated properties (identifiers, experimental and predicted properties, links to publications, services and other related information) and now we can leverage this database as we perform document markup. Not only can the validated structure-name pairs be used to great effect for this work but when chemical names are converted to structures they can immediately be used for lookup on the ChemSpider database. When chemical structures are identified in documents (either via chemical name conversion or by extraction of the chemical structure from the OLE container in a Word Document) then they can be deposited into the ChemSpider database together with a link to the original document or appropriate meta data. As we have moved through our project we have been focusing in on the next phase of the project which should be integration to the most common desktop word processor (?) Microsoft Word. At present we are working on utilizing the markup capabilities we have been developing in Internet Explorer and building integration to Word. These capabilities will initially be called via our ChemSpider web services but in the future might involve some desktop components for markup remote from ChemSpider. Time will tell. Watch this space for more news as we unveil the new capabilties. We are certainly interested in talking to any publishers who might be interested in looking at our markup capabilities as we add them.Stumble it!