ChIN Summary Page: Oscar3 (Chemical Text Parsing: Automatic Recognition of Compound Names)

 

Oscar3 (Chemical Text Parsing: Automatic Recognition of Compound Names)

【URL】 http://www-pmr.ch.cam.ac.uk/wiki/Oscar3

【Pricing】 Free

【Summary】
     Oscar3 is a tool for shallow, chemistry-specific parsing of chemical documents. It identifies (or attempts to identify):

Chemical names: singular nouns, plurals, verbs etc., also formulae and acronyms, some enzymes and reaction names.
Ontology terms: if you can do it by string-matching, you can get OSCAR to do it.
Chemical data: spectra, melting/boiling points, yields etc. in experimental sections.

In addition, where possible, the detected chemical names are annotated with structures, either via lookup or name-to-structure parsing ("OPSIN"), and with identifiers from the chemical ontology ChEBI.
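
To illustrate what "doing it by string-matching" can look like, here is a minimal Python sketch of lexicon-based term matching. It is not OSCAR3's code or API: the lexicon, its placeholder identifiers (which are not real ChEBI IDs), and the function name `find_ontology_terms` are all assumptions made for this example.

```python
import re

# Hypothetical mini-lexicon: lowercase surface form -> ontology identifier.
# The identifiers are placeholders, not real ChEBI IDs.
LEXICON = {
    "acetic acid": "EX:0001",
    "ethanol": "EX:0002",
    "acid": "EX:0003",
}


def find_ontology_terms(text, lexicon=LEXICON):
    """Return (start, end, matched_text, identifier) tuples for lexicon hits."""
    if not lexicon:
        return []
    # Try longer terms first so "acetic acid" wins over "acid" at the same position.
    terms = sorted(lexicon, key=len, reverse=True)
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(t) for t in terms) + r")\b",
        re.IGNORECASE,
    )
    return [
        (m.start(), m.end(), m.group(0), lexicon[m.group(0).lower()])
        for m in pattern.finditer(text)
    ]


if __name__ == "__main__":
    for hit in find_ontology_terms("Acetic acid was dissolved in ethanol."):
        print(hit)
```

Sorting the alternation longest-first is a simple way to prefer multi-word terms over their substrings; a real lexicon of any size would more likely use a trie or a dedicated matcher.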

Current work on OSCAR3 by Peter Corbett focuses on its use in SciBorg, a framework for the deep parsing of chemical text.

OSCAR3 also includes the Oscar Server, a Jetty-powered set of servlets. These provide the following services:

Parsing of text/HTML by OSCAR.

Text/InChI/SMILES/SMILES substructure/SMILES similarity search of papers, coupled with keyword and ontology-based search, using Lucene and the CDK.

List of all names found / all names that co-occur with a search term or terms.

Online management of a chemical/stopword lexicon.

Manual editing of SciXML fragments containing named entities, for creating gold standards and training data.
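
The "names that co-occur with a search term" service listed above can be pictured with a short Python sketch. The Oscar Server itself is described as building on Lucene; the code below is only an in-memory illustration and assumes, hypothetically, that each paper has already been reduced to a list of recognised names. The function name `cooccurring_names` and the sample data are invented for this example.

```python
from collections import Counter


def cooccurring_names(documents, search_term):
    """Count names found in the same document as search_term.

    documents: iterable of lists of names, one list per paper (assumed input).
    Each name is counted at most once per document, case-insensitively.
    """
    counts = Counter()
    target = search_term.lower()
    for names in documents:
        unique = {n.lower() for n in names}
        if target in unique:
            counts.update(unique - {target})
    return counts


if __name__ == "__main__":
    docs = [
        ["benzene", "toluene", "ethanol"],
        ["Benzene", "ethanol"],
        ["pyridine", "toluene"],
    ]
    print(cooccurring_names(docs, "benzene").most_common())
```

Counting per document rather than per mention keeps a single long paper from dominating the ranking.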

【Related Links】
  Eight cheminformatics experts involved in ChemSpider share the prize money of the Microsoft Jim Gray eScience Award


Summary by 李晓霞 on 2013-04-24

Last updated by 李晓霞 on 2013-04-24

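Copyright © 1998 - 2015 Institute of Process Engineering, Chinese Academy of Sciences
       High Performance Computing and Cheminformatics Group
       Beijing, China · 京ICP备05058588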