SlideShare a Scribd company logo
Lexichem, a New Era
Overview

•

•

•

•

•




           2
Lexichem
Lexichem




     Nordefrin

Supported Nomenclature     Supported Languages (17)

•IUPAC 79 / 93 / 2005      •English (American / British)
•Chemical Abstract / CAS   •German
•Traditional               •Japanese
•MDL / Beilstein           •Spanish
•AutoNom                   •Swedish
•OpenEye
                                                           4
Command Line Applications

                        Mol2Nam
                        • Convert a file of molecules to names
bicyclo[3.2.1]octane



 bicyclo[3.2.1]octane
                        Nam2Mol
                        • Convert a file of names to molecules


  Glycinate             Translate
        • Convert a file of chemical names into a different
 グリシナート   language
                                                                 5
Lexichem TK




              6
Applications
Pipelines




• Large scale
  conversion of
  structures to names
  and names to
  structures

• Easy integration in
  workflows
                        8
Webservices


• Lexichem Webservice available

• Integration with 3rd party webservices

                                            • PubChem uses
                                            Lexichem



                    Mol2Nam

         Molecule
                               PUG          Hits in
            s
                              SOAP         PubChem
                                                        9
Lexiparser

• Automated extraction of structures and names from documents
• Supported formats:
        - .txt                    - .docx
        - .html                   - .rtf
        - .doc                    - .pdf FUTURE




                                                         10
Extracting Structures from a
  Patent
Patent URL




             Names extracted




                                 Generate Structures
                                                11
Desktop Applications




Electronic Laboratory Notebook




                                 NEW! Lexichem Workbench
                                                       12
Performance Metric
Why a New Performance Metric ?


• Ensures consistent improvement of Lexichem

• Identify areas in need of development

• Gold standard for all chemical nomenclature software




                                                    14
Round Tripping


  • Compare the initial                   Canonical
    and final structure
    after name generation              Isomeric SMILES

  • Easy to calculate
                  TP
  %RTCS =                   ×100
            TotalStructures

                                         English IUPAC
                                            Name

*E.O.Cannon. JCIM 2012, DOI: 10.1021/ci3000419
                                                         15
Results
Performance

                                %RTCS
                 98.71
100.00       88.94         89.05 92.43     89.88         93.83
 90.00                          84.54
 80.00
 70.00
 60.00                                               60.02
                        48.69            52.80
  50.00
  40.00                                                                           V2.0.2
  30.00                                                                           V2.1.1
  20.00
  10.00
    0.00
                                                                    V2.1.1                        Speed
           Maybridge    MDDR                                      V2.0.2
                                                                 12000
                                NCI
                                      Wombat
                                                   PubChem
                                                                 10000


                                                                  8000

                                                 Names s-1
                                                                  6000
                                                 Molecules s-1

                                                                  4000


                                                                  2000


       NCI             Wombat    PubChem                             0
                                                                             Maybridge     MDDR      NCI   Wombat
                                                                                                                    17
                                                                                                                     PubChem
New Features
Nam2mol – New Features

–




                             19
Nam2mol – New Features

–

–




                             18
Nam2mol – New Features

–

–

–




                             18
Nam2mol – New Features

–

–

–

–

               H       H       H        H




                           H       H
                   H




                                   18
Nam2mol – New Features

–

–

–

–

–




                             18
Nam2mol – New Features

–

–

–

–

–

–




                             18
Nam2mol – New Features

–

–

–

–

–

–           L-Arginine       D-Arginine

–


                                          18
Nam2mol – New Features

–

–

–

–

–

–

–

–
                             18
Octahydro-1H-4,7-epoxyisoindole                   Benzo[cd]indole

                                        Ring
                                      templates




5,6,6a,7-tetrahydro-4H-dibenzo[de,g]quinoline       Yohimban

                                                                    19
Lexichem Workbench
Lexichem Workbench

•
•
    –
    –
    –
    –
    –
    –
    –


                     21
Main Window




• Converts input SMILES string      • Chemical information:
  or chemical name                     - Molecular weight
• Visual display of the structure      - SMILES
                                       - IUPAC name
                                                          22
Results




• Results history
• Original input on display


                              23
Results




• Text options:
   - Copy selected cells
   - Save table
   - Display selection
                           24
Results




• Display options:
   - Save
   - Copy
   - Print
                     25
Substructure Search




• Options:
  - Functional group from list
  - Custom SMILES/SMARTS pattern
  - Custom name
                                   26
Conclusions

•



•

•




              27
•
•
•
•
•
•


    28

More Related Content

PPTX
Intro to Water-based Coatings
PDF
Dispersants & Coupling Agents: New Chemistry for a Mature Industry
PDF
The Value in Recycled Polyamides
PPTX
Minerals in Compounding
PDF
Adding Value to Recycled Polyamides with Polyfunctional Chain Extenders
PPT
Polymers in Cables
PPTX
Form & Structure: The Effects of Polybutadiene TPU with Polypropylene
PDF
Visual Studio dot net
Intro to Water-based Coatings
Dispersants & Coupling Agents: New Chemistry for a Mature Industry
The Value in Recycled Polyamides
Minerals in Compounding
Adding Value to Recycled Polyamides with Polyfunctional Chain Extenders
Polymers in Cables
Form & Structure: The Effects of Polybutadiene TPU with Polypropylene
Visual Studio dot net

Viewers also liked (17)

DOC
Announcement 3
PPTX
catalytic enantioselective trioxygenation
PPTX
Монголія
PDF
Lady Bird Deeds
PDF
Corgraf Papelaria 2015
DOCX
Tectonica de Placas
PPT
教學Ppt吉貝國中陳良毓
PPTX
Presentación1
PPT
orchid mounted on tree slab
PDF
*Annual Report 2014-15
PDF
Free facebook insights_report_on_tremont_tearoom_(07_29_2014-08_10_2014_pdt)
PPTX
SMAC- Facebook Analytics
PPTX
History and curiosities of clasical movies
PPTX
COMO ATRAIR 3 VEZES MAIS CLIENTES SEM INVESTIR UM CENTAVO?
PPT
黄山
PDF
Catálogo discos ff
PPTX
Announcement 3
catalytic enantioselective trioxygenation
Монголія
Lady Bird Deeds
Corgraf Papelaria 2015
Tectonica de Placas
教學Ppt吉貝國中陳良毓
Presentación1
orchid mounted on tree slab
*Annual Report 2014-15
Free facebook insights_report_on_tremont_tearoom_(07_29_2014-08_10_2014_pdt)
SMAC- Facebook Analytics
History and curiosities of clasical movies
COMO ATRAIR 3 VEZES MAIS CLIENTES SEM INVESTIR UM CENTAVO?
黄山
Catálogo discos ff
Ad

Similar to Jcup 3 (2012) Presentation: Lexichem, a new Era. By Ed Cannon (20)

PPTX
some_other_API
PDF
(ATS4-PLAT04) Chemistry Data Model Enhancements in Pipeline Pilot 9.0: what a...
PPT
Annular nanolayered shrink film 3 26-2018
PDF
MOCVD Brochure.PDF
PDF
Ee325 cmos design lab 4 report - loren k schwappach
PPT
ELMARCO - Company overview
PDF
Riak - From Small to Large - StrangeLoop
PDF
Riak - From Small to Large
PDF
PEM-Alkaline-Comparison-of-technologies-CEMAC
PDF
NEGOCIOS DEL AMONIACO INNOVACION Y FUTURO EN CHILE
PDF
PDF
16f877
PDF
Ali max-bev demo presentation 2-9-18
PDF
Large customers want postgresql too !!
PDF
Hydroxylamine Cleaning Chemistries
PPTX
Internsheereresresfsdddsddddrsfuvvfip.pptx
PDF
NEW BATTERY TECHNOLOGY FOR ELECTROLYTE AND ANODE MATERIALS
PDF
Mil Aer Homeland
PDF
Johnson Matthey - Sean Balkissoon V1.pdf
PDF
2010 engine aftertreatment nrel
some_other_API
(ATS4-PLAT04) Chemistry Data Model Enhancements in Pipeline Pilot 9.0: what a...
Annular nanolayered shrink film 3 26-2018
MOCVD Brochure.PDF
Ee325 cmos design lab 4 report - loren k schwappach
ELMARCO - Company overview
Riak - From Small to Large - StrangeLoop
Riak - From Small to Large
PEM-Alkaline-Comparison-of-technologies-CEMAC
NEGOCIOS DEL AMONIACO INNOVACION Y FUTURO EN CHILE
16f877
Ali max-bev demo presentation 2-9-18
Large customers want postgresql too !!
Hydroxylamine Cleaning Chemistries
Internsheereresresfsdddsddddrsfuvvfip.pptx
NEW BATTERY TECHNOLOGY FOR ELECTROLYTE AND ANODE MATERIALS
Mil Aer Homeland
Johnson Matthey - Sean Balkissoon V1.pdf
2010 engine aftertreatment nrel
Ad

Recently uploaded (20)

PDF
AI And Its Effect On The Evolving IT Sector In Australia - Elevate
PDF
DevOps & Developer Experience Summer BBQ
PDF
REPORT: Heating appliances market in Poland 2024
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PDF
HCSP-Presales-Campus Network Planning and Design V1.0 Training Material-Witho...
PDF
SAP855240_ALP - Defining the Global Template PUBLIC.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
KodekX | Application Modernization Development
PDF
CIFDAQ's Market Wrap: Ethereum Leads, Bitcoin Lags, Institutions Shift
PDF
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
CIFDAQ's Teaching Thursday: Moving Averages Made Simple
PDF
Event Presentation Google Cloud Next Extended 2025
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Omni-Path Integration Expertise Offered by Nor-Tech
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Chapter 2 Digital Image Fundamentals.pdf
AI And Its Effect On The Evolving IT Sector In Australia - Elevate
DevOps & Developer Experience Summer BBQ
REPORT: Heating appliances market in Poland 2024
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
GamePlan Trading System Review: Professional Trader's Honest Take
HCSP-Presales-Campus Network Planning and Design V1.0 Training Material-Witho...
SAP855240_ALP - Defining the Global Template PUBLIC.pdf
MYSQL Presentation for SQL database connectivity
KodekX | Application Modernization Development
CIFDAQ's Market Wrap: Ethereum Leads, Bitcoin Lags, Institutions Shift
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
NewMind AI Weekly Chronicles - August'25 Week I
CIFDAQ's Teaching Thursday: Moving Averages Made Simple
Event Presentation Google Cloud Next Extended 2025
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Omni-Path Integration Expertise Offered by Nor-Tech
CIFDAQ's Market Insight: SEC Turns Pro Crypto
NewMind AI Monthly Chronicles - July 2025
Chapter 2 Digital Image Fundamentals.pdf

Jcup 3 (2012) Presentation: Lexichem, a new Era. By Ed Cannon

Editor's Notes

  • #3: Start by talking about some real world applications of Lexichem, what you can do with it and who is using it.then move on to talk about Lexichem, TK v2.11….released Metric to assess how well Lexichem is performingNew features from v2.0.2Finish off with GUI
  • #4: Now lets talk about the main purpose of my were here, Lexichem.
  • #5: Lexichem is OpenEyes chemical nomenclature software, you can:->convert names to molecules->molecules to names->translate names from one language to anotherLexichem comes in two flavours
  • #6: Standalone applications run from command line.
  • #7: For those who want to program and use Lexichem:Lexichem TK is written in C++ and Swig (Simplified Wrapper and Interface Generator) wrapped to python, Java and C#.
  • #8: So what can Lexichem do, other than help you buy heroin?
  • #9: Keith Taylor showed yesterday a Pipeline pilot workflow with a node for converting structures to namesWorkflow integration with Pipeline Pilot.Node use 1 of Lexichem’s functions-> mol2nam, nam2mol, translateMatt Stahl working on nodes using OpenEye Software.Lexichem node highlighted in square ->convert structures to names.
  • #10: Craig Bruce hired recently, been developing Webservices for OE, one being LexichemUse Lexichem prior / post processing to Webservice.PUG (Power User Gateway)Search for structures with the names found in across PubChem.
  • #11: Chemical name extraction from patents/documents and structure generation (Lexiparser uses Lexichem)->Uses Lexichem after it’s extracted chemical names to convert them to molecules.
  • #13: Lexichem is the engine beneath the hood, when a user draws a structure a call to Lexichem is made which generates a name which can be rendered.Alternatively you can import chemical names convert them to molecules and visualize the image.
  • #14: Purpose of metric-> ensure we are not regressing but improving Lexichem-> identify areas / features in need of improvement-> gold standard which other companies can then compare chemical nomenclature software
  • #15: ->Concept: start pt, and an end pt after some processing, then compare start pt to end pt+ve: quick to calculate, gives one figure value of how accurate Lexichem is on dataset->Paper acceptedAdv SMILEs: human readable, less verbose than inchi, tautomer support
  • #16: ->Concept: start pt, and an end pt after some processing, then compare start pt to end pt+ve: quick to calculate, gives one figure value of how accurate Lexichem is on dataset->Paper acceptedAdv SMILEs: human readable, less verbose than inchi, tautomer support
  • #17: The concept of a benchmark is good, but do we have good results using it?
  • #18: Whilst these results are good, is Lexichem feasible on a large scale?Seen Lexichem performs well and is feasible on large datasets, now lets look at what features been added.----- Meeting Notes (3/28/12 11:38) -----Mol2Nam -> canonicalize atoms & bonds,identify atom types, identify ring systems and size, bridges, locants and positions,identify stereo / walk the graph
  • #19: So what have we added since v2.0.2?Our main drive has been looking nam2mol features (as they’re not quite as well supported as mol2nam), in particular the ability to generate molecules for large ring systems. (one of Lexichem’s weaker points in the previous releases)
  • #20: Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
  • #21: Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
  • #22: Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
  • #23: Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
  • #24: Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
  • #25: Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
  • #26: Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
  • #27: Von Baeyer -> polyalicyclic ring systems-> previously only bicyclic supported.-> working hard on augmenting natural productsBeta-carotene in carrots -> Provitamin A carotenoid
  • #28: Ring templates for conversion of molecules to names.Mainly bridge and fused ring templates have been added.
  • #29: Primary goal of the GUI was to:-> Lower the bar to use Lexichem’s functionality (for people not keen on using an API to program against, or using the command line)
  • #31: ->Primarily modeled on the command line tools, but provided numerous additional features2 Menus: Open a molecular fileImport name, SMILES, set default options, clear the view.
  • #35: Filter the results
  • #36: Story about Lexichem, you know you want it, then please feel free to contact usFuture work: continue to work on fused polycyclic ring systems, natural products