twenty five years of progress in cheminformatics - wendy warr

62
Wendy Warr & Associates Twenty Five Years of Progress in Twenty Five Years of Progress in Cheminformatics Cheminformatics Dr. Wendy A. Warr Dr. Wendy A. Warr http:// http:// www.warr.com www.warr.com

Upload: others

Post on 12-Feb-2022

3 views

Category:

Documents


0 download

TRANSCRIPT

Wendy Warr & Associates

Twenty Five Years of Progress in Twenty Five Years of Progress in CheminformaticsCheminformatics

Dr. Wendy A. WarrDr. Wendy A. Warrhttp://http://www.warr.comwww.warr.com

Wendy Warr & Associates

JCIM

J. Chem. Inf. Comput. Sci.(JCICS)is nowJ. Chem. Inf. Model. (JCIM)

Wendy Warr & Associates

The ChemistThe ChemistReaderReaderAuthorAuthor

ReviewerReviewer EditorEditor

Wendy Warr & Associates

Our AuthorsOur Authors

Wendy Warr & Associates

SciFinderSciFinderGasteigerGasteiger

Wendy Warr & Associates

ACS Award for Computers in Chemical ACS Award for Computers in Chemical and Pharmaceutical Researchand Pharmaceutical Research

1986 Raymond E. 1986 Raymond E. DessyDessy1987 W. Todd 1987 W. Todd WipkeWipke1988 W.A. Goddard III1988 W.A. Goddard III1989 Christie G. 1989 Christie G. EnkeEnke1990 Peter C. 1990 Peter C. JursJurs1991 John A. 1991 John A. PoplePople1992 Ernest R. Davidson1992 Ernest R. Davidson1993 W. Clark Still1993 W. Clark Still1994 Michael J.S. Dewar1994 Michael J.S. Dewar1995 Peter A. 1995 Peter A. KollmanKollman1996 Norman L. 1996 Norman L. Allinger

1997 Harold A. 1997 Harold A. ScheragaScheraga1998 William L. Jorgensen1998 William L. Jorgensen1999 Corwin H. 1999 Corwin H. HanschHansch2000 Donald G. 2000 Donald G. TruhlarTruhlar2001 Martin 2001 Martin KarplusKarplus2002 Irwin D. Kuntz2002 Irwin D. Kuntz2003 Kendall N. 2003 Kendall N. HoukHouk2004 W. Graham Richards2004 W. Graham Richards2005 Peter Willett2005 Peter Willett2006 Johann 2006 Johann GasteigerGasteiger

Allinger

Wendy Warr & Associates

ACS Division of Chemical InformationACS Division of Chemical InformationHerman Herman SkolnikSkolnik AwardAward

Herman Herman SkolnikSkolnik (1976)(1976)Eugene Garfield (1977)Eugene Garfield (1977)Fred A. Tate (1978)Fred A. Tate (1978)William J. William J. WiswesserWiswesser (1979)(1979)Ben H. Weil (1981)Ben H. Weil (1981)*Robert *Robert FugmannFugmann (1982)(1982)Russell J. Rowlett, Jr. (1983)Russell J. Rowlett, Jr. (1983)MontaguMontagu HyamsHyams (1984)(1984)Dale B. Baker (1986)Dale B. Baker (1986)William William TheilheimerTheilheimer (1987)(1987)David R. David R. LideLide, Jr. (1988), Jr. (1988)Michael F. Lynch (1989)Michael F. Lynch (1989)Stuart A. Stuart A. MarsonMarson (1989)(1989)*Ernst Meyer (1990)*Ernst Meyer (1990)W. Todd W. Todd WipkeWipke (1991)(1991)JaquesJaques--Emile Dubois (1992)

Peter Willett (1993)Peter Willett (1993)AlexandruAlexandru T. T. BalabanBalaban (1994)(1994)**ReinerReiner LuckenbachLuckenbach (1995)(1995)*Clemens *Clemens JochumJochum (1995)(1995)Milan Milan RandicRandic (1996)(1996)*Johann *Johann GasteigerGasteiger (1997)(1997)Gary Wiggins (1998)Gary Wiggins (1998)Stuart Stuart KabackKaback (1999)(1999)Stephen R. Heller (2000)Stephen R. Heller (2000)G.W.A. (Bill) Milne (2000)G.W.A. (Bill) Milne (2000)GuenterGuenter GretheGrethe (2001)(2001)Peter Norton (2002)Peter Norton (2002)Frank Allen (2003)Frank Allen (2003)Peter Johnson (2004)Peter Johnson (2004)LorrinLorrin Garson (2005)Garson (2005)Hugo Hugo KubinyiKubinyi (2006)(2006)Emile Dubois (1992)

Wendy Warr & Associates

ACS Division of Chemical InformationACS Division of Chemical InformationHerman Herman SkolnikSkolnik AwardAward

•• Robert Robert FugmannFugmann (1982)(1982)•• Ernst Meyer (1990)Ernst Meyer (1990)•• ReinerReiner LuckenbachLuckenbach (1995)(1995)•• Clemens Clemens JochumJochum (1995)(1995)•• Johann Johann GasteigerGasteiger (1997)(1997)

•• GuenterGuenter GretheGrethe (2001)(2001)•• Hugo Hugo KubinyiKubinyi (2006)(2006)

Wendy Warr & Associates

WiswesserWiswesser Line NotationLine Notation

NO2

OSO2

N

Cl

T6NJ NWB C DOSWR G

Wendy Warr & Associates

CROSSBOWCROSSBOWCComputerizedomputerizedRRetrieval ofetrieval ofSStructuretructureSSBBasedasedOOnnWWiswesseriswesser

Wendy Warr & Associates

Structure RepresentationStructure Representation

•• WeiningerWeininger, D., D. SMILES, a chemical language SMILES, a chemical language and information system. 1. Introduction to and information system. 1. Introduction to methodology and encoding rules. methodology and encoding rules. J. Chem. J. Chem. Inf. Inf. ComputComput. . SciSci.. 19881988, , 2828, 31, 31--36.36.

•• WeiningerWeininger, D.; , D.; WeiningerWeininger, A.; , A.; WeiningerWeininger, J. L. , J. L. SMILES. 2. Algorithm for generation of SMILES. 2. Algorithm for generation of unique SMILES notation. unique SMILES notation. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 19891989, , 2929, 97, 97--101.101.

Wendy Warr & Associates

Structure RepresentationStructure Representation

•• Morgan, H. L. The generation of a unique Morgan, H. L. The generation of a unique machine description for chemical structures machine description for chemical structures --a technique developed at Chemical Abstracts a technique developed at Chemical Abstracts Service. Service. J. Chem. Doc.J. Chem. Doc. 19651965, , 55, 107, 107--113.113.

•• WipkeWipke, W. T.;, W. T.; DyottDyott, T. M. , T. M. StereochemicallyStereochemicallyunique naming algorithm.unique naming algorithm. J. Am. Chem. Soc.J. Am. Chem. Soc.19741974, , 9696(15), 4834(15), 4834--4842.4842.

Wendy Warr & Associates

Structure RepresentationStructure Representation

•• Canonical numbering for Canonical numbering for InChIInChI: : modified from McKay, B. D. Practical modified from McKay, B. D. Practical graph isomorphism. graph isomorphism. CongressusCongressusNumerantiumNumerantium 19811981, , 3030, 45, 45––87.87.

Wendy Warr & Associates

Substructure SearchingSubstructure Searching

•• NIH/EPA Chemical Information SystemNIH/EPA Chemical Information System•• DARCDARC•• CAS ONLINECAS ONLINE•• MACCS/ISISMACCS/ISIS

Wendy Warr & Associates

MeyerMeyer’’s Formula Reading s Formula Reading MachineMachine

Wendy Warr & Associates

ImlacImlac TerminalTerminal

Wendy Warr & Associates

VT640 TerminalVT640 Terminal

Wendy Warr & Associates

1984 VAX 11/7501984 VAX 11/750

•• Clock speed 6 MHzClock speed 6 MHz•• 2 Mb memory2 Mb memory•• 134 Mb fixed disk134 Mb fixed disk•• Two 67 Mb exchangeable disk drivesTwo 67 Mb exchangeable disk drives•• Shared peripheralsShared peripherals•• ££100,000 (1984 price)100,000 (1984 price)

Wendy Warr & Associates

WWindowsindows

IIconscons

MMiceice

PPointersointers

Wendy Warr & Associates

MicrocomputersMicrocomputers

Wendy Warr & Associates

Wendy Warr & Associates

IvarIvar UgiUgi1930 1930 -- 20052005

Wendy Warr & Associates

CAOSCAOS

•• Corey, E.J.; Corey, E.J.; WipkeWipke, W.T. Computer, W.T. Computer--assisted Design assisted Design of Complex Organic Syntheses. of Complex Organic Syntheses. ScienceScience, , 19691969, , 166 166 ((39023902)), , 178178--192.192.

•• DugundjiDugundji, J.; , J.; UgiUgi, I. Algebraic Model of , I. Algebraic Model of Constitutional Chemistry as a basis for Chemical Constitutional Chemistry as a basis for Chemical Computer Programs. Computer Programs. Top. Top. CurrCurr. Chem.. Chem. 19731973, , 3939, , 1919--64.64.

•• Bauer, J.; Bauer, J.; HergesHerges, R.; , R.; FontainFontain, E.; , E.; UgiUgi, I. IGOR and , I. IGOR and Computer Assisted Innovation in Chemistry. Computer Assisted Innovation in Chemistry. ChimiaChimia, , 1985, 39, 451985, 39, 45--53.53.

Wendy Warr & Associates

Reaction ProgramsReaction Programs

•• Reaction retrievalReaction retrieval•• Synthetic analysisSynthetic analysis

–– Synthesis designSynthesis design–– Reaction predictionReaction prediction–– Mechanism elucidationMechanism elucidation

Wendy Warr & Associates

Reaction RetrievalReaction Retrieval

•• REACCS/ISISREACCS/ISIS•• ORACORAC•• CASREACTCASREACT

Wendy Warr & Associates

Synthetic AnalysisSynthetic Analysis

•• LogicLogic--orientedoriented–– IGORIGOR–– EROS EROS –– SYNGENSYNGEN–– CAMEOCAMEO–– WODCAWODCA

•• InformationInformation--orientedoriented–– LHASALHASA–– SECS/CASPSECS/CASP

Wendy Warr & Associates

Reaction ClassificationReaction Classification

InfoChemInfoChem Classification AlgorithmClassification Algorithm

CLASSIFYCLASSIFY

Wendy Warr & Associates

Cambridge Structural DatabaseCambridge Structural Database

•• Allen, F. H. The Cambridge Structural Allen, F. H. The Cambridge Structural Database: a quarter of a million structures Database: a quarter of a million structures and rising. and rising. ActaActa CrystCryst. Section B. Section B 20022002, , 5858, , 380380--388.388.

•• Allen, F. H.; Davies, J. E.; Allen, F. H.; Davies, J. E.; GalloyGalloy, J. J.; , J. J.; Johnson, O.; Kennard, O.; Johnson, O.; Kennard, O.; MacraeMacrae, C. F.; , C. F.; Mitchell, E. M.; Mitchell, G. F.; Smith, J. M.; Mitchell, E. M.; Mitchell, G. F.; Smith, J. M.; Watson, D. G. The development of versions 3 Watson, D. G. The development of versions 3 and 4 of the Cambridge Structural Database and 4 of the Cambridge Structural Database System. System. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 19911991, , 31,31,187187--204.204.

Wendy Warr & Associates

3D Structures3D Structures

•• Pearlman, R. S. Rapid generation of high quality Pearlman, R. S. Rapid generation of high quality approximate 3D molecular structures. approximate 3D molecular structures. Chemical Chemical Design Automation NewsDesign Automation News, , 19871987, , 2,2, 1, 51, 5--7.7.

•• Hiller, C.; Hiller, C.; GasteigerGasteiger, J. , J. EinEin automatisierterautomatisierterMolekMoleküülbaukastenlbaukasten. In . In SoftwareSoftware--EntwicklungEntwicklung in in derderChemieChemie, , GasteigerGasteiger, J., Ed.; Springer: Berlin, 1987; , J., Ed.; Springer: Berlin, 1987; Vol. 1; pp. 53Vol. 1; pp. 53--66.66.

•• GasteigerGasteiger, J.; Rudolph, C.; , J.; Rudolph, C.; SadowskiSadowski, J., J.Automatic generation of 3D atomic coordinates for Automatic generation of 3D atomic coordinates for organic molecules. organic molecules. Tetrahedron Computer Tetrahedron Computer MethodologyMethodology 19901990, , 33, 537, 537--547. 547.

Wendy Warr & Associates

GasteigerGasteiger Most CitedMost Cited

SadowskiSadowski, J.; , J.; GasteigerGasteiger, J.; , J.; KlebeKlebe, G. , G. Comparison of Automatic ThreeComparison of Automatic Three--Dimensional Model Builders Using 639 Dimensional Model Builders Using 639 XX--ray Structuresray Structures. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 19941994, , 3434, 1000, 1000--1008. 1008.

Wendy Warr & Associates

3D Searching3D Searching•• Jakes, S.E.; Willett, P. Jakes, S.E.; Willett, P. PharmacophoricPharmacophoric pattern pattern

matching in files of 3D chemical structures: selection matching in files of 3D chemical structures: selection of interof inter--atomic distance screens. atomic distance screens. J. Mol. Graph.J. Mol. Graph. 19861986, , 44, 12, 12-- 20.20.

•• CringeanCringean, J.K.; , J.K.; PepperrellPepperrell, C.A.; , C.A.; PoirrettePoirrette, A.R.; , A.R.; Willett, P. Selection of screens for threeWillett, P. Selection of screens for three--dimensional dimensional substructure searching. substructure searching. Tetrahedron Computer Tetrahedron Computer MethodologyMethodology 19901990, , 33, 37, 37--46.46.

•• BrintBrint, A. T.; Willett, P. , A. T.; Willett, P. PharmacophoricPharmacophoric Pattern Pattern Matching in Files of 3D Chemical Structures: Matching in Files of 3D Chemical Structures: Comparison of Geometric Searching Algorithms. Comparison of Geometric Searching Algorithms. J. J. Mol. Graph.Mol. Graph. 19871987, , 55, 49, 49--56.56.

Wendy Warr & Associates

3D Searching3D Searching

•• Martin, Y. C.; Danaher, E. B.; May, C. S.; Martin, Y. C.; Danaher, E. B.; May, C. S.; WeiningerWeininger, D. , D. MENTHOR, a database system for the storage and MENTHOR, a database system for the storage and retrieval of threeretrieval of three--dimensional molecular structures and dimensional molecular structures and associated data searchable by associated data searchable by substructuralsubstructural, biologic, , biologic, physical, or geometric properties. physical, or geometric properties. J. J. ComputComput..--Aided Mol. Aided Mol. DesignDesign 19881988, , 22((11), 15), 15--29.29.

•• Van Van DrieDrie, J. H.; , J. H.; WeiningerWeininger, D.; Martin, Y. C. ALADDIN: , D.; Martin, Y. C. ALADDIN: an integrated tool for computeran integrated tool for computer--assisted molecular assisted molecular design and design and pharmacophorepharmacophore recognition from geometric, recognition from geometric, stericsteric, and substructure searching of three, and substructure searching of three--dimensional dimensional molecular structures. molecular structures. J. J. ComputComput..--Aided Mol. DesignAided Mol. Design19891989, , 33((33), 225), 225--251.251.

Wendy Warr & Associates

Molecules TwistMolecules Twist

Wendy Warr & Associates

O

OH

NH2

N

O

N

N OO

Br

N

O

N

O

NN

O

O

O

O

OO MolecularMolecular

DiversityDiversity

NHO

N

O

R1

O

O

O

O O

O

O

O

P

O

O

Wendy Warr & Associates

SelectionSelection

1010180180 possible drugs...10possible drugs...101818 likely drugs...likely drugs...101077 known compounds...10known compounds...1066 commerciallycommerciallyavailable...10available...106 6 in corporate databases...in corporate databases...101044 in drug databases...10in drug databases...1033 commercial commercial drugs...drugs...

101022 profitable drugsprofitable drugs

Wendy Warr & Associates

Library DesignLibrary Design

•• Random screening is too expensiveRandom screening is too expensive•• Its hit rate is lowIts hit rate is low•• False positives may be a problemFalse positives may be a problem•• HTS consumes expensive compoundsHTS consumes expensive compounds•• There are too many possible There are too many possible

compoundscompounds•• How to choose those most likely to be How to choose those most likely to be

hits?hits?

Wendy Warr & Associates

Selecting Diverse SubsetsSelecting Diverse Subsets

•• ClusteringClustering•• DissimilarityDissimilarity--based selectionbased selection•• Partitioning/cellPartitioning/cell--based approachesbased approaches•• OptimizationOptimization--based methodsbased methods

Wendy Warr & Associates

Measuring DiversityMeasuring Diversity

Martin, E. J.; Martin, E. J.; BlaneyBlaney, J. M.; , J. M.; SianiSiani, M. A.; , M. A.; SpellmeyerSpellmeyer, D. C.; Wong, A. K.; Moos, , D. C.; Wong, A. K.; Moos, W. H. W. H. Measuring diversity: experimental Measuring diversity: experimental design of combinatorial libraries for drug design of combinatorial libraries for drug discovery.discovery. J. Med. Chem.J. Med. Chem. 19951995, , 3838, , 14311431--1436.1436.

Wendy Warr & Associates

DiversityDiversity

•• Brown, R. D.; Martin, Y. C. Use of structureBrown, R. D.; Martin, Y. C. Use of structure--activity data to compare structureactivity data to compare structure--based based clustering methods and descriptors for use in clustering methods and descriptors for use in compound selection. compound selection. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 19961996, , 3636, 572 , 572 --584.584.

•• Brown, R. D.; Martin, Y. C. The information Brown, R. D.; Martin, Y. C. The information content of 2D and 3D structural descriptors content of 2D and 3D structural descriptors relevant to relevant to ligandligand--receptor binding. receptor binding. J. Chem. J. Chem. Inf. Inf. ComputComput. . SciSci.., , 3737 (1), 1 (1), 1 --9, 1997.9, 1997.

Wendy Warr & Associates

ProductProduct--based orbased orReagentReagent--based Designbased Design

GilletGillet, V. J.; Willett, P.; Bradshaw, J. , V. J.; Willett, P.; Bradshaw, J. The effectiveness of reactant pools for The effectiveness of reactant pools for generating structurally diverse generating structurally diverse combinatorial libraries. combinatorial libraries. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 19971997, , 3737, 731, 731--740.740.

Wendy Warr & Associates

Leads and DrugsLeads and Drugs•• Lipinski, C. A.; Lombardo, F.; Lipinski, C. A.; Lombardo, F.; DominyDominy, B. W.; , B. W.;

Feeney, P. J. Experimental and Feeney, P. J. Experimental and computational approaches to estimate computational approaches to estimate solubility and permeability in drug discovery solubility and permeability in drug discovery and development settingsand development settings. Adv. Drug Delivery . Adv. Drug Delivery Rev.Rev. 19971997, , 23(123(1--3)3), 3, 3--25.25.

•• HannHann, M. H.; Leach, A. R.; Harper, G. , M. H.; Leach, A. R.; Harper, G. Molecular complexity and its impact on the Molecular complexity and its impact on the probability of finding leads for drug discovery. probability of finding leads for drug discovery. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 20012001, , 4141, 856 , 856 --864. 864.

Wendy Warr & Associates

Leads and DrugsLeads and Drugs

•• OpreaOprea, T. I.; Davis, A. M.; Teague, S. J.; , T. I.; Davis, A. M.; Teague, S. J.; LeesonLeeson, P. D. Is there a difference , P. D. Is there a difference between leads and drugs? A historical between leads and drugs? A historical perspective. perspective. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci..20012001, , 4141, 1308 , 1308 --1315. 1315.

•• Teague, S. J.; Davis, A. M.; Teague, S. J.; Davis, A. M.; LeesonLeeson, P. , P. D.; D.; OpreaOprea, T. The design of , T. The design of leadlikeleadlikecombinatorial libraries. combinatorial libraries. AngewAngew. Chem., . Chem., Int. EdInt. Ed. . Engl.Engl. 19991999, , 38(24)38(24), 3743, 3743--3748.3748.

Wendy Warr & Associates

Methods for Methods for vHTSvHTS

•• Identifying drugIdentifying drug--like structureslike structures•• 2D similarity2D similarity•• 3D pharmacophores3D pharmacophores•• DockingDocking

Wendy Warr & Associates

DockingDocking•• Kuntz, I. D.; Kuntz, I. D.; BlaneyBlaney, J. M.; , J. M.; OatleyOatley, S. J.; , S. J.;

LangridgeLangridge, R.; , R.; FerrinFerrin, T. E. , T. E. J. Mol. Biol.J. Mol. Biol., , 19821982, , 161161, 269., 269.

•• ShoichetShoichet, B. K.; Kuntz, I.D. Protein docking , B. K.; Kuntz, I.D. Protein docking and and complementaritycomplementarity.. J. Mol. Biol.J. Mol. Biol. 19911991, , 221221, , 327327--46. 46.

•• ShoichetShoichet, B. K.; Stroud, R. M.; , B. K.; Stroud, R. M.; SantiSanti, D. V.; , D. V.; Kuntz, I. D.; Perry, K. M. StructureKuntz, I. D.; Perry, K. M. Structure--based based discovery of inhibitors of discovery of inhibitors of thymidylatethymidylatesynthasesynthase. . ScienceScience 1993, 1993, 259259, 1445, 1445--1450.1450.

Wendy Warr & Associates

DockingDocking•• BBööhmhm, H. J. , H. J. The development of a simple The development of a simple

empirical scoring function to estimate the empirical scoring function to estimate the binding constant for a proteinbinding constant for a protein--ligandligand complex complex of known threeof known three--dimensional structure. dimensional structure. J. J. ComputComput..--Aided Mol. Des.Aided Mol. Des. 19941994, , 88((33), ), 243243--56.56.

•• Jones, G.; Willett, P.; Glen, R. C.; Leach, A. Jones, G.; Willett, P.; Glen, R. C.; Leach, A. R.; Taylor, R. Development and validation of R.; Taylor, R. Development and validation of a genetic algorithm for flexible docking. a genetic algorithm for flexible docking. J. J. Mol. Biol.Mol. Biol. 19971997, , 267267, 727, 727--748.748.

Wendy Warr & Associates

GOLD docking result for PDB 1FKG

Wendy Warr & Associates

Most Cited. Top TwentyMost Cited. Top Twenty•• 20.20. GilletGillet, V. J.; Willett, P.; Bradshaw, J. Identification , V. J.; Willett, P.; Bradshaw, J. Identification

of Biological Activity Profiles Using of Biological Activity Profiles Using SubstructuralSubstructuralAnalysis and Genetic Algorithms.Analysis and Genetic Algorithms. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 19981998, , 3838, 165, 165--179.179.

•• 19.19. Pearlman, R. S.; Smith, K. M. Metric Validation Pearlman, R. S.; Smith, K. M. Metric Validation and the Receptorand the Receptor--Relevant Subspace ConceptRelevant Subspace Concept.. J. J. Chem. Inf. Chem. Inf. ComputComput. . SciSci. . 19991999, , 3939, 28, 28--35.35.

•• 18.18. PlattsPlatts, J. A.; , J. A.; ButinaButina, D.; Abraham, M. H.; , D.; Abraham, M. H.; HerseyHersey, , A. Estimation of Molecular Linear Free Energy A. Estimation of Molecular Linear Free Energy Relation Descriptors Using a Group Contribution Relation Descriptors Using a Group Contribution Approach. Approach. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 19991999; ; 3939, 835, 835--845.845.

Wendy Warr & Associates

Most Cited. Top TwentyMost Cited. Top Twenty•• 17.17. GilletGillet, V. J.; Willett, P.; Bradshaw, J. The , V. J.; Willett, P.; Bradshaw, J. The

Effectiveness of Reactant Pools for Generating Effectiveness of Reactant Pools for Generating Structurally Diverse Combinatorial Libraries. Structurally Diverse Combinatorial Libraries. J. J. Chem. Inf. Chem. Inf. ComputComput. . SciSci.. 19971997, , 3737, 731, 731--740.740.

•• 16.16. KrygowskiKrygowski, T. M. Crystallographic Studies of , T. M. Crystallographic Studies of InterInter-- and and IntramolecularIntramolecular Interactions Reflected in Interactions Reflected in AaromaticAaromatic Character of Character of ππ--Electron Systems. Electron Systems. J. J. Chem. Inf. Chem. Inf. ComputComput. . SciSci.. 19931993, , 3333, 70, 70--78.78.

•• 1515.. HannHann, M. M.; Leach, A. R.; Harper, G. Molecular , M. M.; Leach, A. R.; Harper, G. Molecular Complexity and Its Impact on the Probability of Complexity and Its Impact on the Probability of Finding Leads for Drug Discovery. Finding Leads for Drug Discovery. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 20012001,, 41,41, 856856--864.864.

Wendy Warr & Associates

Most CitedMost Cited•• 14.14. GilletGillet, V. J.; Willett, P.; Bradshaw, J. The , V. J.; Willett, P.; Bradshaw, J. The

effectiveness of reactant pools for generating effectiveness of reactant pools for generating structurally diverse combinatorial libraries. structurally diverse combinatorial libraries. J. Chem. J. Chem. Inf. Inf. ComputComput. . SciSci.. 19971997, , 3737, 731, 731--740.740.

•• 13.13. Hall, L. H.; Hall, L. H.; MohneyMohney, B; Kier, L. B. The , B; Kier, L. B. The electrotopologicalelectrotopological state: structure information at the state: structure information at the atomic level for molecular graphs. atomic level for molecular graphs. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci. . 19911991, , 31,31, 7676--82.82.

•• 12.12. Hall, L. H.;Hall, L. H.; Kier, L. B. Kier, L. B. ElectrotopologicalElectrotopological state state indices for atom types: a novel combination of indices for atom types: a novel combination of electronic, topological, and valence state information. electronic, topological, and valence state information. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 19951995, , 3535, 1039, 1039--1045.1045.

Wendy Warr & Associates

Most CitedMost Cited•• 11.11. GhoseGhose, A. K.; , A. K.; CrippenCrippen, G. M. Atomic , G. M. Atomic

physicochemical parameters for threephysicochemical parameters for three--dimensionaldimensional--structurestructure--directed quantitative structuredirected quantitative structure--activity activity relationships. 2. Modeling dispersive and relationships. 2. Modeling dispersive and hydrophobic interactions. hydrophobic interactions. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.;.;19871987, , 2727, 21, 21--35.35.

•• 10.10. CarhartCarhart, R. E.; Smith, D. H.; , R. E.; Smith, D. H.; VenkataraghavanVenkataraghavan, R. , R. Atom pairs as molecular features in structureAtom pairs as molecular features in structure--activity activity studies: definition and applications. studies: definition and applications. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 19851985, , 2525, 64, 64--73.73.

•• 9.9. WesselWessel, M. D.; , M. D.; JursJurs, P. C.; , P. C.; TolanTolan, J. W.; , J. W.; MuskalMuskal, S. , S. M. Prediction of human intestinal absorption of drug M. Prediction of human intestinal absorption of drug compounds from molecular structure. compounds from molecular structure. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 19981998, , 3838, 726, 726--735.735.

Wendy Warr & Associates

Most CitedMost Cited

•• 8.8. Brown, R. D.; Martin, Y. C. The information content Brown, R. D.; Martin, Y. C. The information content of 2D and 3D structural descriptors relevant to of 2D and 3D structural descriptors relevant to ligandligand--receptor binding. receptor binding. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci. . 19971997, , 3737, 1, 1--9.9.

•• 7.7. Willett, P.; Barnard, J. M.; Downs, G. M. Chemical Willett, P.; Barnard, J. M.; Downs, G. M. Chemical similarity searching. similarity searching. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 19981998, , 3838, 983, 983--996.996.

•• 6.6. Rogers, D.; Rogers, D.; HopfingerHopfinger, A. J. Application of genetic , A. J. Application of genetic function approximation to quantitative structurefunction approximation to quantitative structure--activity relationships and quantitative structureactivity relationships and quantitative structure--property relationships. property relationships. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci..19941994, , 3434, 854, 854--866.866.

Wendy Warr & Associates

Most CitedMost Cited

•• 5.5. WeiningerWeininger, D. SMILES, a chemical language and information , D. SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. system. 1. Introduction to methodology and encoding rules. J. J. Chem. Inf. Chem. Inf. ComputComput. . SciSci.. 19881988, , 2828, 31, 31--36.36.

•• 4.4. ViswanadhanViswanadhan, V. N.; , V. N.; GhoseGhose, A. K.; , A. K.; RevankarRevankar, G. R.; Robins, , G. R.; Robins, R. K. Atomic physicochemical parameters for three dimensional R. K. Atomic physicochemical parameters for three dimensional structure directed quantitative structurestructure directed quantitative structure--activity relationships. 4. activity relationships. 4. Additional parameters for hydrophobic and dispersive Additional parameters for hydrophobic and dispersive interactions and their application for an automated superpositiointeractions and their application for an automated superposition n of certain naturally occurring nucleoside antibiotics. of certain naturally occurring nucleoside antibiotics. J. Chem. J. Chem. Inf. Inf. ComputComput. . SciSci.. 19891989, , 2929, 163, 163--172.172.

•• 3.3. Brown, R. D.; Martin, Y. C. Use of structureBrown, R. D.; Martin, Y. C. Use of structure--activity data to activity data to compare structurecompare structure--based clustering methods and descriptors for based clustering methods and descriptors for use in compound selection. use in compound selection. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 19961996, , 3636, , 572572--584.584.

Wendy Warr & Associates

Most CitedMost Cited

•• 2.2. Fletcher, D. A.; Fletcher, D. A.; McMeekingMcMeeking, R. F.; , R. F.; ParkinParkin, D. The , D. The United Kingdom chemical database service. United Kingdom chemical database service. J. Chem. J. Chem. Inf. Inf. ComputComput. . SciSci. . 19961996, , 3636, 746, 746--749.749.

•• 1.1. Allen, F. H.; Davies, J. E.; Allen, F. H.; Davies, J. E.; GalloyGalloy, J. J.; Johnson, , J. J.; Johnson, O.; Kennard, O.; O.; Kennard, O.; MacraeMacrae, C. F.; Mitchell, E. M.; , C. F.; Mitchell, E. M.; Mitchell, G. F.; Smith, J. M.; Watson, D. G. The Mitchell, G. F.; Smith, J. M.; Watson, D. G. The development of versions 3 and 4 of the Cambridge development of versions 3 and 4 of the Cambridge Structural Database System. Structural Database System. J. Chem. Inf. J. Chem. Inf. ComputComput. . SciSci.. 19911991, , 31,31, 187187--204.204.

Wendy Warr & Associates

AcknowledgmentsAcknowledgments

•• Chemical Abstracts ServiceChemical Abstracts Service•• Eric Shively, CASEric Shively, CAS•• Michael LynchMichael Lynch

Wendy Warr & Associates

The Next 25 Years?The Next 25 Years?

Wendy Warr & Associates

Fearless predictionsFearless predictions……

““Where a calculator on the Where a calculator on the EniacEniac is is equipped with 18,000 vacuum tubes and equipped with 18,000 vacuum tubes and weighs 30 tons, computers in the future weighs 30 tons, computers in the future may have only 1,000 vacuum tubes and may have only 1,000 vacuum tubes and perhaps weigh 1.5 tons.perhaps weigh 1.5 tons.””

Popular Mechanics, Popular Mechanics, MarchMarch 19491949

Wendy Warr & Associates

Basic Principle of ForecastingBasic Principle of Forecasting

““Give them a number or give them a date,Give them a number or give them a date,but never both.but never both.””

Edgar FiedlerEdgar Fiedler

Wendy Warr & Associates

Technology Adoption CycleTechnology Adoption Cycle

Wendy Warr & Associates

Information OverloadInformation Overload

•• The LordThe Lord’’s Prayer uses 56 wordss Prayer uses 56 words•• The Ten Commandments, 297 wordsThe Ten Commandments, 297 words•• The American Declaration of The American Declaration of

Independence, 300 wordsIndependence, 300 words•• The EEC Directive on the import of The EEC Directive on the import of

caramel and caramel products uses caramel and caramel products uses 26,911 words26,911 words

Wendy Warr & Associates

A Mobile CAS ServiceA Mobile CAS Service

Wendy Warr & Associates

New Infrastructures, MiddlewareNew Infrastructures, Middleware

•• ee--ScienceScience•• CyberinfrastructureCyberinfrastructure•• eSciDoceSciDoc

Wendy Warr & Associates

““Here we sit side by side with thoseHere we sit side by side with thoseon whose shoulders we standon whose shoulders we stand””

Michael Lynch, 2002Michael Lynch, 2002