This document discusses using the R programming environment for cheminformatics and chemical data mining. It describes how R is enhanced by open source software like the Chemistry Development Kit (CDK) which provides capabilities for working with chemical structure data within R. The CDK allows reading/writing different file formats, generating fingerprints, visualizing molecules, and accessing public databases directly from R to streamline workflows like quantitative structure-activity relationship modeling. This enables reproducible science by ensuring analyses can be updated based on the latest data.