Profile (2)

Booth Dennis is a Data Scientist/Engineer with 7 years of experience in data processing, machine learning, and AI, currently working at Cabot Financial. He has led significant projects, including developing a charging order identifier and customer segmentation strategies, while also mentoring junior employees. His research interests encompass predictive modeling, deep learning, and graph theory, with a strong focus on customer service and team performance.

Uploaded by

Disney

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

Profile (2)

Uploaded by

Disney

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Contact

[email protected]
Booth Dennis
Data Coder Operator at Cabot Financial
www.linkedin.com/in/booth-dennis- Antioch, Illinois, United States
b70a88312 (LinkedIn)

Summary
Top Skills
Stakeholder Engagement An adaptable Data Scientist/Engineer with 7 years professional
Sequence Analysis experience delivering a wide array of impactful products for data
Customer Segmentation Strategy processing, machine learning and artificial intelligence. Through
influencing and communication with stakeholders, many of these
products have been deployed to be used regularly throughout
respective organisations. Especial regard is given to customer
service and satisfaction, to motivate and drive forward team
performance, recently adapting agile and DevOps practices. Very
open to new ideas and developing new skills, to continuously stay
competitive and to strive for excellence.

My research interests include:

· Predictive Modelling and Quantitative Structure-Property
Relationships (QSAR /QSPR)
· Deep Learning
· Graph Theory
· Similarity Searching (mostly in the context of Virtual Screening
using graph theory and/or fingerprints)
· Compound Deck design and Multi-Parameter Optimisation

Experience
Cabot Financial
Data Coder Operator

Cabot Financial
Senior Data Scientist
January 2022 - March 2023 (1 year 3 months)
Machine learning project lead and developer, building processes to help
recover debt on two projects:
· Built a charging order identifier (debts attached to property) using Spark NLP,
to search through historic company notes, to find references of charging orders
with a classification accuracy of 84%. Has the potential to recover hundreds

Page 1 of 4
of thousands of pounds of debt, though is currently pending deployment to
stakeholders.
· As the resident spark and cloud expert, upskilled other employees via
presentations and support groups, particularly on best practices to ensure
spark and DataBricks code was better optimised for speed and memory.
· Built a new customer segmentation strategy over a previous version, using
Experian and Open API (Doorda) data alongside internal customer information
to engineer features to optimise cluster separation.
· Managed two junior employees for 5 weeks in line manager’s absence to
progress the charging order project

THAMES WATER UTILITIES LIMITED

Senior Data Scientist
November 2018 - December 2021 (3 years 2 months)
Pioneer in Smart Metering and Leakage analytics introducing ground-breaking
products via the Azure platform to drive Thames Water into outstanding levels
of successful data utilisation. Along with managing a data scientist, our team
has:
· Built a critically acclaimed automated model to predict failed mains sensors;
hailed as one of the greatest breakthroughs in data science and modelling in
the business. Built to output daily scheduled predictions to Azure Synapse,
where stakeholders could access model outputs via Microsoft Excel. Model
was continuously used to direct technicians to predicted failed sensors for
immediate replacement.
· Built a shared supply predictor to slash customer complaints and thus
mitigate extortionate fines to the business. Built in Azure DataBricks (pyspark),
using recommended features from subject matter experts across the business,
and subsequently used to direct technicians to pre-emptively address shared
supply properties.
· Produced a random forest regressor, for predicting number of occupants in
a house (mean squared error 0.8). Pyspark was used to produce the model,
mining terabytes of meter reads from half a million customer accounts.
· Automated several models and projects, to give scheduled outputs to
stakeholders. Azure Data Factory ETL was used for scheduling, and
DataBricks (pyspark and SQL) for table creation and manipulation.
· Hosted and presented at forums for data science within the organisation

UCB
Chemoinformatician
October 2015 - November 2018 (3 years 2 months)

Page 2 of 4
Evaluated a set of calculable properties, which lead to the production of global
and series-specific pKa and logD models (m. s. error of 0.5 for both). These
models have been utilised in more than 6 therapeutic projects. Other activities
included:
· Released a hERG random forest classifier, which helped bridge our CADD
group with the DMPK department.
· Large-scale similarity searching, both 2D (ChemAxon MadFast and ChemFP)
and 3D (Schrodinger phase)
· Web form development for chemist access of QSAR models (including
JQuery and JavaScript)
· Demonstration and training of our chemoinformatics software to chemists in
both UK and Belgium sites
· Involved in the supervision and development of two chemoinformatics
placement students (one year each)

University of Sheffield
PhD Student in Chemoinformatics
October 2012 - October 2015 (3 years 1 month)
Explored the maximum common substructure concept and its applications
to 2D similarity searching and virtual screening. My work involved coding
various graph-matching algorithms to find the maximum common substructure
between two molecules. Such algorithms are used in the generation of
chemical hyperstructures (equivalent to supergraphs in graph theory), as well
as for various forms of virtual screening runs. Of note, we found a particular
topology-based manipulation which drastically sped up the search speed of
exact algorithms, allowing small molecules (~500 Daltons) to be compared
in seconds. KNIME (with Java, JUnit and R) was the software platform of
choice, with my thesis completed using LaTeX. Additional experience gained
in web design by creating the website for the information school postgraduate
research conference. Also did demonstration work for the teaching of
undergraduates and postgraduates in: chemoinformatics; web design (HTML);
content management systems (PHP and open source platforms); JavaScript;
JQuery; Database Design (ORACLE SQL).

EBI
Trainee
June 2012 - September 2012 (4 months)
Cheminformatics project using KNIME and Pipeline Pilot to construct naive
Bayesian classifiers, from ChEMBL bioactivity data based on ADME-related
protein targets. Application of domain of applicability concept to verify model

Page 3 of 4
validity and development/programming of KNIME nodes/software (in Java) to
retrieve data from ChEMBL.

Xention Discovery
Placement Student
June 2010 - August 2011 (1 year 3 months)
Used Pipeline Pilot, along with Java to develop software which designed
molecules with novel scaffolds and analogues from existing ligands. Built
Bayesian and activity space models (involving principal components analysis
and multi-dimensional scaling) to help predict compound activities against
targets, as well as doing some work on pharmacophore mapping. Helped
implement database retrieval software using Pipeline Pilot, ORACLE and
JavaScript. Learned how to apply information from scientific literature in a
programming perspective, and to present software to company scientists in
presentations and conversation.

Wellcome Trust Sanger Centre

Vacation Student
June 2009 - September 2009 (4 months)
Used Perl to develop a platform that performed text-based statistical analyses
of yeast promoter sequences. Also used R to statistically validate a DNA
binding motif predictor andplot the results.

Mologic Ltd
Vacation Student
June 2008 - September 2008 (4 months)
Researched and Presented findings to ‘Mologic Ltd’ about yeast promoter
functionality between many yeast species. Gained experience in extracting
information from scientific literature (using PubMed and online libraries as main
sources). Learned how to develop and apply personal ideas independently,
working in a field with no prior knowledge.

Education
The University of Sheffield
Doctor of Philosophy, Chemoinformatics · (2012 - 2015)

University of Birmingham School

Bioinformatics · (2008 - 2012)

Page 4 of 4

Java 17 Backend Development: Design backend systems using Spring Boot, Docker, Kafka, Eureka, Redis, and Tomcat
From Everand
Java 17 Backend Development: Design backend systems using Spring Boot, Docker, Kafka, Eureka, Redis, and Tomcat
Elara Drevyn
No ratings yet
Practical C++ Backend Programming
From Everand
Practical C++ Backend Programming
Justin Barbara
No ratings yet
Application Observability with Elastic: Real-time metrics, logs, errors, traces, root cause analysis, and anomaly detection
From Everand
Application Observability with Elastic: Real-time metrics, logs, errors, traces, root cause analysis, and anomaly detection
Navin Sabharwal
No ratings yet
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
From Everand
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
alasdair gilchrist
5/5 (1)
Practical Data Analysis - Second Edition
From Everand
Practical Data Analysis - Second Edition
Hector Cuesta
No ratings yet
Terrorism Research Initiative Perspectives On Terrorism
No ratings yet
Terrorism Research Initiative Perspectives On Terrorism
31 pages
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
From Everand
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
Eric Tome
No ratings yet
Architecting Go Applications: A Clean Approach to Building Scalable Gin Web Services
From Everand
Architecting Go Applications: A Clean Approach to Building Scalable Gin Web Services
Aarav Joshi
No ratings yet
Shivam Resume v1
No ratings yet
Shivam Resume v1
1 page
Sanket Patole Resume May2019
No ratings yet
Sanket Patole Resume May2019
1 page
JavaScript Design Patterns: Deliver fast and efficient production-grade JavaScript applications at scale
From Everand
JavaScript Design Patterns: Deliver fast and efficient production-grade JavaScript applications at scale
Hugo Di Francesco
No ratings yet
Machine Learning Mastery for Engineers
From Everand
Machine Learning Mastery for Engineers
Abdellatif Sadeq
No ratings yet
Mastering the Art of C# Programming: Unraveling the Secrets of Expert-Level Programming
From Everand
Mastering the Art of C# Programming: Unraveling the Secrets of Expert-Level Programming
Steve Jones
No ratings yet
Building Scalable Systems with C: Optimizing Performance and Portability
From Everand
Building Scalable Systems with C: Optimizing Performance and Portability
Larry Jones
No ratings yet
Automated Network Technology: The Changing Boundaries of Expert Systems
From Everand
Automated Network Technology: The Changing Boundaries of Expert Systems
Carl P. Catalano Ph.D.
No ratings yet
Resume Pooja Jain PDF
No ratings yet
Resume Pooja Jain PDF
2 pages
Practical C++ Backend Programming: Crafting Databases, APIs, and Web Servers for High-Performance Backend
From Everand
Practical C++ Backend Programming: Crafting Databases, APIs, and Web Servers for High-Performance Backend
Justin Barbara
No ratings yet
The Architect's Guide to NestJS: Architectural Trade-Offs and Implementation Patterns with NestJS
From Everand
The Architect's Guide to NestJS: Architectural Trade-Offs and Implementation Patterns with NestJS
Aarav Joshi
No ratings yet
Big Data on Kubernetes: A practical guide to building efficient and scalable data solutions
From Everand
Big Data on Kubernetes: A practical guide to building efficient and scalable data solutions
Neylson Crepalde
No ratings yet
Mastering ServiceStack: Utilize ServiceStack as the rock solid foundation of your distributed system
From Everand
Mastering ServiceStack: Utilize ServiceStack as the rock solid foundation of your distributed system
Andreas Niedermair
No ratings yet
Kaggle Kernels in Action: From Exploration to Competition
From Everand
Kaggle Kernels in Action: From Exploration to Competition
Robert Johnson
No ratings yet
Data Science with Python: From Zero to Machine Learning
From Everand
Data Science with Python: From Zero to Machine Learning
Pouvo
No ratings yet
Application Design: Key Principles For Data-Intensive App Systems
From Everand
Application Design: Key Principles For Data-Intensive App Systems
Rob Botwright
No ratings yet
Java Concurrency and Parallelism: Master advanced Java techniques for cloud-based applications through concurrency and parallelism
From Everand
Java Concurrency and Parallelism: Master advanced Java techniques for cloud-based applications through concurrency and parallelism
Jay Wang
No ratings yet
Mastering the Craft of C++ Programming: Unraveling the Secrets of Expert-Level Programming
From Everand
Mastering the Craft of C++ Programming: Unraveling the Secrets of Expert-Level Programming
Steve Jones
No ratings yet
Java 17 Backend Development
From Everand
Java 17 Backend Development
Elara Drevyn
No ratings yet
Active Machine Learning with Python: Refine and elevate data quality over quantity with active learning
From Everand
Active Machine Learning with Python: Refine and elevate data quality over quantity with active learning
Margaux Masson-Forsythe
No ratings yet
Statistics with Rust: 50+ Statistical Techniques Put into Action
From Everand
Statistics with Rust: 50+ Statistical Techniques Put into Action
Keiko Nakamura
No ratings yet
Jagadish Babu
No ratings yet
Jagadish Babu
1 page
Mastering C++: Advanced Techniques and Tricks
From Everand
Mastering C++: Advanced Techniques and Tricks
Ted Norice
No ratings yet
C# Algorithms for New Programmers: A Practical Guide with Examples
From Everand
C# Algorithms for New Programmers: A Practical Guide with Examples
William E. Clark
No ratings yet
Spark for Data Science
From Everand
Spark for Data Science
Srinivas Duvvuri
No ratings yet
Essays on Infrastructure-as-code
From Everand
Essays on Infrastructure-as-code
Ravi Rajamani
No ratings yet
Random DS Resume
No ratings yet
Random DS Resume
1 page
C++ OOP Made Simple: A Practical Guide with Examples
From Everand
C++ OOP Made Simple: A Practical Guide with Examples
William E. Clark
No ratings yet
Engineering Data Mesh in Azure Cloud: Implement data mesh using Microsoft Azure's Cloud Adoption Framework
From Everand
Engineering Data Mesh in Azure Cloud: Implement data mesh using Microsoft Azure's Cloud Adoption Framework
Aniruddha Deswandikar
No ratings yet
Mastering D3.js
From Everand
Mastering D3.js
Pablo Navarro Castillo
3/5 (1)
Mastering Flask Web and API Development: Build and deploy production-ready Flask apps seamlessly across web, APIs, and mobile platforms
From Everand
Mastering Flask Web and API Development: Build and deploy production-ready Flask apps seamlessly across web, APIs, and mobile platforms
Sherwin John C. Tragura
No ratings yet
Architecture Body of Knowledge
From Everand
Architecture Body of Knowledge
Ron Bennett
No ratings yet
React Anti-Patterns: Build efficient and maintainable React applications with test-driven development and refactoring
From Everand
React Anti-Patterns: Build efficient and maintainable React applications with test-driven development and refactoring
Juntao Qiu
No ratings yet
Blueprints of DevSecOps: Foundations to Fortify Your Cloud
From Everand
Blueprints of DevSecOps: Foundations to Fortify Your Cloud
Naveen Pakalapati
No ratings yet
Software Architecture with Kotlin: Combine various architectural styles to create sustainable and scalable software solutions
From Everand
Software Architecture with Kotlin: Combine various architectural styles to create sustainable and scalable software solutions
Jason (Tsz Shun) Chow
No ratings yet
Implementing the Stakeholder Based Goal-Question-Metric (Gqm) Measurement Model for Software Projects
From Everand
Implementing the Stakeholder Based Goal-Question-Metric (Gqm) Measurement Model for Software Projects
Dr. Prashanth Harish Southekal
No ratings yet
Data Engineering with Google Cloud Platform: A guide to leveling up as a data engineer by building a scalable data platform with Google Cloud
From Everand
Data Engineering with Google Cloud Platform: A guide to leveling up as a data engineer by building a scalable data platform with Google Cloud
Adi Wijaya
No ratings yet
Dilip Ravuri 2pg
No ratings yet
Dilip Ravuri 2pg
2 pages
AI for Everyone: An Intermediate Guide to Artificial Intelligence
From Everand
AI for Everyone: An Intermediate Guide to Artificial Intelligence
Nova Clarke
No ratings yet
Go Gin at Scale: Professional Patterns for High-Performance Web Service Development
From Everand
Go Gin at Scale: Professional Patterns for High-Performance Web Service Development
Aarav Joshi
No ratings yet
Be Data Curious!: Be Data Curious!, #1
From Everand
Be Data Curious!: Be Data Curious!, #1
Nick Jewell
No ratings yet
Practical Data Strategies and Recipes
From Everand
Practical Data Strategies and Recipes
Tom Henricksen
No ratings yet
Kubernetes Anti-Patterns: Overcome common pitfalls to achieve optimal deployments and a flawless Kubernetes ecosystem
From Everand
Kubernetes Anti-Patterns: Overcome common pitfalls to achieve optimal deployments and a flawless Kubernetes ecosystem
Govardhana Miriyala Kannaiah
No ratings yet
Naman Chindaliya - One Page
No ratings yet
Naman Chindaliya - One Page
1 page
Resume Chiranjibi - DA
No ratings yet
Resume Chiranjibi - DA
1 page
Graph Data Science with Python and Neo4j
From Everand
Graph Data Science with Python and Neo4j
Timothy Eastridge
No ratings yet
Graph Data Science with Python and Neo4j: Hands-on Projects on Python and Neo4j Integration for Data Visualization and Analysis Using Graph Data Science for Building Enterprise Strategies (English Edition)
From Everand
Graph Data Science with Python and Neo4j: Hands-on Projects on Python and Neo4j Integration for Data Visualization and Analysis Using Graph Data Science for Building Enterprise Strategies (English Edition)
Timothy Eastridge
No ratings yet
Feature Flagging with LaunchDarkly: Modern Approaches to Progressive Deployment
From Everand
Feature Flagging with LaunchDarkly: Modern Approaches to Progressive Deployment
Robert Johnson
No ratings yet
Learning .NET High-performance Programming
From Everand
Learning .NET High-performance Programming
Antonio Esposito
No ratings yet
C++ Mastery: Advanced Techniques and Strategies
From Everand
C++ Mastery: Advanced Techniques and Strategies
Adam Jones
No ratings yet
Building a Product Master
From Everand
Building a Product Master
Edufdev
No ratings yet
Mastering Data Engineering and Analytics with Databricks: A Hands-on Guide to Build Scalable Pipelines Using Databricks, Delta Lake, and MLflow (English Edition)
From Everand
Mastering Data Engineering and Analytics with Databricks: A Hands-on Guide to Build Scalable Pipelines Using Databricks, Delta Lake, and MLflow (English Edition)
Manoj Kumar
No ratings yet
Programming Best Practices for New Developers: A Practical Guide with Examples
From Everand
Programming Best Practices for New Developers: A Practical Guide with Examples
William E. Clark
No ratings yet
Accelerated DevOps with AI, ML & RPA: Non-Programmer’s Guide to AIOPS & MLOPS
From Everand
Accelerated DevOps with AI, ML & RPA: Non-Programmer’s Guide to AIOPS & MLOPS
Stephen Fleming
5/5 (2)
Data-Centers-Brochure_040124
No ratings yet
Data-Centers-Brochure_040124
2 pages
Module 7 - Complex and ERD Well Planning
No ratings yet
Module 7 - Complex and ERD Well Planning
36 pages
Enterprise Networking Power of The Platform
No ratings yet
Enterprise Networking Power of The Platform
67 pages
Introduction To Windows
No ratings yet
Introduction To Windows
24 pages
200T Crane
No ratings yet
200T Crane
9 pages
Manual Idp690
No ratings yet
Manual Idp690
94 pages
Marshall 1959HW Brochure
No ratings yet
Marshall 1959HW Brochure
32 pages
P0112, P0113
No ratings yet
P0112, P0113
4 pages
PE-FDXA4343R-HD(2019-11)
No ratings yet
PE-FDXA4343R-HD(2019-11)
10 pages
SAI's Resume
No ratings yet
SAI's Resume
1 page
Capgemini - CX Generative AI PoV - 13902e
No ratings yet
Capgemini - CX Generative AI PoV - 13902e
9 pages
Assosa University College of Computing Department of Computer Science
No ratings yet
Assosa University College of Computing Department of Computer Science
12 pages
IRIS BCR200DTP Driver Installation Guide
No ratings yet
IRIS BCR200DTP Driver Installation Guide
9 pages
Scheme Cbcs Nep Ece 2024
No ratings yet
Scheme Cbcs Nep Ece 2024
223 pages
Specification Jumbo Drill Sandvik DD311-40
100% (4)
Specification Jumbo Drill Sandvik DD311-40
4 pages
Assessment Katamso
No ratings yet
Assessment Katamso
4 pages
LIFO Search and FIFO Search
No ratings yet
LIFO Search and FIFO Search
9 pages
OPSWAT Market Share Report March 2012
No ratings yet
OPSWAT Market Share Report March 2012
8 pages
SS 3 2ND Term Computer Science
No ratings yet
SS 3 2ND Term Computer Science
24 pages
Installation Manual, GMU 44B
No ratings yet
Installation Manual, GMU 44B
31 pages
Doip
No ratings yet
Doip
15 pages
Hexa Multipay
No ratings yet
Hexa Multipay
16 pages
TOPIC-1 (1)
No ratings yet
TOPIC-1 (1)
3 pages
Final Reviewer MS
No ratings yet
Final Reviewer MS
6 pages
PC Builds 2023
No ratings yet
PC Builds 2023
4 pages
An Open Source EDA Tool For Circuit Design, Simulation, Analysis and PCB Design
No ratings yet
An Open Source EDA Tool For Circuit Design, Simulation, Analysis and PCB Design
129 pages
Test Procedures Mock Test Spot Prelims 21422
No ratings yet
Test Procedures Mock Test Spot Prelims 21422
3 pages
CT PT
100% (1)
CT PT
20 pages
Ankit Project
No ratings yet
Ankit Project
12 pages

Profile (2)

Uploaded by

Profile (2)

Uploaded by

Contact

My research interests include:

THAMES WATER UTILITIES LIMITED

Wellcome Trust Sanger Centre

University of Birmingham School

You might also like