0% found this document useful (0 votes)
68 views4 pages

Digesh Gathani 767297802

Digesh M. Gathani has over 14 years of experience in data science, analytics, and Microsoft technologies. He currently works as a Senior Data Scientist using Python. His skills include machine learning, deep learning, big data technologies like Hadoop and Spark, and data visualization with Tableau. He has experience conducting data analysis, building predictive models, and creating data-driven solutions to business problems for clients in various industries.

Uploaded by

shamsehr
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
68 views4 pages

Digesh Gathani 767297802

Digesh M. Gathani has over 14 years of experience in data science, analytics, and Microsoft technologies. He currently works as a Senior Data Scientist using Python. His skills include machine learning, deep learning, big data technologies like Hadoop and Spark, and data visualization with Tableau. He has experience conducting data analysis, building predictive models, and creating data-driven solutions to business problems for clients in various industries.

Uploaded by

shamsehr
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

Digesh M.

Gathani Mumbai, Maharashtra, India


9004054725
[email protected]

Data Scientist & Data Analyst


Overall 14+ years’ experience in Microsoft Technologies, Data Science – Python, ServiceNow, Tableau
Desktop 10, Big Data Hadoop (Sqoop, Flume, Hive, Pig, HBase, Scala, Spark).
Data Scientist with 6+ years of experience executing data-driven solutions to increases efficiency,
accuracy, and utility of internal data processing. Experienced at creating data regression models,
using predictive data modeling, and analyzing data mining algorithms to deliver insights and
implement action-oriented solutions to complex business problems.
Competencies
Languages Python (sci kit-learn, NumPy, SciPy, pandas, pyathenajdbc, pandas_redshift,
sklearn, streamlit, PyMuPDF, PyOCR, Image), C#.Net, VB.Net, .Net
1.1/2.0/3.5/4.0/4.5, LINQ
Data Scientist Python/R/SAS
Machine Classification, Regression, Clustering, Feature Engineering (Linear Regression,
Learning Logistic Regression, KNN, K Means, Decision Trees and Random Forests)
Deep Learning Tensor flow, Neural Networks, CNN, RNN
Big Data MapReduce, HDFS, Sqoop, Flume, Kafka, Hive, Pig, HBase, Scala, Spark
Visualization Matplotlib, Seaborn and Tableau Desktop 10
DB Redshift, Athena, SQL Server 2000/2005/2008/2012, MS Access
Web MVC4, ASP.NET, ADO.Net, WINFORMS, HTML, HTML 5, DHTML, XML, XSLT,
Technologies AJAX, JavaScript, jQuery, CSS, CSS3, WPF
ITSM ServiceNow
Version Control Visual Source Safe (VSS), Team Foundation Server (TFS), GitHub
Web Service WCF, Web Services
Testing Tool CodedUI

Domain Experience
 Data Analysis and Data visualization
 Machine Learning and Deep Learning
 Natural Language Process
 Big Data Hadoop
Professional Experience
Capgemini India (Dec 2011 to present)
Senior Data Scientist - Python (Dec 15 – till date)
 Conduct data regression analyses of the relationship between company data and industry trends,
achieving a 15% more accurate prediction of performance
 Utilize web scraping techniques extract and organize data.
 Used predictive analytics such as machine learning and data mining techniques to predict 95%
accuracy rate.
 Developed intricate algorithms based on deep-dive statistical analysis and predictive data modeling
that were used to deepen relationships, strengthen longevity and personalize interactions with
clients.
 Analyzed and processed complex data sets using advanced querying, for that using my Big Data
Hadoop skills. (Sqoop, Hive, HDFS)
 Finally, for visualization Tableau Desktop 10 used and create graphs and report to better
understand to clients.
ML Projects:
Retail:
1. Customer segmentation:
Cohort analysis & RFM analysis using customer dataset.
2. Product & Sales analysis store wise:
Data available store wise and merge all that and find which store have more
sales and which product sale more.
Healthcare:
1. Check Diabetes:
Build a model to accurately predict whether the patients in the dataset have
diabetes or not?
2. Predict insurance cost:
Predict insurance cost based on some internal features, using Streamlit open
source this algorithm lives on prod server.
3. A person makes a doctor appointment:
Predict someone to no-show an appointment.
4. Breast cancer:
Build a model to accurately predict whether the patients in the dataset have Breast
cancer or not?

Python ETL Project:


 PDF competition:
Python Modules: PyMuPDF, Image, ImageChops, pymysql, boto3, Pyocr, beautifulsoup, request
Connect Redshift DB and pick every product number and based on that url download pdf using
request and beautifulsoap and upload to AWS S3 bucket. Then we want to compare old version of
that same pdf files using python modules, if we find any difference then we want to maintain that
version and at the end all final pdfs file uploaded again back S3 bucket and final data csv file
uploaded to Redshift table.
 Janseen Select ETL:
Python Modules: pyathenajdbc, pandas_redshift, psycopg2, boto3, pandas
Csv file available in MBox, we want to download that file in ec2 and some task perform on that file
based on client checklist and validate that data.once everything fine then finally data uploaded in
Redshift table.
 Rest API using AWS Lambda:
Python modules: psycopg2, pandas, postman
Using AWS Lambda, created API which execute 3rd person outside of JnJ person using postman
and pull data from redshift/Athena/S3 tables without using EC2 server.
 Data pick from Sales Force:
Python modules: simple_salesforce, pandas, psycopg2, multiprocessing
We have 57 tables available in sale force and pick that table one by one and data uploaded into S3
bucket and Redshift table. Here we are using multiprocessing module for all my code working in
asynchronies.

ServiceNow Developer (March 15 – Nov 15)


Project Name: RAS (Rundeck Automation Tool)
 L1 & L2 activities perform manually, we want to automate that activities using Rundeck open
source platform in Python scripting and Service-Now as a frontend.
 Using Service-Now, we used Service Catalog, Workflow algorithm, Change & Incident module &
finally task to perform some action.
 Rundeck is open source Jobs running tool. Using Service-Now, user using service catalog and send
for approval, once get the approval automatically change task create and respected person assign
for that task.
 Service-Now version – Helsinki, Jakarta

Senior .Net Developer (Dec 11 – Feb 15)


 Worked for different clients and projects.
 Developed Web & Desktop application in Microsoft .Net.
 In this time frame worked so many different technologies.
 MS .net framework 1.0, 2.0, 3.0, 3.5, 4.0
 SQL Server 2000/2005/2008/2012
 WPF, WCF, Silver Light, SSRS, SSIS, Java Script, HTML5, CSS3, C#.Net, VB.Net, AJAX, LINQ
 Tools: Dynamic PDF, Component One, Coded UI, Crystal Report

Blue Zone System Pvt. Ltd. (Sep 2006 to Nov 2011)


Dot. Net Developer
 Worked for different clients and projects.
 Developed Web & Desktop application in Microsoft .Net.
 In this time frame worked so many different technologies.
 MS .net framework 1.0, 2.0, 3.0
 SQL Server 2000/2005
 Java Script, XML, XSLT, C#.Net, VB.Net, Window Service
 Tools: RAD Controls, TSF

Education
Bachelor of Science in Computer Science | Mumbai University | June 2003 to March 2006
Mumbai, Maharashtra
Certificate
Python Application Development
Master’s in Data Scientist – SimpliLearn (March 19 to Jan 20)
 Data Science – Python/R/SAS
 Machine Learning
 Deep Learning
 Big Data Hadoop & Spark Developer
 Tableau Desktop 10
Honor Awards
Performance of the Year – Capgemini India (Jan 15 to Dec 15)
Develop web application using Microsoft .net technologies, project timeline was 12 months,
but my team completed this project in 8 months. That’s the reason my team got best team of the year
and I received performance of the year award.

Languages
 English  Marathi
 Hindi  Gujarati
Interest
 Travelling  Sports
 Reading  Technology
 Music

You might also like