
PhamHoToan CV

Ho Pham is a Lead Data Engineer with over 5 years of experience in software engineering, specializing in Data, AI, and Cloud technologies. Currently working at TTC Group, he leads a BI team and has a strong background in building scalable data platforms and pipelines across various domains. He holds multiple certifications, including Machine Learning and Deep Learning from Stanford University, and has experience with AWS, Azure, and various data processing tools.

Uploaded by

phamhotoan96

Ho Pham Toan
LEAD DATA ENGINEER

Email: phamhotoan96@gmail.com
Mobile Phone: (+84) 328218204
LinkedIn: linkedin.com/in/pham-ho-toan/

The Broad Introduction
Over 5 years' experience as a software engineer
Primarily interested in three technical domains: Data, AI, and Cloud
Reliable, responsible, and eager to learn.

Working Experience
Lead Data Engineer
TTC Group / Ho Chi Minh city, Vietnam / 09.2023 - Present
Domain: Agriculture, Logistics, Technology, Finance
Propose and build data platforms for enterprises from scratch
Lead a BI team of 6 members, including DA and DE
Experience with AWS CI/CD, IaC (Terraform).

Data Specialist
Greenfeed Corporation / Ho Chi Minh city, Vietnam / 09.2022 - 09.2023
Domain: Animal husbandry, Logistics, Technology, Finance, HR, IoT (Product)
Propose scalable, secure data platforms for different situations
Capable of setting up AWS or deploying third-party tools
Self-deploy Data Catalog and Data Governance tools
Experience with GitLab CI/CD from scratch
Able to independently gather business requirements from users
Maintain, enhance, and fully develop a wide range of data pipelines from sources to destinations, both local and cloud
Data modeling based on NoSQL standards, Star Schema
Ensure Data Quality and Data Security.

Data Engineer
FPT Software / Ho Chi Minh city, Vietnam / 04.2022 - 08.2022
Domain: Technology (Outsource)
Perform statistical analysis on datasets
Hands-on experience with SQL database designs.
Prepare data for prescriptive and predictive modeling.
Capable of configuring the YAML file (CloudFormation)
Experience with the Azure CI/CD.

AI Engineer
FPT Software / Ho Chi Minh city, Vietnam / 01.2022 - 04.2022
Domain: Technology (Outsource)
Research and develop AI models to meet requirements
Expose AI/ML model APIs for applications to utilize
Familiar with renowned DL or ML algorithms and frameworks.

AI Engineer
HCMC Open University / Ho Chi Minh city, Vietnam / 04.2020 - 12.2021
Domain: Health (Product)
Research and propose various performance-enhanced AI models based on renowned original CNNs, successfully defending the thesis and publishing with Springer (Germany).

Publication
Ho, Toan Pham and Hoang, Vinh Truong. (2022). CNN Parameter Adjustment for Brain Tumor Classification. Applied Information Processing Systems. DOI: 10.1007/978-981-16-2008-9_1

Education
2023: Academic IELTS 7.0, HCMC British Council
2022: Machine Learning Certificate, Stanford University
2022: Deep Learning Certificate, Deep Learning AI
2017 - 2021: Honor B.Sc in Computer Science, HCMC Open University (Top 10% Graduation)

Technologies
AWS, DB, Gi, Airflow, Docker, IaC, Airbyte, Linux
ERPs: SAP (ECC, S/4HANA, BW), Oracle ERP, Solomon, Porcitec, PICTraq
Database: InnoDB (MSSQL, PostgreSQL), DynamoDB
Languages: Python, SQL, C# / C++, Java

PROFESSIONAL EXPERIENCE IN DETAIL

TTC Data Platform (11.2023 – Present)
Role - Institute: Lead Data Engineer - TTC Group
Responsibilities:
Propose and build data platform
Lead data team (DEs, DAs)
Data modeling
Build data pipelines from numerous data sources
Build/configure CI/CD flows for IaC/data modeling
Data Mesh on Redshift, Data Lakehouse on S3.
Technologies Used:
Python
AWS (EMR, ECS, EC2, S3, Glue, Athena, Redshift, VPC, VPN, Control Tower, CodeCommit, CodePipeline, CodeBuild, Cloud9, IAM, Secrets Manager, DynamoDB, ...)
Azure (PBI)
Airbyte, DBT, Oracle ERP
Apache Spark, Apache Iceberg.

Sales Group, Cash Flows (07.2023 – 11.2023)
Role - Institute: Data Specialist - Greenfeed
Responsibilities:
Gather business requirements from users.
Build data pipelines for the PBI reports.
Enable the SAP OData service.
Create data connectors for Solomon, SAP, Porcitec, PICTraq, MariaDB, Excel files.
Transform and load data from Data Lake to the Data Warehouse.
Schedule and monitor data pipelines.
Technologies Used:
Python (pymssql, pandas, pandasql, boto3), ABAP
AWS (S3, Redshift, DMS, AppFlow, DataSync)
Airbyte, Airflow, DBT, Gitlab CI/CD, Docker, OData

Greenfeed Data Platform (02.2023 – 06.2023)
Role - Institute: Data Specialist - Greenfeed
Responsibilities:
Determine pros and cons of components/tools in the existing Data Platform
Research and run POCs on optimal alternatives to replace old technologies.
Compare the old and the new
Deploy the final new services/tools and put them into practice
Centralize data from numerous data sources
Research and evaluate different OSS Data Catalogs.
Technologies Used:
AWS (DMS, AppFlow, DataSync, EC2, VPN, EKS)
Airbyte, Airflow, DBT, Slack, Datahub, OData
Docker, Gitlab CI/CD

Farm Inventory & Performances, OKR (09.2022 – 01.2023)
Role - Institute: Data Specialist - Greenfeed
Responsibilities:
Gather business requirements from users
Build data pipelines for PBI reports
Create data connectors for Solomon, Porcitec, PICTraq, Excel files.
Transform and load data from Data Lake to Data Warehouse
Schedule and monitor data pipelines.
Technologies Used:
Python (pymssql, pandas, pandasql, boto3)
AWS (EventBridge, S3, Redshift, CodeCommit)

UACJ – Water Bridge (Japan) (04.2022 – 08.2022)
Role - Institute: Data Engineer/DevOps - FPT Software
Responsibilities:
Partly design the high-level architecture.
Design the relational database.
Manage, configure, and deploy AWS through YAML.
Technologies Used:
Raw Python, Azure Git, Azure CI/CD, Jira, Postman
AWS (CloudFormation, Lambda, API Gateway, DynamoDB, Amazon Aurora, Cognito)

Fake News Classifications (01.2022 – 03.2022)
Role - Institute: AI Engineer - FPT Software
Project: Fake News Classifications (Internal)
Responsibilities:
Visualize and analyze data.
Preprocess and clean data for feature selection.
Apply well-known NLP techniques such as stopwords and TF-IDF.
Train and fine-tune models to find optimal parameters with GridSearch.
Implement several common ML models
Implement advanced ensemble models.
Technologies Used:
Python (TensorFlow, Scikit-learn, NumPy, Pandas, Matplotlib, OpenCV, NLTK, Seaborn, XGBoost, Bagging, AdaBoost)
CNNs, KNNs, Multinomial Naive Bayes, Logistic Regression, SVM, Decision Tree, Random Forest

Best regards,
Toan Ho Pham
