SlideShare a Scribd company logo
DATA ORCHESTRATION SUMMIT
2020
Alluxio Architecture and
Scaling Performance
Gene Pang | Founding Engineer & Head Architect @ Alluxio, Inc.
DATA ORCHESTRATION SUMMIT
▪ Alluxio	high-level	architecture	
▪ Scaling	performance	for	large	deployments	
Outline
DATA ORCHESTRATION 
SUMMIT
2020
Alluxio Architecture
High-Level Architecture and Components
DATA ORCHESTRATION SUMMIT
Alluxio	Overview	
Data Orchestration for the Cloud
Java File API HDFS Interface S3 Interface REST APIPOSIX Interface
HDFS Driver Swift Driver S3 Driver NFS Driver
DATA ORCHESTRATION SUMMIT
Alluxio	Processes	
Alluxio
Master
Job
Master
Worker
Job
Worker
Fuse
Process
Proxy
Worker
Job
Worker
Fuse
Process
Proxy
Worker
Job
Worker
Fuse
Process
Proxy
DATA ORCHESTRATION SUMMIT
Worker
Job
Worker
Fuse
Process
Proxy
Worker
Job
Worker
Fuse
Process
Proxy
Worker
Job
Worker
Fuse
Process
Proxy
Job
Master
Alluxio
Master
Alluxio	Processes	
•  Manages	filesystem	namespace	
•  Handles	authentication	and	authorization	
•  Manages	structured	data	catalog	
•  Maintains	Alluxio	worker	membership
DATA ORCHESTRATION SUMMIT
Job
Master
Job
Worker
Fuse
Process
Proxy
Job
Worker
Fuse
Process
Proxy
Job
Worker
Fuse
Process
Proxy
Alluxio
Master
Worker WorkerWorker
Alluxio	Processes	
•  Manages	file/block	data	cache	
•  Writes	and	reads	data	to	and	from	UFS	
•  Interacts	with	clients	for	data	transfer
DATA ORCHESTRATION SUMMIT
Worker
Job
Worker
Fuse
Process
Proxy
Worker
Job
Worker
Fuse
Process
Proxy
Worker
Job
Worker
Fuse
Process
Proxy
Alluxio
Master
Job
Master
Alluxio	Processes	
•  Jobs	are	asynchronous	and	distributed	tasks	(load,	
persist,	policy-based	movements)	
•  Manages	execution	and	state	of	jobs	
•  Maintains	Job	worker	membership
DATA ORCHESTRATION SUMMIT
Job
Master
Fuse
Process
Proxy
Fuse
Process
Proxy
Fuse
Process
Proxy
Alluxio
Master
Worker WorkerWorker
Job
Worker
Job
Worker
Job
Worker
Alluxio	Processes	
•  Executes	distributed	tasks	for	jobs	
•  Uses	the	Alluxio	client	to	interact	with	Alluxio	data
DATA ORCHESTRATION SUMMIT
Job
Master
ProxyProxy Proxy
Alluxio
Master
Worker WorkerWorker
Job
Worker
Job
Worker
Job
Worker
Fuse
Process
Fuse
Process
Fuse
Process
Alluxio	Processes	
•  Enables	Alluxio	Fuse	(Filesystem	in	User	Space)	
•  Uses	Alluxio	client	to	interact	with	other	components	
•  Enables	mounting	Alluxio	namespace	to	local	filesystem	
•  Any	machine	with	this	process	can	interact	with	Alluxio
DATA ORCHESTRATION SUMMIT
Job
Master
Alluxio
Master
Worker WorkerWorker
Job
Worker
Job
Worker
Job
Worker
Fuse
Process
Fuse
Process
Fuse
Process
ProxyProxy Proxy
Alluxio	Processes	
•  Exposes	REST	and	S3-compatible	endpoints	
•  Uses	Alluxio	client	to	interact	with	other	components
DATA ORCHESTRATION 
SUMMIT
2020
Scaling Alluxio Performance
Improving Alluxio for Large Scale Deployments
DATA ORCHESTRATION SUMMIT
Large	Scale	Alluxio	Deployments	
Large	#	of	Files	 Large	#	of	Users
DATA ORCHESTRATION SUMMIT
To	stay	fast,	
avoid	slow	pauses!
DATA ORCHESTRATION SUMMIT
▪  External	Storage	Systems	(UFS)	
•  HDFS	
•  Object	stores	
•  Cloud	blob	stores	
▪  External	Catalogs	(UDB)	
•  Hive	Metastore	
•  AWS	Glue	
▪  Local	Disks	
•  SSDs	
•  HDDs	
Sources	of	Slow	Pauses
DATA ORCHESTRATION SUMMIT
Metadata	Sync	Lock	Contention	
UFS
Thread-1	
Thread-2	
Thread-3	
Thread-4	
•  Metadata	sync	is	single-threaded	
•  Requires	write	lock	for	entire	sync	
•  Blocks	other	users/threads	
Before
DATA ORCHESTRATION SUMMIT
Metadata	Sync	Lock	Contention	
UFS
Thread-1	
Thread-2	
Thread-3	
Thread-4	
•  Shorten	critical	section	which	requires	write	lock	
•  Parallelize	UFS	access	and	sync	with	multiple	threads	
•  Enable	more	concurrent	users,	and	faster	syncs	
After
DATA ORCHESTRATION SUMMIT
Slow	UFS	Data	Reads	
Alluxio
Worker
Client
UFS
•  High	concurrency	saturates	network	bandwidth	and	UFS	
•  Client	times	out	even	though	the	reads	eventually	complete	
•  Applications	may	fail	unnecessarily	
Client
Client
Client
Before
DATA ORCHESTRATION SUMMIT
Slow	UFS	Data	Reads	
Alluxio
Worker UFS
•  Expect	slow	IO,	adjust	and	handle	timeouts	better	
•  Improve	logging	to	observe	slow	IO	
•  Prevent	unnecessary	timeouts	for	users	and	applications,	during	
high	concurrency	
Client
Client
Client
Client
After
DATA ORCHESTRATION SUMMIT
Slow	Catalog	Sync	
Alluxio
Catalog UDBClient
•  Alluxio	catalog	syncs	from	the	UDB	with	a	single	thread	
•  Syncing	databases	with	many	tables	can	take	a	long	time	
Before
DATA ORCHESTRATION SUMMIT
Slow	Catalog	Sync	
Alluxio
Catalog UDBClient
•  Parallelize	catalog	syncing	with	multiple	threads	
•  Accelerate	syncing	large	databases	for	users	
After
DATA ORCHESTRATION SUMMIT
Synchronous	Disk	IO	in	Worker	Storage	Tiers	
MEM
SSD
HDD
•  CACHE_PROMOTE	requires	moving	block	to	top	tier	
•  Finding	free	space	may	require	cascading	eviction	
•  User	reads	of	cached	data	may	block	on	many	disk	IOs	
Before
DATA ORCHESTRATION SUMMIT
Synchronous	Disk	IO	in	Worker	Storage	Tiers	
MEM
SSD
HDD
•  Make	default	read	type	as	CACHE,	to	avoid	synchronous	disk	IO	
•  Rearrange	blocks	asynchronously	to	match	caching	policy	
•  Avoid	blocking	on	disk	IO	for	user	requests,	for	faster	and	more	
predictable	Alluxio	IO	performance	
After
DATA ORCHESTRATION SUMMIT
Unlimited	Waiting	for	Disk	Operations	
Local
Disk
Client
Data
Cache
get()	
•  Client-side	cache	stores	data	on	local	disk	
•  Some	disk	operations	can	take	an	unlimited	amount	of	time	
•  Applications	can	get	stuck	when	accessing	cache	
Before
DATA ORCHESTRATION SUMMIT
Unlimited	Waiting	for	Disk	Operations	
Local
Disk
Client
Data
Cache
get()	
•  Support	configurable	time	outs	for	client-side	cache	interactions	
•  Avoid	unexpected	hangs	when	accessing	client-side	cache	
After
DATA ORCHESTRATION SUMMIT
Conclusion	
Alluxio	Data	Orchestration	is	complex,	
interacts	with	external	storage	
Must	expect	slow	IO	to	external	storage	
Do	not	force	users	to	wait	for	slowness	
Handle	pauses	appropriately	
Benefits	to	Users	
faster	 predictable	
more	
concurrency

More Related Content

What's hot (20)

PDF
Jenkins를 활용한 Openshift CI/CD 구성
rockplace
 
PDF
Snowflake Architecture and Performance(db tech showcase Tokyo 2018)
Mineaki Motohashi
 
PDF
OpenStack超入門シリーズ いまさら聞けないNeutronの使い方
Toru Makabe
 
PDF
Nutanix 概要紹介
Satoshi Shimazaki
 
PDF
Automated CloudStack Deployment
ShapeBlue
 
PPTX
A critique of ansi sql isolation levels 解説公開用
Takashi Kambayashi
 
PDF
AWS Black Belt Techシリーズ Amazon Workspaces
Amazon Web Services Japan
 
PDF
Fluid: When Alluxio Meets Kubernetes
Alluxio, Inc.
 
PPTX
Kubernetes Basics
Rishabh Kumar
 
PPTX
マルチクラウドDWH(Snowflake)のすすめ
Yuuta Hishinuma
 
PDF
AWS Kubernetes 서비스 자세히 살펴보기 (정영준 & 이창수, AWS 솔루션즈 아키텍트) :: AWS DevDay2018
Amazon Web Services Korea
 
PDF
Making Cloud Native CI_CD Services.pdf
Rakuten Group, Inc.
 
PDF
AWS 상의 컨테이너 서비스 소개 ECS, EKS - 이종립 / Principle Enterprise Evangelist @베스핀글로벌
BESPIN GLOBAL
 
PPTX
Azure Stack Fundamentals
Cenk Ersoy
 
PPTX
Dexamethasone
Asra Hameed
 
PDF
MinIO January 2020 Briefing
Jonathan Symonds
 
PDF
JenkinsとCodeBuildとCloud Buildと私
Shoji Shirotori
 
PDF
製造装置データ収集の選択肢 (AWS IoT Deep Dive #5)
Amazon Web Services Japan
 
PDF
20210317 AWS Black Belt Online Seminar Amazon MQ
Amazon Web Services Japan
 
PDF
Amazon SageMaker을 통한 손쉬운 Jupyter Notebook 활용하기 - 윤석찬 (AWS 테크에반젤리스트)
Amazon Web Services Korea
 
Jenkins를 활용한 Openshift CI/CD 구성
rockplace
 
Snowflake Architecture and Performance(db tech showcase Tokyo 2018)
Mineaki Motohashi
 
OpenStack超入門シリーズ いまさら聞けないNeutronの使い方
Toru Makabe
 
Nutanix 概要紹介
Satoshi Shimazaki
 
Automated CloudStack Deployment
ShapeBlue
 
A critique of ansi sql isolation levels 解説公開用
Takashi Kambayashi
 
AWS Black Belt Techシリーズ Amazon Workspaces
Amazon Web Services Japan
 
Fluid: When Alluxio Meets Kubernetes
Alluxio, Inc.
 
Kubernetes Basics
Rishabh Kumar
 
マルチクラウドDWH(Snowflake)のすすめ
Yuuta Hishinuma
 
AWS Kubernetes 서비스 자세히 살펴보기 (정영준 & 이창수, AWS 솔루션즈 아키텍트) :: AWS DevDay2018
Amazon Web Services Korea
 
Making Cloud Native CI_CD Services.pdf
Rakuten Group, Inc.
 
AWS 상의 컨테이너 서비스 소개 ECS, EKS - 이종립 / Principle Enterprise Evangelist @베스핀글로벌
BESPIN GLOBAL
 
Azure Stack Fundamentals
Cenk Ersoy
 
Dexamethasone
Asra Hameed
 
MinIO January 2020 Briefing
Jonathan Symonds
 
JenkinsとCodeBuildとCloud Buildと私
Shoji Shirotori
 
製造装置データ収集の選択肢 (AWS IoT Deep Dive #5)
Amazon Web Services Japan
 
20210317 AWS Black Belt Online Seminar Amazon MQ
Amazon Web Services Japan
 
Amazon SageMaker을 통한 손쉬운 Jupyter Notebook 활용하기 - 윤석찬 (AWS 테크에반젤리스트)
Amazon Web Services Korea
 

Similar to Alluxio Architecture and Performance (20)

PDF
Accelerate Analytics and ML in the Hybrid Cloud Era
Alluxio, Inc.
 
PDF
Data Orchestration for the Hybrid Cloud Era
Alluxio, Inc.
 
PDF
Data Orchestration Platform for the Cloud
Alluxio, Inc.
 
PDF
From limited Hadoop compute capacity to increased data scientist efficiency
Alluxio, Inc.
 
PDF
Accelerate Analytics and ML in the Hybrid Cloud Era
Alluxio, Inc.
 
PDF
Introducing the Hub for Data Orchestration
Alluxio, Inc.
 
PDF
Accelerate Analytics and ML in the Hybrid Cloud Era
Alluxio, Inc.
 
PDF
Alluxio 2.0 Deep Dive – Simplifying data access for cloud workloads
Alluxio, Inc.
 
PDF
Alluxio 2 Community Update
Alluxio, Inc.
 
PDF
Enabling Apache Spark for Hybrid Cloud
Alluxio, Inc.
 
PDF
Unified Big Data Analytics: Any Stack, Any Cloud
Alluxio, Inc.
 
PDF
Accelerate Spark Workloads on S3
Alluxio, Inc.
 
PDF
Getting Started with Apache Spark and Alluxio for Blazingly Fast Analytics
Alluxio, Inc.
 
PDF
Alluxio Use Cases and Future Directions
Alluxio, Inc.
 
PDF
Alluxio Data Orchestration Platform for the Cloud
Shubham Tagra
 
PDF
Open Source Data Orchestration for AI, Big Data, and Cloud
Alluxio, Inc.
 
PDF
Modernizing Your Data Platform for Analytics and AI in the Hybrid Cloud Era
Alluxio, Inc.
 
PDF
How the Development Bank of Singapore solves on-prem compute capacity challen...
Alluxio, Inc.
 
PDF
Alluxio+Presto: An Architecture for Fast SQL in the Cloud
Alluxio, Inc.
 
PDF
Architecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds
Alluxio, Inc.
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Alluxio, Inc.
 
Data Orchestration for the Hybrid Cloud Era
Alluxio, Inc.
 
Data Orchestration Platform for the Cloud
Alluxio, Inc.
 
From limited Hadoop compute capacity to increased data scientist efficiency
Alluxio, Inc.
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Alluxio, Inc.
 
Introducing the Hub for Data Orchestration
Alluxio, Inc.
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Alluxio, Inc.
 
Alluxio 2.0 Deep Dive – Simplifying data access for cloud workloads
Alluxio, Inc.
 
Alluxio 2 Community Update
Alluxio, Inc.
 
Enabling Apache Spark for Hybrid Cloud
Alluxio, Inc.
 
Unified Big Data Analytics: Any Stack, Any Cloud
Alluxio, Inc.
 
Accelerate Spark Workloads on S3
Alluxio, Inc.
 
Getting Started with Apache Spark and Alluxio for Blazingly Fast Analytics
Alluxio, Inc.
 
Alluxio Use Cases and Future Directions
Alluxio, Inc.
 
Alluxio Data Orchestration Platform for the Cloud
Shubham Tagra
 
Open Source Data Orchestration for AI, Big Data, and Cloud
Alluxio, Inc.
 
Modernizing Your Data Platform for Analytics and AI in the Hybrid Cloud Era
Alluxio, Inc.
 
How the Development Bank of Singapore solves on-prem compute capacity challen...
Alluxio, Inc.
 
Alluxio+Presto: An Architecture for Fast SQL in the Cloud
Alluxio, Inc.
 
Architecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds
Alluxio, Inc.
 
Ad

More from Alluxio, Inc. (20)

PDF
Introduction to Apache Iceberg™ & Tableflow
Alluxio, Inc.
 
PDF
Optimizing Tiered Storage for Low-Latency Real-Time Analytics at AI Scale
Alluxio, Inc.
 
PDF
Meet in the Middle: Solving the Low-Latency Challenge for Agentic AI
Alluxio, Inc.
 
PDF
From Data Preparation to Inference: How Alluxio Speeds Up AI
Alluxio, Inc.
 
PDF
Best Practice for LLM Serving in the Cloud
Alluxio, Inc.
 
PDF
Meet You in the Middle: 1000x Performance for Parquet Queries on PB-Scale Dat...
Alluxio, Inc.
 
PDF
How Coupang Leverages Distributed Cache to Accelerate ML Model Training
Alluxio, Inc.
 
PDF
Alluxio Webinar | Inside Deepseek 3FS: A Deep Dive into AI-Optimized Distribu...
Alluxio, Inc.
 
PDF
AI/ML Infra Meetup | Building Production Platform for Large-Scale Recommendat...
Alluxio, Inc.
 
PDF
AI/ML Infra Meetup | How Uber Optimizes LLM Training and Finetune
Alluxio, Inc.
 
PDF
AI/ML Infra Meetup | Optimizing ML Data Access with Alluxio: Preprocessing, ...
Alluxio, Inc.
 
PDF
AI/ML Infra Meetup | Deployment, Discovery and Serving of LLMs at Uber Scale
Alluxio, Inc.
 
PDF
Alluxio Webinar | What’s New in Alluxio AI: 3X Faster Checkpoint File Creatio...
Alluxio, Inc.
 
PDF
AI/ML Infra Meetup | A Faster and More Cost Efficient LLM Inference Stack
Alluxio, Inc.
 
PDF
AI/ML Infra Meetup | Balancing Cost, Performance, and Scale - Running GPU/CPU...
Alluxio, Inc.
 
PDF
AI/ML Infra Meetup | RAYvolution - The Last Mile: Mastering AI Deployment wit...
Alluxio, Inc.
 
PDF
Alluxio Webinar | Accelerate AI: Alluxio 101
Alluxio, Inc.
 
PDF
AI/ML Infra Meetup | The power of Ray in the era of LLM and multi-modality AI
Alluxio, Inc.
 
PDF
AI/ML Infra Meetup | Exploring Distributed Caching for Faster GPU Training wi...
Alluxio, Inc.
 
PDF
AI/ML Infra Meetup | Big Data and AI, Zoom Developers
Alluxio, Inc.
 
Introduction to Apache Iceberg™ & Tableflow
Alluxio, Inc.
 
Optimizing Tiered Storage for Low-Latency Real-Time Analytics at AI Scale
Alluxio, Inc.
 
Meet in the Middle: Solving the Low-Latency Challenge for Agentic AI
Alluxio, Inc.
 
From Data Preparation to Inference: How Alluxio Speeds Up AI
Alluxio, Inc.
 
Best Practice for LLM Serving in the Cloud
Alluxio, Inc.
 
Meet You in the Middle: 1000x Performance for Parquet Queries on PB-Scale Dat...
Alluxio, Inc.
 
How Coupang Leverages Distributed Cache to Accelerate ML Model Training
Alluxio, Inc.
 
Alluxio Webinar | Inside Deepseek 3FS: A Deep Dive into AI-Optimized Distribu...
Alluxio, Inc.
 
AI/ML Infra Meetup | Building Production Platform for Large-Scale Recommendat...
Alluxio, Inc.
 
AI/ML Infra Meetup | How Uber Optimizes LLM Training and Finetune
Alluxio, Inc.
 
AI/ML Infra Meetup | Optimizing ML Data Access with Alluxio: Preprocessing, ...
Alluxio, Inc.
 
AI/ML Infra Meetup | Deployment, Discovery and Serving of LLMs at Uber Scale
Alluxio, Inc.
 
Alluxio Webinar | What’s New in Alluxio AI: 3X Faster Checkpoint File Creatio...
Alluxio, Inc.
 
AI/ML Infra Meetup | A Faster and More Cost Efficient LLM Inference Stack
Alluxio, Inc.
 
AI/ML Infra Meetup | Balancing Cost, Performance, and Scale - Running GPU/CPU...
Alluxio, Inc.
 
AI/ML Infra Meetup | RAYvolution - The Last Mile: Mastering AI Deployment wit...
Alluxio, Inc.
 
Alluxio Webinar | Accelerate AI: Alluxio 101
Alluxio, Inc.
 
AI/ML Infra Meetup | The power of Ray in the era of LLM and multi-modality AI
Alluxio, Inc.
 
AI/ML Infra Meetup | Exploring Distributed Caching for Faster GPU Training wi...
Alluxio, Inc.
 
AI/ML Infra Meetup | Big Data and AI, Zoom Developers
Alluxio, Inc.
 
Ad

Recently uploaded (20)

PDF
Windows 10 Professional Preactivated.pdf
asghxhsagxjah
 
PDF
Softaken CSV to vCard Converter accurately converts CSV files to vCard
markwillsonmw004
 
PPTX
How Can Reporting Tools Improve Marketing Performance.pptx
Varsha Nayak
 
PPTX
How Odoo ERP Enhances Operational Visibility Across Your Organization.pptx
zidanakhtar874
 
PPTX
leaf desease detection using machine learning.pptx
kdjeevan35
 
PDF
NSF Converter Simplified: From Complexity to Clarity
Johnsena Crook
 
PPTX
Operations Profile SPDX_Update_20250711_Example_05_03.pptx
Shane Coughlan
 
PPTX
API DOCUMENTATION | API INTEGRATION PLATFORM
philipnathen82
 
PPTX
Odoo Migration Services by CandidRoot Solutions
CandidRoot Solutions Private Limited
 
PPTX
Lec 2 Compiler, Interpreter, linker, loader.pptx
javidmiakhil63
 
PPTX
Transforming Lending with IntelliGrow – Advanced Loan Software Solutions
Intelli grow
 
PPTX
UI5con_2025_Accessibility_Ever_Evolving_
gerganakremenska1
 
PDF
Salesforce Experience Cloud Consultant.pdf
VALiNTRY360
 
PPTX
BB FlashBack Pro 5.61.0.4843 With Crack Free Download
cracked shares
 
PDF
Australian Enterprises Need Project Service Automation
Navision India
 
PPTX
MiniTool Partition Wizard Crack 12.8 + Serial Key Download Latest [2025]
filmoracrack9001
 
PDF
ERP Consulting Services and Solutions by Contetra Pvt Ltd
jayjani123
 
PDF
How Attendance Management Software is Revolutionizing Education.pdf
Pikmykid
 
PDF
Code and No-Code Journeys: The Maintenance Shortcut
Applitools
 
PPT
24-BuildingGUIs Complete Materials in Java.ppt
javidmiakhil63
 
Windows 10 Professional Preactivated.pdf
asghxhsagxjah
 
Softaken CSV to vCard Converter accurately converts CSV files to vCard
markwillsonmw004
 
How Can Reporting Tools Improve Marketing Performance.pptx
Varsha Nayak
 
How Odoo ERP Enhances Operational Visibility Across Your Organization.pptx
zidanakhtar874
 
leaf desease detection using machine learning.pptx
kdjeevan35
 
NSF Converter Simplified: From Complexity to Clarity
Johnsena Crook
 
Operations Profile SPDX_Update_20250711_Example_05_03.pptx
Shane Coughlan
 
API DOCUMENTATION | API INTEGRATION PLATFORM
philipnathen82
 
Odoo Migration Services by CandidRoot Solutions
CandidRoot Solutions Private Limited
 
Lec 2 Compiler, Interpreter, linker, loader.pptx
javidmiakhil63
 
Transforming Lending with IntelliGrow – Advanced Loan Software Solutions
Intelli grow
 
UI5con_2025_Accessibility_Ever_Evolving_
gerganakremenska1
 
Salesforce Experience Cloud Consultant.pdf
VALiNTRY360
 
BB FlashBack Pro 5.61.0.4843 With Crack Free Download
cracked shares
 
Australian Enterprises Need Project Service Automation
Navision India
 
MiniTool Partition Wizard Crack 12.8 + Serial Key Download Latest [2025]
filmoracrack9001
 
ERP Consulting Services and Solutions by Contetra Pvt Ltd
jayjani123
 
How Attendance Management Software is Revolutionizing Education.pdf
Pikmykid
 
Code and No-Code Journeys: The Maintenance Shortcut
Applitools
 
24-BuildingGUIs Complete Materials in Java.ppt
javidmiakhil63
 

Alluxio Architecture and Performance