2024 MTH058 Lecture07 FederatedLearning
Devices communicate with a central server periodically to learn a global model.
FL helps preserve user privacy and reduces strain on the network by keeping data localized.
Federated learning: A definition
Generate a global model shared by all nodes by exchanging parameters between these local nodes.
Federated learning workflow
• Only the updated model is sent to the server side.
• The actual data based on user behavior does not need to be included.
Federated learning workflow
• The client updates are aggregated into a new global model on the server side, which is then distributed to all client devices.
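To make the workflow concrete, here is a minimal sketch of one communication round, assuming the model is a NumPy weight vector; the helper names (local_update, aggregate) are illustrative and not taken from any FL framework. Only model weights cross the network, never the clients' raw data.

```python
import numpy as np

def local_update(global_weights, local_data, lr=0.1, epochs=1):
    """Client side: train on private data and return only the updated weights."""
    w = global_weights.copy()
    X, y = local_data                      # raw data never leaves this function
    for _ in range(epochs):
        grad = X.T @ (X @ w - y) / len(y)  # gradient of a least-squares loss
        w -= lr * grad
    return w                               # only the model update is communicated

def aggregate(client_weights, client_sizes):
    """Server side: weighted average of the client models."""
    total = sum(client_sizes)
    return sum(n / total * w for w, n in zip(client_weights, client_sizes))

# One communication round over two clients with private datasets.
rng = np.random.default_rng(0)
clients = [(rng.normal(size=(50, 3)), rng.normal(size=50)) for _ in range(2)]
global_w = np.zeros(3)
updates = [local_update(global_w, data) for data in clients]
global_w = aggregate(updates, [len(y) for _, y in clients])  # new global model, sent back to all clients
```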
Centralized vs. Decentralized techniques
Types of federated learning
• Centralized federated learning
• A central server coordinates all the participating nodes during the learning process → possibly a bottleneck of the system.
Diao, Enmao, Jie Ding, and Vahid Tarokh. "HeteroFL: Computation and communication efficient federated learning for heterogeneous clients." ICLR 2021.
Federated vs. Distributed learning
• Distributed learning aims at parallelizing computing power, while federated learning aims at training on heterogeneous datasets.
Canonical problem formulation
• FL was originally introduced as a new setting for distributed optimization with a few distinctive properties.
Canonical problem formulation
• Objective: Learn a single global statistical model from data stored on tens to potentially millions of remote devices.
• In particular, minimize the following objective function:

    w^{*} = \arg\min_{w \in \mathbb{R}^{d}} F(w) := \sum_{k=1}^{K} p_k F_k(w)

• K: the total number of devices.
• F_k: the local objective function for the k-th device, defined as the empirical risk over local data.
• p_k: the relative impact of each device, with p_k ≥ 0 and \sum_{k=1}^{K} p_k = 1.
• It is user-defined, usually p_k = 1/K or p_k = n_k/n, where n_k is the number of samples on device k and n is the total number of samples over all devices.
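A small illustrative sketch of this objective, assuming each local objective F_k is the mean-squared error over device k's data and p_k = n_k/n; the data and helper names are made up.

```python
import numpy as np

def local_objective(w, X_k, y_k):
    """F_k(w): empirical risk (here, mean-squared error) over device k's local data."""
    return np.mean((X_k @ w - y_k) ** 2)

def global_objective(w, devices):
    """F(w) = sum_k p_k F_k(w) with p_k = n_k / n (samples on device k over total samples)."""
    n = sum(len(y_k) for _, y_k in devices)
    return sum(len(y_k) / n * local_objective(w, X_k, y_k) for X_k, y_k in devices)

# Three devices with different numbers of local samples.
rng = np.random.default_rng(0)
devices = [(rng.normal(size=(n_k, 3)), rng.normal(size=n_k)) for n_k in (20, 80, 100)]
print(global_objective(np.zeros(3), devices))  # F(w) evaluated at w = 0
```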
The FedAvg (or Local SGD) method
• The clients optimize their local objective functions for multiple steps to obtain θ_i^t.
• Then, they send the pseudo-gradients Δ_i^t = θ^t − θ_i^t to the server.
• θ^t: the initial (global) state; θ_i^t: the local update at client i at timestep t.

    \theta^{t+1} = \theta^{t} - \alpha_t \sum_{i=1}^{K} p_i \Delta_i^{t}
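A minimal sketch of one FedAvg round under these definitions, assuming full participation and simple quadratic local objectives (the function names and constants are illustrative): clients run several local SGD steps, send Δ_i^t = θ^t − θ_i^t, and the server applies a weighted pseudo-gradient step, which with α_t = 1 reduces to weighted model averaging.

```python
import numpy as np

def client_local_sgd(theta_server, grad_fn, steps=10, lr=0.05):
    """Run several local SGD steps starting from the current server model theta^t."""
    theta = theta_server.copy()
    for _ in range(steps):
        theta -= lr * grad_fn(theta)
    return theta_server - theta            # pseudo-gradient Delta_i^t = theta^t - theta_i^t

def server_update(theta, deltas, weights, server_lr=1.0):
    """theta^{t+1} = theta^t - alpha_t * sum_i p_i Delta_i^t (alpha_t = 1 recovers weighted averaging)."""
    return theta - server_lr * sum(p * d for p, d in zip(weights, deltas))

# Two clients with different quadratic objectives F_i(theta) = 0.5 * ||theta - c_i||^2.
centers = [np.array([1.0, 0.0]), np.array([0.0, 1.0])]
grad_fns = [lambda th, c=c: th - c for c in centers]

theta = np.zeros(2)
for _ in range(20):                                      # communication rounds
    deltas = [client_local_sgd(theta, g) for g in grad_fns]
    theta = server_update(theta, deltas, weights=[0.5, 0.5])
print(theta)   # approaches the average of the two client optima, (0.5, 0.5)
```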
FedAvg: The “client drift” problem
• Clients make additional SGD steps locally → FedAvg converges much faster, both in the number of rounds and in wall-clock time.
FedAvg: The “client drift” problem
A toy 2D setting with two clients and quadratic objectives that illustrates the convergence issues of FedAvg. Left: convergence trajectories in the parameter space. Right: convergence in terms of distance from the global optimum. Each trajectory corresponds to a run of federated optimization from a different starting point in the parameter space. More local SGD steps per round speed up training, but the progress eventually stagnates at an inferior point farther away from the global optimum.
Al-Shedivat, Maruan, Jennifer Gillenwater, Eric Xing, and Afshin Rostamizadeh. "Federated learning via posterior averaging: A new perspective and practical algorithms." ICLR 2021.
Image credit: CMU ML Blog
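The sketch below reproduces the flavor of this toy experiment, assuming two quadratic client objectives with mismatched curvatures (the specific matrices, step sizes, and starting point are made up): with more local steps per round, FedAvg's fixed point drifts farther from the true global optimum.

```python
import numpy as np

# Two quadratic client objectives F_i(x) = 0.5 * (x - b_i)^T A_i (x - b_i) with different curvature.
A = [np.diag([1.0, 10.0]), np.diag([10.0, 1.0])]
b = [np.array([1.0, 0.0]), np.array([0.0, 1.0])]

def fedavg(local_steps, rounds=200, lr=0.02):
    """Each round, both clients take `local_steps` gradient steps; the server averages the models."""
    x = np.array([2.0, 2.0])
    for _ in range(rounds):
        client_models = []
        for A_i, b_i in zip(A, b):
            x_i = x.copy()
            for _ in range(local_steps):
                x_i -= lr * A_i @ (x_i - b_i)     # local gradient step
            client_models.append(x_i)
        x = np.mean(client_models, axis=0)        # server aggregation (equal weights)
    return x

# Minimizer of the global objective sum_i F_i(x) (equal client weights do not change the argmin).
x_star = np.linalg.solve(sum(A), sum(A_i @ b_i for A_i, b_i in zip(A, b)))

for E in (1, 10, 100):
    x_E = fedavg(local_steps=E)
    print(f"E={E:3d}  distance to global optimum: {np.linalg.norm(x_E - x_star):.4f}")
# More local steps make faster progress per round, but the final point drifts away from x_star.
```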
Federated Posterior Averaging (2021)
• FedPA uses stochastic gradient Markov chain Monte Carlo (SG-MCMC) for approximate sampling from local posteriors on the clients.
FedPA vs. FedAvg in the toy 2D setting with two clients and quadratic objectives.
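As a heavily simplified illustration of the client-side idea only (not the full FedPA algorithm), the sketch below uses stochastic gradient Langevin dynamics, one member of the SG-MCMC family, to draw approximate samples from a toy local posterior and estimate its mean; FedPA's actual client delta also uses the estimated local covariance, which is omitted here, and all names and constants are made up.

```python
import numpy as np

def sgld_local_samples(theta0, grad_log_post, n_samples=500, step=2e-2, burn_in=200, seed=0):
    """Stochastic Gradient Langevin Dynamics on a client:

    theta_{k+1} = theta_k + (step/2) * grad log p(theta | local data) + N(0, step * I)
    yields approximate samples from the local posterior.
    """
    rng = np.random.default_rng(seed)
    theta = theta0.copy()
    samples = []
    for k in range(burn_in + n_samples):
        noise = rng.normal(scale=np.sqrt(step), size=theta.shape)
        theta = theta + 0.5 * step * grad_log_post(theta) + noise
        if k >= burn_in:
            samples.append(theta.copy())
    return np.array(samples)

# Toy local posterior: Gaussian with mean mu and diagonal covariance sigma2,
# so grad log p(theta) = -(theta - mu) / sigma2.
mu, sigma2 = np.array([1.0, -2.0]), np.array([0.5, 0.2])
grad_log_post = lambda th: -(th - mu) / sigma2

theta_server = np.zeros(2)
samples = sgld_local_samples(theta_server, grad_log_post)
local_mean = samples.mean(axis=0)          # approximate local posterior mean
delta = theta_server - local_mean          # simplified client delta (FedPA also uses the covariance)
print(local_mean, delta)
```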
Federated learning platforms
TensorFlow Federated
Another application of federated learning is personal healthcare, via learning over heterogeneous electronic medical records distributed across multiple hospitals.
Image credit: CMU ML Blog