Towards Optimal District Heating Temperature Control

Figure 1: A district heating network. (Schematic labels: primary side towards the heat generation plant, heat exchanger, secondary side, supply temperature Ts, return temperature Tr.)

2 Approach
2.1 Control strategy

We apply a Reinforcement Learning (RL) paradigm [6], where an agent learns a control strategy
(policy) by interacting with the environment, here the set of rooms heated by the network. The
problem is modelled as a Markov Decision Process: the agent receives an observation of the state
of the environment, chooses an action and receives in return a reward from the environment. The
best control strategy maximizes the expected cumulative discounted reward over the lifetime of the
agent. Learning such a policy first requires deriving a model of the environment that predicts the indoor
temperatures from the commands and the weather conditions. This model is described in Section 2.2.
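As a minimal sketch of this interaction loop (the toy environment, its dynamics and the season length below are placeholder assumptions for illustration, not the authors' model):

```python
import random

class ToyHeatingEnv:
    """Placeholder stand-in for the identified environment model of Section 2.2."""
    def __init__(self, season_length=24 * 180):
        self.season_length = season_length  # hourly steps in one heating season (assumed)
        self.t = 0

    def reset(self):
        self.t = 0
        return [20.0]  # placeholder observation (a single indoor temperature)

    def step(self, action):
        self.t += 1
        next_state = [20.0 + random.uniform(-1.0, 1.0)]  # placeholder dynamics
        reward = -abs(next_state[0] - 20.0)              # deviation from a 20 degC target
        done = self.t >= self.season_length
        return next_state, reward, done

def run_episode(env, policy, gamma=0.9):
    """Roll out one heating season and accumulate the discounted reward."""
    state, done = env.reset(), False
    discounted_return, discount = 0.0, 1.0
    while not done:
        action = policy(state)                 # choose the supply-temperature action
        state, reward, done = env.step(action)
        discounted_return += discount * reward
        discount *= gamma
    return discounted_return

print(run_episode(ToyHeatingEnv(), policy=lambda s: 0.0))
```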
At time t, the state is a vector s_t containing the outdoor temperature T_{o,t}, the supply water temperature
T_{s,t}, the time of the day and the indoor temperatures T^{(j)}_{in,t} for every room j ∈ {1, ..., N} in the network.
s_t contains both the present and the past n measurements of these quantities. At an hourly time step, a
history of 24 hours is used to form s_t. At that same hourly time step, the agent is asked to select an
action a_t. The flow rate being kept constant, the action is restricted to the supply temperature T_{s,t}.
Two discrete action spaces, with T_s (°C) ∈ {20, 21, ..., 50}, are considered. Agent 1 is the standard
strategy while Agent 2 is a fine-tuning of the baseline control strategy (cf. Section 2.3); a minimal sketch of both action encodings follows the list:
1. Agent 1: to enforce the smoothness of the control signal, the action is limited to the
increments a_t = T_{s,t} − T_{s,t−1}, where a_t ∈ A := {0, ±0.5, ±1, ±1.5, ..., ±3}.
2. Agent 2: the discrete action is the difference a_t = T_{s,t} − T^b_{s,t}, where a_t ∈ A and T^b_s is the
estimated baseline supply temperature.
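The sketch below illustrates the state stacking and the two action encodings; the function names, array shapes and the clipping of the supply temperature to [20, 50] °C are assumptions, not the authors' implementation:

```python
import numpy as np

# Hypothetical helpers illustrating the state vector and the two action encodings above.
ACTION_SET = np.array([0.0, 0.5, -0.5, 1.0, -1.0, 1.5, -1.5,
                       2.0, -2.0, 2.5, -2.5, 3.0, -3.0])  # the set A (degC)
TS_MIN, TS_MAX = 20.0, 50.0  # stated supply temperature range (clipping is an assumption)

def build_state(T_out_hist, T_s_hist, hour_hist, T_in_hist):
    """Stack the past 24 hourly measurements into the state vector s_t.

    T_in_hist has shape (24, N) for the N rooms; the other arrays have shape (24,).
    """
    return np.concatenate([T_out_hist, T_s_hist, hour_hist, T_in_hist.ravel()])

def agent1_supply_temperature(T_s_prev, action_idx):
    """Agent 1: the action is an increment on the previous supply temperature."""
    return float(np.clip(T_s_prev + ACTION_SET[action_idx], TS_MIN, TS_MAX))

def agent2_supply_temperature(T_s_baseline, action_idx):
    """Agent 2: the action is an offset from the estimated baseline supply temperature."""
    return float(np.clip(T_s_baseline + ACTION_SET[action_idx], TS_MIN, TS_MAX))
```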
Finally, the agent selects the action in order to maximize the expected cumulative discounted reward
function R = Σ_{t=0}^{T} γ^t r(s_t, a_t) over T time steps in the heating season. In the sequel, the discount
factor is set to γ = 0.9, which corresponds to an agent that adapts its behaviour to the expected
reward for the next 30 hours. The reward r penalizes deviations from a target temperature T:

r(a_t, s_t) = − Σ_{j=1}^{N} |T^{(j)}_{in,t} − T^{(j)}_t|.    (1)
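For concreteness, a minimal sketch of the reward in Equation (1); the function name and the example values are illustrative:

```python
import numpy as np

def reward(T_in, T_target):
    """Reward of Eq. (1): negative sum of absolute deviations from the targets.

    T_in and T_target are arrays of shape (N,) holding the indoor and target
    temperatures of the N rooms at time t.
    """
    return -float(np.sum(np.abs(T_in - T_target)))

# Example: three rooms at 20.5, 19.0 and 21.0 degC with a 20 degC target
# give r = -(0.5 + 1.0 + 1.0) = -2.5.
print(reward(np.array([20.5, 19.0, 21.0]), np.full(3, 20.0)))
```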

We use Deep Reinforcement Learning (DRL) to train the different agents. DRL has proven
successful in various domains such as games, robotics and demand response [7]. In
particular, we train Deep Q-Networks (DQNs) [8, 9]. For each training episode, a weather file is
randomly chosen from a set of 7 cities in China to avoid overfitting the local climate, and an entire
heating season is simulated. The weather measurements for testing the agents come from an eighth
city, Yuncheng. Some statistics summarizing the climate in these cities are gathered in Appendix A.
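A compressed sketch of this training procedure follows; the weather-file paths, the environment factory and the agent interface are assumptions, and any standard DQN implementation [8, 9] could fill the agent role:

```python
import random

# Hypothetical weather files for the 7 training cities; Yuncheng is held out for testing.
TRAIN_WEATHER_FILES = [f"weather/train_city_{i}.csv" for i in range(7)]

def train(agent, make_env, n_episodes=500):
    """Train a DQN agent, sampling a new training climate for every episode."""
    for _ in range(n_episodes):
        weather = random.choice(TRAIN_WEATHER_FILES)   # avoid overfitting one climate
        env = make_env(weather)                        # simulates one full heating season
        state, done = env.reset(), False
        while not done:
            action = agent.act(state)                  # e.g. epsilon-greedy over the discrete actions
            next_state, reward, done = env.step(action)
            agent.store(state, action, reward, next_state, done)
            agent.learn()                              # one gradient step on the Q-network
            state = next_state
```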

2.2 Model identification

Consider a six-story building with three apartments per level, facing either the Eastern, Southern or
Western direction. Heat is provided from a district heating substation and supplied to the apartments
