0% found this document useful (0 votes)

20 views

N0410010204

This study presents deep reinforcement learning methods aimed at suppressing the horizontal sway of construction cranes in a virtual environment, addressing safety concerns in crane operations. The research analyzes sample efficiency in reinforcement learning using Proximal Policy Optimization and Generative Adversarial Imitation Learning techniques, demonstrating improved learning performance. The findings indicate that the applied reinforcement learning techniques effectively enhance sample efficiency for crane sway control.

Uploaded by

Sung Woo Shin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views

N0410010204

Uploaded by

Sung Woo Shin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Journal of Construction Automation and Robotics pISSN: 2800-0552, eISSN: 2951-116X

Vol. 1 No. 2, pp. 19-24 / July, 2022 DOI: https://doi.org/10.55785/JCAR.1.2.19

가상물리환경에서의 건설 크레인 수평 흔들림 억제를 위한

심층 강화학습 기법
Deep Reinforcement Learning Methods for Suppressing Horizontal Sway of
Construction Cranes in Virtual Physical Environment

김인원1 ․ 김남균2 ․ 정민혁3 ․ 안창범4 ․ 박문서5

Kim, In Won1 ․ Kim, Nam Kyoun2 ․ Jung, Min Hyuk3 ․ Ahn, Chang Bum4 ․ Park, Moon Seo5

Received June 27, 2022 빳 Revised July 8, 2022 빳 Accepted July 8, 2022

ABSTRACT
In the development of a deep reinforcement learning-based autonomous operation model of cranes, the control of heavy object’s horizontal sway is an
important issue that directly affects crane operation safety. In kinematics, however, the motion control of a heavy object with pendulum motion is
classified as an underactuated system in which the degree of freedom of movement of objects is larger than the number of manipulable actions of
controllers. This increases the variance of rewards expected from action and state samples in reinforcement learning, and raises the problem of sample
efficiency, which means the number of samples effective for learning. Therefore, this study analyzes the sample efficiency that occurs when learning the
reinforcement learning model for sway control of cranes using Proximal Policy Optimization and Generative Adversarial Imitation Learning (GAIL)
techniques. To this end, this study established a virtual physical environment capable of simulating the movement of a construction crane, and expert
demonstration data samples were collected for GAIL. Finally, the effect of PPO and GAIL on sample efficiency was analyzed through the experiment. The
results show that the reinforcement learning technique applied to the experiment is effective in improving the sample efficiency and learning
performance of the crane model.
Keyword : Autonomous Crane, Deep Reinforcement Learning, Sample Efficiency, Generative and Adversarial Imitation Learning

1. 서 론

(underactuated system) ,
.
, .
(Wu and Xia, 2014) , (Sawodny et al., 2002).

(Fang and Cho, 2017; Ramli et al., 2017). (Deep Reinforcement Learning, DRL)

1
서울대학교 대학원 건축학과 석사과정(Master’s Student, Department of Architecture and Architectural Engineering, Seoul National University, inwon33@
snu.ac.kr)
2
서울대학교 대학원 건축학과 박사과정(Ph. D. Student, Department of Architecture and Architectural Engineering, Seoul National University, dewichon@
naver.com)
3
교신저자 ․ 서울대학교 건축학과 연구교수(Corresponding Author, Research Professor, Department of Architecture and Architectural Engineering, Seoul
National University, [email protected])
4
서울대학교 건축학과 교수(Professor, Department of Architecture and Architectural Engineering, Seoul National University, [email protected])
5
서울대학교 건축학과 교수(Professor, Department of Architecture and Architectural Engineering, Seoul National University, [email protected])

Copyright © 2022 Korean Society of Automation and Robotics in Construction. This is an Open Access article distributed under the terms of the Creative
Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted non-commercial use,
distribution, and reproduction in any medium, provided the original work is properly cited.

| Vol.1 No.2 | 19
김인원 ․ 김남균 ․ 정민혁 ․ 안창범 ․ 박문서

(Zhao et al., 2020; Sallab et al., 2017). 75

SNUCG , , ,
(sample efficiency) Multi-Sensory
. . Rong et al.(2020) End-to-End
LGSVL
(Botvinick et al., 2019), .
(Yu, 2018; Kiran et al., 2021).

,
.
,
, (Proximal Policy Op- .
timization, PPO) (Inverse Reinforcement Learn- Unity ML-Agents
ing, IRL) (Generative Adver- (Juliani et al., 2018),
sarial Imitation Learning, GAIL) .
.
2.2 강화학습에서의 표본 효율성
2. 선행연구 분석
.
2.1 가상물리환경 기반 강화학습
,
, 3 3
.
(Shah et al., 2018). 2 ( / , / )
,
, (Sawodny et al., 2002).
(Matsumoto et al., 2020).
. ,
,
. (Yang et al., 2019).
Alex Dosovitskiy(2017) Unreal Engine 4 , ( : )
CARLA ( : ) ( :
. , , )
Classical .
Module Pipeline, End-to-End, 3 -
,
. Savva et al.(2017) .
MINOS

20 | Korean Society of Automation and Robotics in Construction |

가상물리환경에서의 건설 크레인 수평 흔들림 억제를 위한 심층 강화학습 기법

. ,

2.3 표본 효율성 향상을 위한 모방학습

(Imitation Learning)
(Kiran et al., 2021).
,

(Kober and Peters, 2010).

Figure 1. Parameters of the training environment and the crane model

(behavior cloning) 3. DRL 모형 개발

. 3.1 가상물리환경에서의 크레인 및 작업 정의

(expert demon-
stration data) , Fig. 1 .
.
.
, .
,
(Bhattacharyya et al., 2020). . ,
GAIL .
(Ho .
and Ermon, 2016). GAIL - () , () ()

. . ()

, 15° , ()
.
. .

GAIL
: 1) (hook) () , 2)

. , 3) .
,
. 3.2 상태 공간 및 보상함수
(State space)
. . Table 1
GAIL .

. () () ()

| Vol.1 No.2 | 21
김인원 ․ 김남균 ․ 정민혁 ․ 안창범 ․ 박문서

Table 1. State space and reward function

Position    ,   
State space
Sway  , 
+1.0: distance  and    &   
Sparse
-1.0: distance  and   
Reward function +0.15: distance  and   
Dense -0.2: distance  and    (after agent got +0.15)
-1.0/maxstep at each step

, (), () 3.3 정책 네트워크

.
10 . .
(Reward function) , 256 2
.
Table 1 . . (PPO)
(Schulman et al., 2017).
() () ()
, () () 4. 실 험
. .
4.1 실험 설정
  PPO GAIL
     cos          (1)

GAIL
.
(m: , g: , l: )
Fig. 2
, ()
90 50,000 .
. GAIL
, Table 1 , PPO GAIL
(sparse reward) . 0.25, 0.5, 0.75, 1.0
4 .
.
,  4.2 결과 및 논의
(dense reward) Fig. 3 ,
. () ()  (time step )
,  . PPO (Default)
. GAIL
, .
. Default
(max step) , ,
1,000 .

22 | Korean Society of Automation and Robotics in Construction |

가상물리환경에서의 건설 크레인 수평 흔들림 억제를 위한 심층 강화학습 기법

Figure 2. Training environment to collect experts’ demonstrations: (1)front view, (2)trolley top view, (3)side view, and (4)rear view

5. 결 론

DRL

.
GAIL

Figure 3. (a) learning curves and (b) episode length of each model
.
. GAIL GAIL
. .
.
.
1~2M
1.0 GAIL 1.0 . ,
, 3M .
GAIL 0.25 GAIL 0.5 ( )
. .
,
.

| Vol.1 No.2 | 23
김인원 ․ 김남균 ․ 정민혁 ․ 안창범 ․ 박문서

In ISARC. Proceedings of the International Symposium on Auto-

감사의 글
mation and Robotics in Construction, IAARC Publications, 37, pp.
457-464.
/ Ramli, L., Mohamed, Z., Abdullahi, A. M., Jaafar, H. I., and Lazim, I. M.
( : 21CTAP-C163785-01). (2017). Control strategies for crane systems: A comprehensive
review. Mechanical Systems and Signal Processing, 95, pp. 1-23.
Rong, G., Shin, B.H., Tabatabaee, H., Lu, Q., Lemke, S., Možeiko, M.,
References Boise, E., Uhm, G., Gerow, M., Mehta, S., and Agafonov, E. (2020).
Lgsvl simulator: A high fidelity simulator for autonomous driving.
In 2020 IEEE 23rd International conference on intelligent trans-
Botvinick, M., Ritter, S., Wang, J. X., Kurth-Nelson, Z., Blundell, C., and
portation systems (ITSC) (pp. 1-6), IEEE.
Hassabis, D. (2019). Reinforcement learning, fast and slow. Trends
Sallab, A. E., Abdou, M., Perot, E., and Yogamani, S. (2017). Deep
in Cognitive Sciences, 23(5), pp. 408-422.
reinforcement learning framework for autonomous driving.
Bhattacharyya, R., Wulfe, B., Phillips, D., Kuefler, A., Morton, J., Senanayake,
Electronic Imaging, 19, pp. 70-76.
R., and Kochenderfer, M. (2020). Modeling human driving behavior
Savva, M., Chang, A. X., Dosovitskiy, A., Funkhouser, T., and Koltun, V.
through generative adversarial imitation learning. arXiv preprint
(2017). MINOS: Multimodal indoor simulator for navigation in
arXiv:2006.06412.
complex environments. arXiv preprint arXiv:1712.03931.
Dosovitskiy, A., Ros, G., Codevilla, F., Lopez, A., and Koltun, V. (2017).
Sawodny, O., Aschemann, H., and Lahres, S. (2002) An automated
CARLA: An open urban driving simulator. In Conference on Robot
gantry crane as a large workspace robot. Control Engineering
Learning, (pp. 1-16), PMLR.
Practice, 10(12), pp. 1323-1338.
Fang, Y., and Cho, Y. K. (2017). Effectiveness analysis from a cognitive
Schulman, J., Wolski, F., Dhariwal, P., Radford, A., and Klimov, O.
perspective for a real-time safety assistance system for mobile
(2017). Proximal policy optimization algorithms. arXiv preprint
crane lifting operations. Journal of Construction Engineering and
arXiv:1707.06347.
Management, 143(4), 05016025.
Shah, S., Dey, D., Lovett, C., and Kapoor, A. (2018). Airsim: High-fidelity
Ho, J., and Ermon, S. (2016). Generative adversarial imitation learning.
visual and physical simulation for autonomous vehicles. In Field
Advances in Neural Information Processing Systems, 29.
and Service Robotics (pp. 621-635), Springer, Cham.
Juliani, A., Berges, V. P., Teng, E., Cohen, A., Harper, J., Elion, C., Goy,
Wu, Z., and Xia, X. (2014). Optimal motion planning for overhead
C., Gao, Y., Henry, H., Mattar, M., and Lange, D. (2018). Unity: A
cranes. IET Control Theory & Applications, 8(17), pp. 1833-1842.
general platform for intelligent agents. arXiv preprint arXiv:1809.
Yang, R., Jiang, C., Miao, Y., Ma, J., Zhang, X., Yang, T., and Sun, N.
02627.
(2019). A flexible rope crane experiment system. Applications of
Kiran, B. R., Sobh, I., Talpaert, V., Mannion, P., Al Sallab, A. A., Yogamani,
Modeling and Simulation, 3(1), pp. 11-17.
S., and Pérez, P. (2021). Deep reinforcement learning for autono-
Yu, Y. (2018). Towards Sample Efficient Reinforcement Learning. In
mous driving: A survey, IEEE Transactions on Intelligent Transpor-
IJCAI (pp. 5739-5743).
tation Systems.
Zhao, W., Queralta, J. P., and Westerlund, T. (2020). Sim-to-real transfer
Kober, J., and Peters, J. (2010). Imitation and reinforcement learning.
in deep reinforcement learning for robotics: a survey. In 2020 IEEE
IEEE Robotics & Automation Magazine, 17(2), pp. 55-62.
Symposium Series on Computational Intelligence (SSCI) (pp. 737-744),
Matsumoto, K., Yamaguchi, A., Oka, T., Yasumoto, M., Hara, S., Iida,
IEEE.24.
M., and Teichmann, M. (2020). Simulation-based Reinforcement
Learning Approach towards Construction Machine Automation,

요 지

핵심용어 :

24 | Korean Society of Automation and Robotics in Construction |

Generative Adversarial Networks with Industrial Use Cases: Learning How to Build GAN Applications for Retail, Healthcare, Telecom, Media, Education, and HRTech
From Everand
Generative Adversarial Networks with Industrial Use Cases: Learning How to Build GAN Applications for Retail, Healthcare, Telecom, Media, Education, and HRTech
Navin K Manaswi
No ratings yet
Cersai: Central Registry of Securitisation Asset Reconstruction and Security Interest of India
100% (1)
Cersai: Central Registry of Securitisation Asset Reconstruction and Security Interest of India
3 pages
Science Inquiry Lesson Plan
100% (1)
Science Inquiry Lesson Plan
5 pages
Ibdc Paper.
No ratings yet
Ibdc Paper.
44 pages
자율성장인공지능기술
No ratings yet
자율성장인공지능기술
12 pages
N0410010201
No ratings yet
N0410010201
6 pages
___________(1)
No ratings yet
___________(1)
13 pages
딥러닝 기반 의미론적 분할 기법을 통한 건물 자동추출 연구 모델의 가중치 경중과 전이학습에
No ratings yet
딥러닝 기반 의미론적 분할 기법을 통한 건물 자동추출 연구 모델의 가중치 경중과 전이학습에
11 pages
N0410010103
No ratings yet
N0410010103
6 pages
Week9 CIV2020 Lecture Note Rev(2)
No ratings yet
Week9 CIV2020 Lecture Note Rev(2)
65 pages
유전 알고라즘 기반 학습자 중심형 메타버스 교육 프레임워크
No ratings yet
유전 알고라즘 기반 학습자 중심형 메타버스 교육 프레임워크
62 pages
그로킹 딥러닝 _ 알기 쉬운 비유와 기초 수학으로 시작하는
No ratings yet
그로킹 딥러닝 _ 알기 쉬운 비유와 기초 수학으로 시작하는
54 pages
(Legal Code) Disclaimer
No ratings yet
(Legal Code) Disclaimer
79 pages
N0410030302
No ratings yet
N0410030302
8 pages
Maarten Grootendorst: Notion
No ratings yet
Maarten Grootendorst: Notion
30 pages
Modified DDPG car-following model with a real-world human driving experience with CARLA simulator
No ratings yet
Modified DDPG car-following model with a real-world human driving experience with CARLA simulator
34 pages
18 5 33
No ratings yet
18 5 33
12 pages
(Legal Code) Disclaimer
No ratings yet
(Legal Code) Disclaimer
57 pages
GSAI KAIRI Summer-2023 Research-Topics
No ratings yet
GSAI KAIRI Summer-2023 Research-Topics
5 pages
Training and Testing Neural Networks
No ratings yet
Training and Testing Neural Networks
28 pages
KSCE_1_2022_10_076
No ratings yet
KSCE_1_2022_10_076
8 pages
Paper 3 Mlis
No ratings yet
Paper 3 Mlis
9 pages
2506.03568v2
No ratings yet
2506.03568v2
13 pages
35-3_20-33
No ratings yet
35-3_20-33
14 pages
JUnit in Depth: Definitive Reference for Developers and Engineers
From Everand
JUnit in Depth: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
SNN Cmos Paper
No ratings yet
SNN Cmos Paper
59 pages
튜터링_1차시
No ratings yet
튜터링_1차시
17 pages
10、《Let Hybrid a Path Planner Obey Traffic Rules a Deep Reinforcement Learning-Based Planning Framework》
No ratings yet
10、《Let Hybrid a Path Planner Obey Traffic Rules a Deep Reinforcement Learning-Based Planning Framework》
8 pages
Training and Testing Neural Networks
No ratings yet
Training and Testing Neural Networks
28 pages
Collision Avoidance Using RL
No ratings yet
Collision Avoidance Using RL
19 pages
Toward HITL AI Enhancing Deep Reinforcement Learning Via RealTime Human Guidance For Autonomous Driving
No ratings yet
Toward HITL AI Enhancing Deep Reinforcement Learning Via RealTime Human Guidance For Autonomous Driving
17 pages
Three Dimensional Computer Graphics: Exploring the Intersection of Vision and Virtual Worlds
From Everand
Three Dimensional Computer Graphics: Exploring the Intersection of Vision and Virtual Worlds
Fouad Sabry
No ratings yet
Data Mining 2
No ratings yet
Data Mining 2
24 pages
01_제어시스템소개_배포
No ratings yet
01_제어시스템소개_배포
22 pages
KIISE Transactions On Computing Practices: ISSN 2383-6318 (Print) ISSN 2383-6326 (Online)
No ratings yet
KIISE Transactions On Computing Practices: ISSN 2383-6318 (Print) ISSN 2383-6326 (Online)
8 pages
8-Reinforcement learning-based control with application through steam generator system
No ratings yet
8-Reinforcement learning-based control with application through steam generator system
10 pages
A Novel Hybrid-Action-Based Deep Reinforcement Learning for Industrial Energy Management
No ratings yet
A Novel Hybrid-Action-Based Deep Reinforcement Learning for Industrial Energy Management
15 pages
Stable_training_via_elastic_ad
No ratings yet
Stable_training_via_elastic_ad
9 pages
FCBDC42E687770
No ratings yet
FCBDC42E687770
8 pages
Deep Reinforcement Learning With Heuristic Correct
No ratings yet
Deep Reinforcement Learning With Heuristic Correct
18 pages
ML 01
No ratings yet
ML 01
34 pages
lec2 - 딥러닝 기초
No ratings yet
lec2 - 딥러닝 기초
73 pages
Design_and_Experimental_Validation_of_Deep_Reinforcement_Learning-Based_Fast_Trajectory_Planning_and_Control_for_Mobile_Robot_in_Unknown_Environment
No ratings yet
Design_and_Experimental_Validation_of_Deep_Reinforcement_Learning-Based_Fast_Trajectory_Planning_and_Control_for_Mobile_Robot_in_Unknown_Environment
15 pages
스크린샷, 2025-05-30 오후 10.51.08
No ratings yet
스크린샷, 2025-05-30 오후 10.51.08
7 pages
Geometric Feature Learning: Unlocking Visual Insights through Geometric Feature Learning
From Everand
Geometric Feature Learning: Unlocking Visual Insights through Geometric Feature Learning
Fouad Sabry
No ratings yet
learn to learn
No ratings yet
learn to learn
17 pages
KSCE_1_2022_10_036
No ratings yet
KSCE_1_2022_10_036
4 pages
An Obstacle Avoidance Specific Reinforcement Learni - 2024 - Engineering Applica
No ratings yet
An Obstacle Avoidance Specific Reinforcement Learni - 2024 - Engineering Applica
14 pages
Comparison of Multiple Reinforcement Learning and Deep Reinforcement Learning Methods For The Task Aimed at Achieving The Goal
No ratings yet
Comparison of Multiple Reinforcement Learning and Deep Reinforcement Learning Methods For The Task Aimed at Achieving The Goal
9 pages
Foundational Models and Architectures S1: Generative AI, #1
From Everand
Foundational Models and Architectures S1: Generative AI, #1
Leaster Startx
No ratings yet
Copykiller Summary Report
No ratings yet
Copykiller Summary Report
8 pages
24 Summer
No ratings yet
24 Summer
23 pages
Optimization of Design Parameters of A EPPR Valve Solenoid Using Artificial Neural Network
No ratings yet
Optimization of Design Parameters of A EPPR Valve Solenoid Using Artificial Neural Network
8 pages
Design and Implementation of an Environment for Learning to Run a Power Network (L2RPN)
No ratings yet
Design and Implementation of an Environment for Learning to Run a Power Network (L2RPN)
18 pages
PGP Report Sachin t22060
No ratings yet
PGP Report Sachin t22060
20 pages
Spatial Engineering and Survey Techniques
From Everand
Spatial Engineering and Survey Techniques
Rajendra Asan
No ratings yet
Blob Detection: Unveiling Patterns in Visual Data
From Everand
Blob Detection: Unveiling Patterns in Visual Data
Fouad Sabry
No ratings yet
Donkey Car Depp Reinforcement Learning
No ratings yet
Donkey Car Depp Reinforcement Learning
7 pages
Articulated Body Pose Estimation: Unlocking Human Motion in Computer Vision
From Everand
Articulated Body Pose Estimation: Unlocking Human Motion in Computer Vision
Fouad Sabry
No ratings yet
Reinenforement Learning With Pid Loop
No ratings yet
Reinenforement Learning With Pid Loop
7 pages
Deep Reinforcement Learning For Power System
No ratings yet
Deep Reinforcement Learning For Power System
13 pages
Automatic Anti-Swing Gantry Crane Based On PID6
No ratings yet
Automatic Anti-Swing Gantry Crane Based On PID6
6 pages
JMockit in Practice: Definitive Reference for Developers and Engineers
From Everand
JMockit in Practice: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Full body pose estimation of construction equipment using computer vision and deep learning techniques
No ratings yet
Full body pose estimation of construction equipment using computer vision and deep learning techniques
19 pages
10-1108_ci-04-2023-0062
No ratings yet
10-1108_ci-04-2023-0062
28 pages
1-s2.0-S0263224123013349-main
No ratings yet
1-s2.0-S0263224123013349-main
14 pages
Developing Risk Breakdown Structure
No ratings yet
Developing Risk Breakdown Structure
10 pages
f24287c9-c37f-0fb1-ee76-f9b4a79ed1ce_1771382902
No ratings yet
f24287c9-c37f-0fb1-ee76-f9b4a79ed1ce_1771382902
26 pages
Risk Analysis - 2009 - Cox JR - What S Wrong With Hazard Ranking Systems An Expository Note
No ratings yet
Risk Analysis - 2009 - Cox JR - What S Wrong With Hazard Ranking Systems An Expository Note
9 pages
ispa2005afinal
No ratings yet
ispa2005afinal
6 pages
1 s2.0 S0925753509002161 Main
No ratings yet
1 s2.0 S0925753509002161 Main
9 pages
MU-based full-body pose estimation for construction machines using kinematics modeling
No ratings yet
MU-based full-body pose estimation for construction machines using kinematics modeling
17 pages
A_multiple_object_tracking_method_using_Kalman_filter
No ratings yet
A_multiple_object_tracking_method_using_Kalman_filter
5 pages
Dispense_ENG
No ratings yet
Dispense_ENG
217 pages
ariaudo21ms
No ratings yet
ariaudo21ms
85 pages
Development and Application of Safety Technology Adoption Decision-Making Tool
No ratings yet
Development and Application of Safety Technology Adoption Decision-Making Tool
15 pages
KSCE_1_2022_10_030
No ratings yet
KSCE_1_2022_10_030
6 pages
1 s2.0 S1474034616300349 Main
No ratings yet
1 s2.0 S1474034616300349 Main
19 pages
1 s2.0 S0022437516000128 Main
No ratings yet
1 s2.0 S0022437516000128 Main
12 pages
Falls From Heights - A Computer Vision-Based Approach For Safety Harness Detection
No ratings yet
Falls From Heights - A Computer Vision-Based Approach For Safety Harness Detection
9 pages
1 s2.0 S0166361515300464 Main
No ratings yet
1 s2.0 S0166361515300464 Main
16 pages
A Criterion For Developing Credible Accident Scenarios For Risk Assessment
No ratings yet
A Criterion For Developing Credible Accident Scenarios For Risk Assessment
9 pages
What Drives Construction Workers Acceptance of Wearable Technologies in The Workplace
No ratings yet
What Drives Construction Workers Acceptance of Wearable Technologies in The Workplace
11 pages
Metatheoretical Foundations For Post Normal Risk
No ratings yet
Metatheoretical Foundations For Post Normal Risk
31 pages
Comparison of Techniques For Accident Scenario Analysis in Hazardous Systems
No ratings yet
Comparison of Techniques For Accident Scenario Analysis in Hazardous Systems
9 pages
RiskManagement B00246928
No ratings yet
RiskManagement B00246928
8 pages
A Discussion of The Acceptable Risk Problem
No ratings yet
A Discussion of The Acceptable Risk Problem
9 pages
On How To Define, Understand and Describe Risk
No ratings yet
On How To Define, Understand and Describe Risk
9 pages
Risk Analysis - 2008 - Anthony Tony Cox - What S Wrong With Risk Matrices
No ratings yet
Risk Analysis - 2008 - Anthony Tony Cox - What S Wrong With Risk Matrices
16 pages
Risk Analysis - 2005 - Cox - Some Limitations of Qualitative Risk Rating Systems
No ratings yet
Risk Analysis - 2005 - Cox - Some Limitations of Qualitative Risk Rating Systems
12 pages
Standard Protections/Alarms Features
No ratings yet
Standard Protections/Alarms Features
1 page
MT 2 Mod
No ratings yet
MT 2 Mod
1 page
Full Absorption & Variable Costing Methods (Answers)
No ratings yet
Full Absorption & Variable Costing Methods (Answers)
3 pages
Powersarj Catalogue..
No ratings yet
Powersarj Catalogue..
18 pages
Ed20c Sociometry
No ratings yet
Ed20c Sociometry
19 pages
Ann Cristy - No Gentle Possession (1984)
87% (39)
Ann Cristy - No Gentle Possession (1984)
109 pages
4700 6700 User
No ratings yet
4700 6700 User
170 pages
Advanced Soil Mechanics Assignment 2018
No ratings yet
Advanced Soil Mechanics Assignment 2018
43 pages
2 Information About Insects
No ratings yet
2 Information About Insects
3 pages
Doctrine of Limited Liability
No ratings yet
Doctrine of Limited Liability
46 pages
Heroclix Marvel - 1 Infinity Challenge Rulebook
100% (3)
Heroclix Marvel - 1 Infinity Challenge Rulebook
28 pages
Alpha 1800 Cta GB Neu
100% (1)
Alpha 1800 Cta GB Neu
33 pages
BTFL 1
No ratings yet
BTFL 1
4 pages
REVIEW - TEST 01 - CHAPTER 01-02 - MAS291 - SU23 - On
No ratings yet
REVIEW - TEST 01 - CHAPTER 01-02 - MAS291 - SU23 - On
36 pages
Report Bakery Management System
100% (1)
Report Bakery Management System
21 pages
4 Introduction To The Divergence Theorem and Stoke's Theorem (Short)
No ratings yet
4 Introduction To The Divergence Theorem and Stoke's Theorem (Short)
116 pages
Project at A Glance - Top Sheet: Taluk/Block: District: Pin: State: E-Mail: Mobile
No ratings yet
Project at A Glance - Top Sheet: Taluk/Block: District: Pin: State: E-Mail: Mobile
7 pages
DOC-20241211-WA0020.
No ratings yet
DOC-20241211-WA0020.
41 pages
Analisis Penutupan Lahan Kawasan Hutan Pada Daerah Aliran Sungai Krueng Aceh Pra Dan Pasca Tsunami
No ratings yet
Analisis Penutupan Lahan Kawasan Hutan Pada Daerah Aliran Sungai Krueng Aceh Pra Dan Pasca Tsunami
8 pages
Nobility and Eros The Noble Succubus
85% (13)
Nobility and Eros The Noble Succubus
13 pages
Economics For Decision Making MBA 641
No ratings yet
Economics For Decision Making MBA 641
25 pages
Chapter-Two: Stakeholders Analysis and Strategic Intent
No ratings yet
Chapter-Two: Stakeholders Analysis and Strategic Intent
51 pages
BEAMA Guide To LV BTS Verified To IEC 61439-6
No ratings yet
BEAMA Guide To LV BTS Verified To IEC 61439-6
22 pages
Hippo S19 Reading Preliminary 2022
0% (1)
Hippo S19 Reading Preliminary 2022
10 pages
Internship Proposal Pertamina
100% (1)
Internship Proposal Pertamina
4 pages
AMD FX Performance Tuning Guide
No ratings yet
AMD FX Performance Tuning Guide
20 pages

N0410010204

Uploaded by

N0410010204

Uploaded by

Journal of Construction Automation and Robotics pISSN: 2800-0552, eISSN: 2951-116X

Vol. 1 No. 2, pp. 19-24 / July, 2022 DOI: https://doi.org/10.55785/JCAR.1.2.19

가상물리환경에서의 건설 크레인 수평 흔들림 억제를 위한

김인원1 ․ 김남균2 ․ 정민혁3 ․ 안창범4 ․ 박문서5

(Zhao et al., 2020; Sallab et al., 2017). 75

20 | Korean Society of Automation and Robotics in Construction |

2.3 표본 효율성 향상을 위한 모방학습

(Kober and Peters, 2010).

(behavior cloning) 3. DRL 모형 개발

. 3.1 가상물리환경에서의 크레인 및 작업 정의

. () () ()

Table 1. State space and reward function

, (), () 3.3 정책 네트워크

22 | Korean Society of Automation and Robotics in Construction |

In ISARC. Proceedings of the International Symposium on Auto-

24 | Korean Society of Automation and Robotics in Construction |

You might also like