HCIA-Intelligent Computing V1.0 Lab Guide
Issue: 1.0
No part of this document may be reproduced or transmitted in any form or by any means without
prior written consent of Huawei Technologies Co., Ltd.
and other Huawei trademarks are trademarks of Huawei Technologies Co., Ltd.
All other trademarks and trade names mentioned in this document are the property of their
respective holders.
Notice
The purchased products, services and features are stipulated by the contract made between Huawei
and the customer. All or part of the products, services and features described in this document may
not be within the purchase scope or the usage scope. Unless otherwise specified in the contract, all
statements, information, and recommendations in this document are provided "AS IS" without
warranties, guarantees or representations of any kind, either expressed or implied.
The information in this document is subject to change without notice. Every effort has been made in
the preparation of this document to ensure accuracy of the contents, but all statements, information,
and recommendations in this document do not constitute a warranty of any kind, express or implied.
Website: https://e.huawei.com/en
Contents
Reference documents:
Software:
BIOS
iBMC
Reference links:
https://support.huawei.com/enterprise/en/doc/EDOC1100019358/
https://e.huawei.com/en
HCIA – Management Software Operation Guide for Trainees Page 5
2.2 Objectives
After the course, the trainees will be able to:
2.4 Tasks
[Task Overview]-Task Flowchart
Background
The monitoring and O&M of the seismic monitoring platform is not intelligent. For
example, faults need to be identified and rectified manually one by one, which results
in high labor and material costs.
Suppose you have a Huawei rack server. Log in to the iBMC web user interface (WebUI),
and view alarms and logs of the server and perform system configuration and
management.
Question
How do you perform operations on the iBMC CLI?
Requirements: Take screenshots of the key steps for viewing information and configuring
the system, and name the screenshots in the "1.1 iBMC Configure-N" format. N indicates
the sequence number of the screenshot. The screenshots for each question are numbered
from 1.
Evaluation criteria:
1.6 Enable power capping and set the smart cooling mode to High performance mode.
1.10 Mount an image file to the server through the remote console.
Background
RAID is configured to reduce errors and improve the performance and reliability of the
storage system. Generally, RAID needs to be configured for a newly purchased server.
Suppose you have a rack server (configured with an LSI SAS3108 RAID controller card).
Restart the server, access the RAID Configuration Utility, and create a RAID 5 array.
Notice:
During the login process, you are asked to install and run the Java program. Perform
operations as prompted. In addition, you need to manually add the iBMC address to the
Exception Site List in the Java Control Panel or lower the Java security level.
Data on a hard disk will be deleted after the hard disk is added to a RAID array.
Before creating a RAID array, check that there is no data on hard disks or the data
on hard disks is not required.
Disks of the same type and specifications must be used in a RAID array.
Question
What are the precautions to be observed when you configure RAID 5? What are the
application scenarios of other RAID levels?
Reference: RAID levels and Huawei V5 Server RAID Controller Card User Guide
RAID 0
RAID 1
RAID 5
RAID 6
RAID 1E
RAID 10
RAID 50
RAID 60
Requirements: Screenshot the key steps and name the screenshots in the "1.1 RAID
Configure-N" format. N indicates the sequence number of the screenshot. The
screenshots for each question are numbered from 1.
Evaluation criteria:
Background
Suppose you have a Huawei rack server. Access the BIOS interface and query the
internal information, including the CPU, memory, and disk information of the server.
Then, set the boot mode of the server.
Question
How do you set the server boot mode to Legacy?
Requirements: Screenshot the key steps and name the screenshots in the "1.1 BIOS
Configure-N" format. N indicates the sequence number of the screenshot. The
screenshots for each question are numbered from 1.
Evaluation criteria:
Task 1
Task 2
XXX case
XXX Task 3
(trainee/group)
Task 4
Total score
Network diagram and data planning.xlsx
Management Software Operation Guide for Trainers
Issue: 1.0
Contents
Reference documents:
Software:
BIOS
iBMC
Reference links:
https://support.huawei.com/enterprise/en/doc/EDOC1100019358/
https://e.huawei.com/en
HCIA – Management Software Operation Guide for Trainers Page 5
Phase    Actions    Duration (Minutes)
Group trainees.
2.3 Objectives
After the course, the trainees will be able to:
2) Divide trainees into two to four groups. Each group has three to five people.
3) Rearrange the tables in the classroom by groups, and print group number plates for
each group.
Background
The monitoring and O&M of the seismic monitoring platform is not intelligent. For
example, faults need to be identified and rectified manually one by one, which results
in high labor and material costs.
Suppose you have a Huawei rack server. Log in to the iBMC web user interface (WebUI),
and view alarms and logs of the server and perform system configuration and
management.
Question
How do you perform operations on the iBMC CLI?
Requirements: Take screenshots of the key steps for viewing information and configuring
the system, and name the screenshots in the "1.1 iBMC Configure-N" format. N indicates
the sequence number of the screenshot. The screenshots for each question are numbered
from 1.
Evaluation criteria:
1.6 Enable power capping and set the smart cooling mode to High performance mode.
1.10 Mount an image file to the server through the remote console.
[Operation Guide]
Please refer to the FusionServer Pro Rack Server iBMC (V300 to V369) User Guide.
Note: The default user name and password of the V5 server are Administrator and
Admin@9000 respectively.
Background
RAID is configured to reduce errors and improve the performance and reliability of the
storage system. Generally, RAID needs to be configured for a newly purchased server.
Suppose you have a rack server (configured with an LSI SAS3108 RAID controller card).
Restart the server, access the RAID Configuration Utility, and create a RAID 5 array.
Notice:
During the login process, you are asked to install and run the Java program. Perform
operations as prompted. In addition, you need to manually add the iBMC address to the
Exception Site List in the Java Control Panel or lower the Java security level.
Data on a hard disk will be deleted after the hard disk is added to a RAID array.
Before creating a RAID array, check that there is no data on hard disks or the data
on hard disks is not required.
Disks of the same type and specifications must be used in a RAID array.
Question
What are the precautions to be observed when you configure RAID 5? What are the
application scenarios of other RAID levels?
Reference: RAID levels and Huawei V5 Server RAID Controller Card User Guide
https://support.huawei.com/enterprise/en/doc/EDOC1000163569/b9b6ef50
RAID 0
RAID 1
RAID 5
RAID 6
RAID 1E
RAID 10
RAID 50
RAID 60
[Answer]
RAID Level  Reliability      Read Performance  Write Performance  Min. Number of Disks  Disk Utilization
RAID 5      Relatively high  High              Medium             3                     (N-1)/N
RAID 6      Relatively high  High              Medium             4                     (N-2)/N
RAID 50     High             High              Relatively high    6                     (N-M)/N
RAID 60     High             High              Relatively high    8                     (N-M*2)/N
N indicates the number of disks in a RAID array, and M indicates the number of
spans in a RAID array.
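The disk utilization formulas can be checked with a short calculation. The following Python sketch is illustrative only: the function name `disk_utilization` and the choice of two spans for RAID 50 and RAID 60 are assumptions made for the example, and the formulas are applied exactly as listed in the answer table.

```python
from fractions import Fraction


def disk_utilization(level: str, n: int, m: int = 1) -> Fraction:
    """Return the usable fraction of raw capacity for a RAID array.

    n is the number of disks in the array and m is the number of spans,
    matching the N and M used in the answer table.
    """
    formulas = {
        "RAID 5": lambda: Fraction(n - 1, n),
        "RAID 6": lambda: Fraction(n - 2, n),
        "RAID 50": lambda: Fraction(n - m, n),
        "RAID 60": lambda: Fraction(n - m * 2, n),
    }
    if level not in formulas:
        raise ValueError(f"no formula listed for {level}")
    return formulas[level]()


# Minimum disk counts from the table; two spans are assumed for RAID 50/60.
print(disk_utilization("RAID 5", 3))        # 2/3
print(disk_utilization("RAID 6", 4))        # 1/2
print(disk_utilization("RAID 50", 6, m=2))  # 2/3
print(disk_utilization("RAID 60", 8, m=2))  # 1/2
```

With the minimum number of disks, RAID 5 and RAID 50 both leave two thirds of the raw capacity usable, while RAID 6 and RAID 60 leave half.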
Requirements: Screenshot the key steps and name the screenshots in the "1.1 RAID
Configure-N" format. N indicates the sequence number of the screenshot. The
screenshots for each question are numbered from 1.
Evaluation criteria:
[Operation Guide]
1.1-1.4: See chapter 6 in the Huawei V5 Server RAID Controller Card User Guide.
Rules
After the discussion, each group summarizes its results and assigns a representative
to describe how the group approached the RAID 5 configuration. The trainer encourages
trainees in other groups to ask questions and make comments. Considerations for
evaluating trainees include:
Select the best team by comparing their output. This team adds 1 point to their total
score.
Background
Suppose you have a Huawei rack server. Access the BIOS interface and query the
internal information, including the CPU, memory, and disk information of the server.
Then, set the boot mode of the server.
Question
How do you set the server boot mode to Legacy?
Requirements: Screenshot the key steps and name the screenshots in the "1.1 BIOS
Configure-N" format. N indicates the sequence number of the screenshot. The
screenshots for each question are numbered from 1.
Evaluation criteria:
[Operation Guide]
Please refer to the Huawei Server Purley Platform BIOS Parameter Reference:
https://support.huawei.com/enterprise/en/doc/EDOC1000163372
Rules
After the discussion, each group summarizes their discussion results and assigns a
representative to describe the process. The trainer guides trainees in other groups to
ask questions and make comments. Considerations for the evaluation on trainees
include:
Select the best team by comparing their output. This team adds 1 point to their total
score.
1. Blank paper (four pieces for each group), marker pens of three colors (one set for
each group), and whiteboard stickers (10 pieces for each group)
2. A printed copy of the lab networking and data plan for each trainee
Network diagram and data planning.xlsx
No.    Description    Assessment Point    Score
Task 1
Task 2
XXX case
XXX Task 3
(trainee/group)
Task 4
Total score
Management Software Operation Guide
Page 2 Copyright © 2019 Huawei Technologies Co., Ltd. All rights reserved.
Contents
1. Background
4. BIOS Configuration
Drill Background
Background
Objectives
After completing this course, you will be able to understand and grasp:
Basic functions of the server management software
Application scenarios of different RAID levels
iBMC, RAID, and BIOS operation processes
V5 Rack Server Management Software
Deployment
Objectives Forms of Discussion
Task Flowchart
Operations on the iBMC management platform:
View alarm and diagnosis information.
Configure system management settings.
iBMC Management Platform Operations
iBMC functions
Operations on the iBMC Management Platform
Group discussion: 8 minutes
Presentation/group: 3 minutes
Comments: 5 minutes
Operations on the iBMC Management
Platform
Background:
The monitoring and O&M of the seismic monitoring platform is not intelligent. For
example, faults need to be identified and rectified manually one by one, which results in
high labor and material costs.
Question:
How do you perform operations on the iBMC command-line interface (CLI)?
[Task Overview]-Task Flowchart
Start End
Task 1: Log in to the iBMC WebUI of a 2288H V5, query the system
information, and fill in the following table.
Processor model
Reference answer
RAID Operations
Background:
RAID is configured to reduce errors and improve the performance and reliability of the
storage system. Generally, RAID needs to be configured for a newly purchased server.
Suppose you have a rack server. Restart the server, access the RAID Configuration Utility,
and create a RAID 5 array.
Question:
What are the precautions to be observed during the configuration of a RAID 5 array? What
are the application scenarios of other RAID levels?
Reference: common RAID types of 2288H V5 servers and Huawei V5 Server RAID Controller Card User
Guide
[Task Overview]-Task Flowchart
Start End
Task 1: Compare RAID levels.
RAID Level  Reliability  Read Performance  Write Performance  Min. Number of Disks  Disk Utilization
RAID 0
RAID 1
RAID 5
RAID 6
RAID 1E
RAID 10
RAID 50
RAID 60
[Reference answer]
RAID Level  Reliability  Read Performance  Write Performance  Min. Number of Disks  Disk Utilization
RAID 0      Low          High              High               2                     100%
BIOS Management Platform Operations
Background:
Suppose you have a Huawei rack server. Access the BIOS interface and query the
internal information, including the CPU, memory, and disk information of the
server. Then, set the boot mode of the server.
Question:
How do you set the server boot mode to Legacy?
[Task Overview]-Task Flowchart
Start End
Task 1: Check the disk information and fill in the following table.
SATA controller    Port 0
                   Port 1
[Reference answer]
Port 0 Enabled
Port 1 Enabled
Task 2: Set the server boot mode to Legacy, write down the operation
procedure, and take a screenshot.
[Reference answer]
1. Log in to the BIOS. For details, see the user guide. Choose Boot > Boot Type, and
press Enter.
2. In the dialog box displayed, choose Legacy.
Summary
Perform initial configuration of V5 rack servers after the study.
Understand the functions and basic working principles of the management
software.
Recommendations
Revision Record
Course Code Product Product Version Course Version
ISSUE: 1.0
Contents
2 Overview
2.1 Course Introduction
Reference links:
https://docs.ansible.com/
https://support-open.huawei.com/en
https://e.huawei.com/en
HCIA - Server Intelligent O&M Guide for Trainees Page 5
2 Overview
2.2 Objectives
Upon completion of this course, you will be able to:
Understand the modes and scenarios of Ansible installation and deployment.
Manage servers in batches using the ad-hoc command of Ansible.
Assume that you are an IT system engineer of company Z, and you need to complete
the following tasks and configuration.
2.4 Tasks
IP address: 192.168.1.100
Host01
Host03
Command
Task 4: Copy the test.sh File from the Control End to the /tmp/
Directory on the Target Host, and Set the Owner and Group of the
File to root with the File Permission rwxr-xr-x
Write the command:
Task 5: Check the uid and gid Information of the /etc/sysctl.conf File
on the Remote Host Group Webservers
Write the command:
Task 7: Enable the HTTP Service for the Remote Host Group
Webservers and Check the Service Status
Write the commands:
Task 8: Create and Delete the /home/f1 File on the Remote Server
Group Webservers
Write the commands:
Assessment point 1
Assessment point 2
Case xx
Assessment point 3
Trainee/Group xx
Assessment point 4
Total score
Revision Record
Course Code Product Product Version Course Version
ISSUE: 1.0
Contents
2 Overview
2.1 Teaching Procedure
Reference links:
https://docs.ansible.com/
https://support-open.huawei.com/en
https://e.huawei.com/en
HCIA - Server Intelligent O&M Guide for Trainers Page 5
2 Overview
Phase    Actions    Duration (minutes)
2.3 Objectives
Upon completion of this course, you will be able to:
Understand the modes and scenarios of Ansible installation and deployment.
Manage servers in batches using the ad-hoc command of Ansible.
2. Divide trainees into two to four groups. Each group has three to five persons.
3. Rearrange the tables in the classroom by groups, and print group number plates
for each group.
Assume that you are an IT system engineer of company Z, and you need to complete
the following tasks and configuration.
IP address: 192.168.1.100
Host01
Host03
[Reference Answer]
CentOS (Yum)
2. Install Ansible.
$ sudo yum install -y ansible
[Reference Answer]
ssh-keygen
[root@centos ~]# ssh-keygen
Generating public/private rsa key pair.
Enter file in which to save the key (/root/.ssh/id_rsa):
Enter passphrase (empty for no passphrase):
Enter same passphrase again:
Your identification has been saved in /root/.ssh/id_rsa.
Your public key has been saved in /root/.ssh/id_rsa.pub.
The key fingerprint is:
[Reference Answer]
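# vi /etc/ansible/ansible.cfg
[defaults]
inventory = /etc/ansible/hosts
forks = 5
remote_port = 22
host_key_checking = False
timeout = 10
log_path = /var/log/ansible.log
private_key_file = /root/.ssh/id_rsa

# The privilege escalation user is read from [privilege_escalation], not [defaults].
[privilege_escalation]
become_user = root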
# cat /etc/ansible/hosts
[webservers]
192.168.1.101
192.168.1.102
192.168.1.103
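To see how the hosts file groups the managed hosts, the following Python sketch reads the same text with the standard `configparser` module. This is only an illustration under a simplifying assumption: it handles a plain group of bare host addresses like the one above, and it is not how Ansible itself parses inventories (which also support host variables, ranges, and child groups). The function name `hosts_in_group` is hypothetical.

```python
import configparser

# Same content as the /etc/ansible/hosts file shown above.
INVENTORY = """\
[webservers]
192.168.1.101
192.168.1.102
192.168.1.103
"""


def hosts_in_group(inventory_text: str, group: str) -> list:
    """Return the bare host entries listed under one [group] header."""
    # allow_no_value lets each host line act as a key without a value.
    parser = configparser.ConfigParser(allow_no_value=True)
    parser.read_string(inventory_text)
    return list(parser[group])


print(hosts_in_group(INVENTORY, "webservers"))
# ['192.168.1.101', '192.168.1.102', '192.168.1.103']
```

Any ad-hoc command that names `webservers` as its host pattern runs against these three addresses.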
Command
[Reference Answer]
[root@localhost ~]# ansible webservers -m ping
[Reference Answer]
[root@localhost ~]# ansible webservers -m command -a 'ip addr show dev eth0'
[Reference Answer]
[root@localhost ~]# ansible webservers -m shell -a "/home/test.sh"
The /home/test.sh script must exist on the remote host and have the execution
permission.
[root@localhost ~]# more test.sh
echo "Welcome to Huawei Cloud"
chmod 777 test.sh
Task 4: Copy the test.sh File from the Control End to the /tmp/
Directory on the Target Host, and Set the Owner and Group of the
File to root with the File Permission rwxr-xr-x
Write the command:
[Reference Answer]
[root@localhost ~]# ansible webservers -m copy -a "src=/home/test.sh dest=/tmp/ owner=root group=root mode=0755"
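The task states the permission as rwxr-xr-x, while the command passes mode=0755. The following Python sketch shows why the two are equivalent. The helper name `symbolic_to_octal` is an assumption for illustration; `stat.filemode` is part of the Python standard library.

```python
import stat


def symbolic_to_octal(perms: str) -> str:
    """Convert a nine-character permission string such as 'rwxr-xr-x' to octal."""
    if len(perms) != 9:
        raise ValueError("expected nine characters, for example 'rwxr-xr-x'")
    value = 0
    for ch in perms:
        # Each position is one bit: set unless the character is '-'.
        value = (value << 1) | (0 if ch == "-" else 1)
    return format(value, "04o")


print(symbolic_to_octal("rwxr-xr-x"))  # 0755
# Reverse direction: 0o100000 marks a regular file, 0o755 is the permission part.
print(stat.filemode(0o100755))         # -rwxr-xr-x
```

Each group of three characters (owner, group, others) maps to one octal digit: rwx is 7 and r-x is 5.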
Task 5: Check the uid and gid Information of the /etc/sysctl.conf File
on the Remote Host Group Webservers
Write the command:
[Reference Answer]
[root@localhost ~]# ansible webservers -m stat -a "path=/etc/sysctl.conf"
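The stat module reports file metadata, including the numeric uid and gid, for the path on each managed host. As a rough local analogy, the following Python sketch reads the same kind of metadata with `os.stat`. The function name `ownership` is hypothetical, and a temporary file stands in for /etc/sysctl.conf so that the sketch runs on any machine.

```python
import os
import tempfile


def ownership(path: str) -> dict:
    """Return the numeric owner, group, and permission bits of a file."""
    info = os.stat(path)
    return {
        "uid": info.st_uid,
        "gid": info.st_gid,
        # Keep only the permission bits and show them in octal.
        "mode": format(info.st_mode & 0o7777, "04o"),
    }


# A temporary file is used here instead of /etc/sysctl.conf (assumption for portability).
with tempfile.NamedTemporaryFile() as tmp:
    print(ownership(tmp.name))
```

On a typical Linux host, a file owned by root reports uid 0 and gid 0.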
[Reference Answer]
[root@localhost ~]# ansible webservers -m yum -a "name=httpd state=latest disable_gpg_check=yes enablerepo=epel"
#name: package name
#state (Choices: present, installed, latest, absent, removed)[Default: present]
#disable_gpg_check: disables the gpg check
#enablerepo: enables only the specified repo
Task 7: Enable the HTTP Service for the Remote Host Group
Webservers and Check the Service Status
Write the commands:
[Reference Answer]
Task 8: Create and Delete the /home/f1 File on the Remote Server
Group Webservers
Write the commands:
[Reference Answer]
ansible webservers -m file -a 'name=/home/f1 state=touch'
ansible webservers -m file -a 'name=/home/f1 state=absent'
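The two states used with the file module describe a desired result rather than an action: touch ensures that the file exists, and absent ensures that it does not, so repeating the absent command is harmless. The following Python sketch imitates that behavior locally. It is an analogy under stated assumptions, not Ansible's implementation: the function name `ensure_file` is hypothetical, a temporary directory replaces /home, and `missing_ok` requires Python 3.8 or later.

```python
import tempfile
from pathlib import Path


def ensure_file(path: Path, state: str) -> bool:
    """Apply the requested state and report whether the file exists afterwards."""
    if state == "touch":
        # Creates an empty file, or updates the timestamp if it already exists.
        path.touch(exist_ok=True)
    elif state == "absent":
        # Removes the file; does nothing if it is already gone.
        path.unlink(missing_ok=True)
    else:
        raise ValueError(f"unsupported state: {state}")
    return path.exists()


with tempfile.TemporaryDirectory() as workdir:
    target = Path(workdir) / "f1"
    print(ensure_file(target, "touch"))   # True
    print(ensure_file(target, "absent"))  # False
    print(ensure_file(target, "absent"))  # False (already absent, no error)
```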
[Reference Answer]
# main.yml
---
- hosts: webservers
  tasks:
    - name: Add repo
      yum_repository:
        name: nginx
        description: nginx repo
        baseurl: http://nginx.org/packages/centos/7/$basearch/
        gpgcheck: no
        enabled: 1
    - name: Install nginx
      yum:
        name: nginx
        state: latest
    - name: Start nginx
      service:
        name: nginx
        state: started
1. Prepare large blank paper (5 pieces for each group), markers of three colors (1
set for each group), and whiteboard stickers (10 for each group).
3. The network diagrams required in the tasks must be printed in advance. One
copy for each trainee.
No.    Description    Assessment Point    Score
Assessment point 1
Assessment point 2
Case xx
Assessment point 3
Trainee/Group xx
Assessment point 4
Total score
Server Intelligent O&M Guide Slides
Page 2 Copyright © 2019 Huawei Technologies Co., Ltd. All rights reserved.
Background
Introduction
To improve work efficiency, eliminate duplicate tasks, and reduce error risks,
company Z requires that the modification of the servers on the live network
be minimized. Therefore, Ansible is selected from the four mainstream O&M
automation tools (Puppet, SaltStack, Chef, and Ansible) to automate O&M
management.
Objectives
Upon completion of this course, you will be able to:
Topology
Host 01
Host 03
Contents
1. Case Background
Installing and Configuring Ansible
Discussion Objectives
Task 1: Confirm the environment
Task 2: Install Ansible
Task 3: Install Python and log in to the system using SSH without a password
Task 4: Configure the Controlled Hosts
Form of Discussion
Activity 1: Group discussion
Activity 2: Group presentation
Activity 3: Comments on each other
Case Study
Task 1: Confirm Service Environment
Server
Managed end
[Reference Answer]
Managed end    CentOS 7.2    192.168.1.101    Yes
                             192.168.1.102    Yes
                             192.168.1.103    Yes
Task 2: Install Ansible Using Yum Commands on the Control End
[Reference Answer]
CentOS (Yum)
2. Install Ansible.
$ sudo yum install -y ansible
Task 3: Install Python and Configure SSH Login Without a Password
[Reference Answer]
1. Use Yum to install SSH and Python on all nodes.
$ sudo yum install -y openssh-server python
Task 4: Modify the ansible.cfg Configuration File and Configure the
Controlled Hosts
[Reference Answer]
# vi /etc/ansible/ansible.cfg
[defaults]
inventory = /etc/ansible/hosts
forks = 5
remote_port = 22
host_key_checking = False
timeout = 10
log_path = /var/log/ansible.log
private_key_file = /root/.ssh/id_rsa

# The privilege escalation user is read from [privilege_escalation], not [defaults].
[privilege_escalation]
become_user = root
# cat /etc/ansible/hosts
[webservers]
192.168.1.101
192.168.1.102
192.168.1.103
Solution Architecture Design
Discussion Objectives
Task 1: Test the connectivity
Task 2: Check the NIC information
Task 3: Execute the remote script
Task 4: Copy file remotely
Form of Discussion
Activity 1: Group discussion
Activity 2: Group presentation
Activity 3: Comments on each other
Case Study
Managing Servers in Batches Using the
ad-hoc Command
Task 1: Test the Connectivity of All Remote Host Group Webservers
[Reference Answer]
Task 2: Check the Information about eth0 of the Remote Host Group
Webservers
[Reference Answer]
[root@localhost ~]# ansible webservers -m command -a 'ip addr show dev eth0'
Task 3: Run the Remote Host Script test.sh
[Reference Answer]
[root@localhost ~]# ansible webservers -m shell -a "/home/test.sh"
Note: The /home/test.sh script must exist on the remote host and have the
execution permission.
# more test.sh
echo "Welcome to Huawei Cloud"
chmod 777 test.sh
Task 4: Copy the test.sh File from the Control End to the /tmp/ Directory on
the Target Host, and Set the Owner and Group of the File to root with the File
Permission rwxr-xr-x
[Reference Answer]
[root@localhost ~]# ansible webservers -m copy -a "src=/home/test.sh dest=/tmp/ owner=root group=root mode=0755"
Task 5: Check the uid and gid Information of the /etc/sysctl.conf File on the
Remote Host Group Webservers
[Reference Answer]
Task 6: Install HTTPD on All Remote Host Group Webservers
[Reference Answer]
[root@localhost ~]# ansible webservers -m yum -a "name=httpd state=latest disable_gpg_check=yes enablerepo=epel"
Task 7: Enable the HTTP Service for the Remote Host Group Webservers and
Check the Service Status
[Reference Answer]
#Enable the service:
[root@localhost ~]# ansible webservers -m service -a "name=httpd state=restarted"
Task 8: Create and Delete the /home/f1 File on the Remote Server Group
Webservers
[Reference Answer]
[root@localhost ~]# ansible webservers -m file -a 'path=/home/f1 state=touch'
[root@localhost ~]# ansible webservers -m file -a 'path=/home/f1 state=absent'
Contents
1. Case Background
Solution Implementation
Case    Discussion Objectives                                  Form of Discussion
        Task 1: Deploy Nginx automatically using a playbook    Activity 1: Group discussion
                                                               Activity 2: Group presentation
Deploying Nginx Using a Playbook
Task 1: Deploy Nginx Automatically Using a Playbook
[Reference Answer]
# main.yml
---
- hosts: webservers
  tasks:
    - name: Add repo
      yum_repository:
        name: nginx
        description: nginx repo
        baseurl: http://nginx.org/packages/centos/7/$basearch/
        gpgcheck: no
        enabled: 1
    - name: Install nginx
      yum:
        name: nginx
        state: latest
    - name: Start nginx
      service:
        name: nginx
        state: started
Summary
Three experiment scenarios:
Installing and Configuring Ansible
Quiz
1. Which of the following are Ansible modules?
A. copy
B. command
C. file
D. yum
More Information
https://docs.ansible.com/
Revision Record
Course Code Product Product Version Course Version
For Trainees
Issue 1.0
Contents
Reference documents:
1. https://support.huawei.com/enterprise/en/index.html
2. https://e.huawei.com/en/
Industry Solution Practice Guide for Trainees Page 5
2.2 Objectives
Understand the characteristics and components of the HPC solution.
Understand how to select device models.
Understand how to design the network of a small- and medium-sized HPC cluster.
Understand the delivery process of an HPC basic environment.
Understand the HPC project acceptance process.
2.3 Background
Note: The case in this document is for reference only. The actual configuration may
vary. For details, see the corresponding product documentation.
With the rapid development of computer technology and the national economy, HPC has
become a necessary tool for scientific research and plays an important role in
various basic disciplines and production systems. HPC has been applied in industrial
simulation, teaching and scientific research, energy exploration, weather forecasting,
and other fields.
Based on the project survey, M company decides to deploy an HPC cloud simulation
platform. You are the implementation engineer of this project and need to complete
several basic tasks.
This section describes the acceptance scope of the HPC solution implementation
service, including:
1. Devices involved in the project, such as servers, storage devices, and network
switching devices
2. Software involved in the project, such as OSs, parallel file system software,
application environment software, and cluster management software
According to the HPC solution design and implementation requirements, the Huawei
HPC solution is deployed in equipment room A. The solution provides a complete
service running platform, an HPC cloud simulation platform, centralized management
and scheduling services, and unified storage space. Huawei provides the overall
solution design, software and hardware installation service, commissioning service,
and acceptance service.
2.4 Tasks
You are an engineer. Compare HPC and common computing such as server
virtualization in terms of computing, storage, and networking.
Question
What are the differences between HPC and common computing in terms of
computing, storage, and networking?
[Fill-in table: component names for numbers 1 to 11]
2. Fill in the table with the component names of the Atlas G5500 & G560 V5.
[Fill-in table: component names for numbers 1 to 4]
3. Fill in the table with the component names of the FusionServer Pro 2488H V5.
[Fill-in table: component names for numbers 1 to 12]
FlexIO card 1
FlexIO card 2
Logical diagram:
CE8861 S5720
Switch ports:
S5720: ports 1 to 48
CE8861: ports 1 to 8 and ports 1 to 24
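Network Plane       Product                  Device Node  Port
Computing network   2488 V5 fat node         /            25GE port 1
IPMI network        OceanStor 9000           P12X-1       MGMT
IPMI network        OceanStor 9000           P12X-2       MGMT
IPMI network        OceanStor 9000           P12X-3       MGMT
IPMI network        TaiShan X6000            XA320C-1     MGMT
IPMI network        TaiShan X6000            XA320C-2     MGMT
IPMI network        TaiShan X6000            XA320C-3     MGMT
IPMI network        TaiShan X6000            XA320C-4     MGMT
IPMI network        2488 V5 fat node         /            MGMT
IPMI network        1288 V5 management node  /            MGMT
Management network  OceanStor 9000           P12X-1       GE port 1
Management network  OceanStor 9000           P12X-2       GE port 1
Management network  OceanStor 9000           P12X-3       GE port 1
Management network  TaiShan X6000            XA320C-1     GE port 1
Management network  TaiShan X6000            XA320C-2     GE port 1
Management network  TaiShan X6000            XA320C-3     GE port 1
Management network  2488 V5 fat node         /            GE port 1
Management network  1288 V5 management node  /            GE port 1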
2. Which field shows the final result of the floating-point computing test?
Assessment point 1
Assessment point 2
XXX Case
Total score
Revision Record
Course Code Product Product Version Course Version
For Trainers
Issue 1.0
Contents
Reference documents:
1. https://support.huawei.com/enterprise/en/index.html
2. https://e.huawei.com/en/
Huawei WLAN Certification Training Lab Guide Page 5
The detailed procedure for this drill is described in the following table.
3. Group trainees.
This course is a case study based on the HPC knowledge covered earlier. In recent
years, universities in China have been undertaking more scientific research tasks and
have stronger requirements for the computing efficiency of complex tasks. HPC, which in
the past was used only by a few scientific research institutions, has become necessary
infrastructure for many universities. The case study focuses on the requirement
analysis, network planning, delivery and implementation, and acceptance testing of a
specific project. Working through the case consolidates what has been learned so far.
2.3 Objectives
Understand the characteristics and components of the HPC solution.
Understand how to select device models.
2. Divide trainees into two to four groups of three to five trainees each.
3. Move the tables in the classroom to divide areas by groups, and print the group
number plates of each group.
(The following table lists the drill rules. Trainers can flexibly adjust the rules based on
actual situations.)
1. This experiment manual covers four scenarios. The full score for each scenario is
20 points, and the total score is 80 points. For details about the scoring rules, see
section 2.8 and section 2.9.
2. In each drill scenario, group members discuss questions or finish tasks in actual
operations. Each group sends a group member to present the results.
3. Trainees in each group can ask questions and make comments on the
presentations of other groups.
4. After the drill in each scenario is complete, the trainer compares the output of
each group, selects the best group, and scores each group based on the scoring
rules.
2.5 Background
Note: The case in this document is for reference only. The actual configuration may
vary. For details, see the corresponding product documentation.
With the rapid development of computer technology and the national economy, HPC has
become a necessary tool for scientific research and plays an important role in
various basic disciplines and production systems. HPC has been applied in industrial
simulation, teaching and scientific research, energy exploration, weather forecasting,
and other fields.
Based on the project survey, M company decides to deploy an HPC cloud simulation
platform. You are the implementation engineer of this project and need to complete
several basic tasks.
This section describes the acceptance scope of the HPC solution implementation
service, including:
Devices involved in the project, such as servers, storage devices, and network
switching devices
Software involved in the project, such as OSs, parallel file system software,
application environment software, and cluster management software
Tools involved in the project, such as FusionServer Tools
According to the HPC solution design and implementation requirements, the Huawei
HPC solution is deployed in equipment room A. The solution provides a complete
service running platform, an HPC cloud simulation platform, centralized management
and scheduling services, and unified storage space. Huawei provides the overall
solution design, software and hardware installation service, commissioning service,
and acceptance service.
You are an engineer. Compare HPC and common computing such as server
virtualization in terms of computing, storage, and networking.
Question
What are the differences between HPC and common computing in terms of
computing, storage, and networking?
[Key]
Three-plane networking:

Type: SMP compute node (fat node)
Characteristics: 4-socket or 8-socket servers with large memory capacity
Application scenario: Applicable to scenarios demanding large memory of a single node.
Generally, the memory size is greater than 512 GB.

Type: NAS (NFS)
Characteristics: Uses a storage-type server to deploy the NFS server; small capacity and
relatively low performance. For example, deploy the NFS server by using RH2288 V3.
Application scenario: Applicable to scenarios that are not demanding on performance,
usually small projects with a budget of about CNY X00k for the HPC system. The NFS
server can be deployed on management nodes.

Type: NAS (unified storage)
Characteristics: Directly uses NAS or unified storage to provide services; supports NFS
and CIFS, and provides large capacity and relatively high performance, for example, the
OceanStor V3 unified storage.
Application scenario: Applicable to HPC systems with budgets below CNY 2 million and
without expansion plans; the required performance is less than 2 GB/s. Also applicable
to systems with Windows clients accessing the storage.

Type: Xyratex
Characteristics: Dedicated storage with integrated software and hardware, delivering
the highest performance in the industry
Application scenario: For ultra-large projects (20 GB/s or above), Xyratex is preferred.
The network system can be classified into four types: out-of-band management
network, management network, computing network, and storage network. The
following table describes their characteristics.
Type Characteristics
Drill Rules
After discussion, each group summarizes the discussion results and sends a group
member to the stage to explain the conclusions of the group. The trainer guides
trainees in each group to ask questions and make comments. The key evaluation
factors are as follows:
The task score is 10 points. Points will be deducted if questions are not fully answered.
Select the best team by comparing their output. This team adds 1 point to their total
score.
[Fill-in table: component names for numbers 1 to 11]
[Key]
2. Fill in the table with the component names of the Atlas G5500 & G560 V5.
[Fill-in table: component names for numbers 1 to 4]
[Key]
3. Fill in the table with the component names of the FusionServer Pro 2488H V5.
[Fill-in table: component names for numbers 1 to 12]
[Key]
7   Management network port    8   Serial port
11  PSU 1                      12  PSU 2
FlexIO card 1
FlexIO card 2
Logical diagram:
CE8861 S5720
[Key]
CE8861 S5720
Management/IPMI
Computing/Network
Switch ports:
S5720: ports 1 to 48
CE8861: ports 1 to 8 and ports 1 to 24
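Network Plane       Product                  Device Node  Port
Computing network   2488 V5 fat node         /            25GE port 1
IPMI network        OceanStor 9000           P12X-1       MGMT
IPMI network        OceanStor 9000           P12X-2       MGMT
IPMI network        OceanStor 9000           P12X-3       MGMT
IPMI network        TaiShan X6000            XA320C-2     MGMT
IPMI network        TaiShan X6000            XA320C-3     MGMT
IPMI network        TaiShan X6000            XA320C-4     MGMT
IPMI network        2488 V5 fat node         /            MGMT
IPMI network        1288 V5 management node  /            MGMT
Management network  OceanStor 9000           P12X-1       GE port 1
Management network  OceanStor 9000           P12X-2       GE port 1
Management network  OceanStor 9000           P12X-3       GE port 1
Management network  TaiShan X6000            XA320C-1     GE port 1
Management network  TaiShan X6000            XA320C-2     GE port 1
Management network  TaiShan X6000            XA320C-3     GE port 1
Management network  2488 V5 fat node         /            GE port 1
Management network  1288 V5 management node  /            GE port 1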
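[Key]
Network Plane       Product                  Device Node  Port          Switch  Switch Port
Computing network   2488 V5 fat node         /            25GE port 1   CE8861  25GE 2/5
IPMI network        2488 V5 fat node         /            MGMT          S5720   GE 9
IPMI network        1288 V5 management node  /            MGMT          S5720   GE 10
Management network  2488 V5 fat node         /            GE port 1     S5720   GE 19
Management network  1288 V5 management node  /            GE port 1     S5720   GE 20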
Drill Rules
After discussion, each group summarizes the discussion results and sends a group
member to the stage to explain the conclusions of the group. The trainer guides
trainees in each group to ask questions and make comments. The key evaluation
factors are as follows:
Whether the part names are correct
The task score is 10 points. Points will be deducted if questions are not fully answered.
Select the best team by comparing their output. This team adds 1 point to their total
score.
2. Which field shows the final result of the floating-point computing test?
[Key]
1. For details, see the HPC Solution TaiShan Platform CPU Linpack Test Guide.
2. WC00C2R2
[Key]
For details, see the HPC Solution TaiShan Platform IOR Test Guide.
Drill Rules
After discussion, each group summarizes the discussion results and sends a group
member to the stage to explain the conclusions of the group. The trainer guides
trainees in each group to ask questions and make comments. The key evaluation
factors are as follows:
The task score is 10 points. Points will be deducted if questions are not fully answered.
Select the best team by comparing their output. This team adds 1 point to their total
score.
1. Prepare the following props in advance: large blank paper (five pieces for each
group), marker pens of three colors (one set for each group), and stickers (10
pieces for each group).
2. Each trainee needs a copy of the case background information. Print the materials
before class.
3. Each trainee needs a copy of the networking diagram. Print the materials before
class.
Assessment point 1
Assessment point 2
XXX Case
Total score
Industry Solution Practice Guide
HPC Scenario
Copyright © 2019 Huawei Technologies Co., Ltd. All rights reserved.
Contents
1. Background
2. Discussion on HPC
3. Device Connection
4. Acceptance Test
Background
With the rapid development of computer technology and the national economy,
high-performance computing (HPC) has become a necessary tool for scientific research
and is playing an important role in various basic disciplines and production systems.
HPC has been applied in industrial simulation, teaching and scientific research,
energy exploration, weather forecasting, and other fields.
Objectives
Understand the characteristics and components of the HPC solution.
Understand how to select device models.
Understand how to design the network of a small- and medium-sized HPC
cluster.
Understand the delivery process of an HPC basic environment.
Understand the HPC project acceptance process.
Differences Between HPC and Common Computing
Background
You are an engineer. Compare HPC and common computing such as server
virtualization in terms of computing, storage, and networking without considering
the software.
Task 1
What are the differences between HPC and common computing in terms of
computing, storage, and networking?
Key to the HPC Discussion
An HPC system consists of the management network, computing network, and
storage network, including compute nodes, fat nodes, acceleration nodes,
management nodes, login nodes, and parallel file systems.
Three types of compute nodes:
Compute nodes (thin nodes): high-performance blade servers or rack servers
Fat nodes: SMP high-performance servers with multiple processors and large
memory capacity
GPU compute nodes: use GPGPU cards for GPU computing acceleration
Three-plane networking:
1. Computing network: used for message transmission during computing
2. Management network: used for cluster system management
3. Storage network: used for storage or data transmission
Storage types:

Type: NAS (NFS)
Characteristics: Uses a storage-type server to deploy the NFS server; small capacity and
relatively low performance. For example, deploy the NFS server by using RH2288 V3.
Application scenario: Applicable to small projects that do not require high performance.

Type: NAS (unified storage)
Characteristics: Directly uses NAS or unified storage to provide services; supports NFS
and CIFS, and provides large capacity and relatively high performance, for example, the
OceanStor V3 unified storage.
Application scenario: Applicable to HPC systems with budgets below CNY 2 million and
without expansion plans, with required performance of less than 2 GB/s. Also applicable
to systems with Windows clients accessing the storage.

Type: Parallel storage (Lustre storage)
Characteristics: Uses RH2288 servers and OceanStor V3 FC SAN with the Intel Lustre file
system. The system provides high performance and good scalability. The native system
supports only Linux clients.
Application scenario: Applicable to projects with budgets of over CNY 2 million for the
HPC system, with required performance of 2 GB/s to 20 GB/s, where all nodes accessing
the storage in the cluster are Linux systems.
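The selection guidance in the storage key can be read as a simple decision rule. The following is a rough sketch only: the function name suggest_storage, its parameters, the small_project flag, and the order of the checks are assumptions made for illustration. The thresholds (CNY 2 million, 2 GB/s, 20 GB/s) are the ones quoted in the key, and the Xyratex branch reflects the trainer key entry for ultra-large projects.

```python
def suggest_storage(throughput_gbs, budget_cny, windows_clients=False,
                    small_project=False):
    """Suggest a storage type from the key's rules of thumb (illustrative only)."""
    if throughput_gbs >= 20:
        # Ultra-large projects: 20 GB/s or above.
        return "Xyratex"
    if throughput_gbs >= 2 and budget_cny > 2_000_000 and not windows_clients:
        # 2 GB/s to 20 GB/s, budget over CNY 2 million, Linux clients only.
        return "Lustre storage"
    if throughput_gbs < 2 and small_project and not windows_clients:
        # Small projects that do not require high performance.
        return "NFS"
    if throughput_gbs < 2 and (budget_cny < 2_000_000 or windows_clients):
        # Below 2 GB/s, budget under CNY 2 million, or Windows clients (CIFS).
        return "Unified storage"
    # The key does not cover every combination; flag those for manual review.
    return "No direct match; review the requirements"


print(suggest_storage(5, 3_000_000))
print(suggest_storage(1, 1_500_000, windows_clients=True))
```

Combinations that the key does not cover, such as a 5 GB/s requirement with a budget below CNY 2 million, deliberately fall through to the manual-review result rather than being forced into a category.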
Device Connection
Background
The compute nodes, network devices, and storage devices have been
selected. Some devices have no FlexIO card. Select FlexIO cards and
fill in the physical connection planning table.
Task 1 Identifying Components
Fill in the table with component names corresponding to the numbers in the device rear view.
Step 1:
Rear view of the TaiShan X6000 & XA320C
[Fill-in table: component names for numbers 1 to 11]
Key:
Step 2:
Rear view of the Atlas G5500 & G560 V5
[Fill-in table: component names for numbers 1 to 4]
Key:
Step 3:
Rear view of the FusionServer Pro 2488H V5
[Fill-in table: component names for numbers 1 to 12]
Key:
7   Management network port    8   Serial port
9   VGA port                   10  PCIe slots (slots 3 to 11 from left to right)
11  PSU 1                      12  PSU 2
Task 2 Adding Interface Cards
Insert the following two FlexIO cards into the G5500 server and the FusionServer Pro
2488 server respectively, and provide the schematic diagram.
IN200 Intelligent Ethernet NIC, Standard NIC 4 x 10GE or 4 x 25GE FlexIO card
Key:
Task 3 Designing Logical Connections
Design the logical connections of the devices by drawing lines.
CE8861 S5720
Key:
CE8861 S5720
Management/IPMI
Computing/Network
P12X-1 P12X-2 P12X-3
Task 4 Planning Physical Connections
After the logical connections are designed, plan the physical connections and fill in the table.
Switch ports:
S5720: ports 1 to 48
CE8861: ports 1 to 8 and ports 1 to 24
Rear view of a storage node:
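Fill in the connection planning table on the manual.
Network Plane       Product                  Device Node  Port
Storage network     OceanStor 9000           P12X-1       Slot 1-0
Storage network     OceanStor 9000           P12X-2       Slot 1-0
Storage network     OceanStor 9000           P12X-3       Slot 1-0
Computing network   TaiShan X6000            XA320C-1     100GE port 1
Computing network   TaiShan X6000            XA320C-2     100GE port 1
Computing network   TaiShan X6000            XA320C-3     100GE port 1
Computing network   TaiShan X6000            XA320C-4     100GE port 1
Computing network   Atlas G5500              G560 V5      25GE port 1
Computing network   2488 V5 fat node         /            25GE port 1
IPMI network        OceanStor 9000           P12X-1       MGMT
IPMI network        OceanStor 9000           P12X-2       MGMT
IPMI network        OceanStor 9000           P12X-3       MGMT
IPMI network        TaiShan X6000            XA320C-1     MGMT
IPMI network        TaiShan X6000            XA320C-2     MGMT
IPMI network        TaiShan X6000            XA320C-3     MGMT
IPMI network        TaiShan X6000            XA320C-4     MGMT
IPMI network        Atlas G5500              G560 V5      MGMT
IPMI network        2488 V5 fat node         /            MGMT
IPMI network        1288 V5 management node  /            MGMT
Management network  OceanStor 9000           P12X-1       GE port 1
Management network  OceanStor 9000           P12X-2       GE port 1
Management network  OceanStor 9000           P12X-3       GE port 1
Management network  TaiShan X6000            XA320C-1     GE port 1
Management network  TaiShan X6000            XA320C-2     GE port 1
Management network  TaiShan X6000            XA320C-3     GE port 1
Management network  TaiShan X6000            XA320C-4     GE port 1
Management network  Atlas G5500              G560 V5      GE port 1
Management network  2488 V5 fat node         /            GE port 1
Management network  1288 V5 management node  /            GE port 1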
Key:
Network Plane       Product                  Device Node  Port          Switch  Switch Port
Storage network     OceanStor 9000           P12X-1       Slot 1-0      CE8861  25GE 2/1
Storage network     OceanStor 9000           P12X-2       Slot 1-0      CE8861  25GE 2/2
Storage network     OceanStor 9000           P12X-3       Slot 1-0      CE8861  25GE 2/3
Computing network   TaiShan X6000            XA320C-1     100GE port 1  CE8861  100GE 1/1
Computing network   TaiShan X6000            XA320C-2     100GE port 1  CE8861  100GE 1/2
Computing network   TaiShan X6000            XA320C-3     100GE port 1  CE8861  100GE 1/3
Computing network   TaiShan X6000            XA320C-4     100GE port 1  CE8861  100GE 1/4
Computing network   Atlas G5500              G560 V5      25GE port 1   CE8861  25GE 2/4
Computing network   2488 V5 fat node         /            25GE port 1   CE8861  25GE 2/5
IPMI network        OceanStor 9000           P12X-1       MGMT          S5720   GE 1
IPMI network        OceanStor 9000           P12X-2       MGMT          S5720   GE 2
IPMI network        OceanStor 9000           P12X-3       MGMT          S5720   GE 3
IPMI network        TaiShan X6000            XA320C-1     MGMT          S5720   GE 4
IPMI network        TaiShan X6000            XA320C-2     MGMT          S5720   GE 5
IPMI network        TaiShan X6000            XA320C-3     MGMT          S5720   GE 6
IPMI network        TaiShan X6000            XA320C-4     MGMT          S5720   GE 7
IPMI network        Atlas G5500              G560 V5      MGMT          S5720   GE 8
IPMI network        2488 V5 fat node         /            MGMT          S5720   GE 9
IPMI network        1288 V5 management node  /            MGMT          S5720   GE 10
Management network  OceanStor 9000           P12X-1       GE port 1     S5720   GE 11
Management network  OceanStor 9000           P12X-2       GE port 1     S5720   GE 12
Management network  OceanStor 9000           P12X-3       GE port 1     S5720   GE 13
Management network  TaiShan X6000            XA320C-1     GE port 1     S5720   GE 14
Management network  TaiShan X6000            XA320C-2     GE port 1     S5720   GE 15
Management network  TaiShan X6000            XA320C-3     GE port 1     S5720   GE 16
Management network  TaiShan X6000            XA320C-4     GE port 1     S5720   GE 17
Management network  Atlas G5500              G560 V5      GE port 1     S5720   GE 18
Management network  2488 V5 fat node         /            GE port 1     S5720   GE 19
Management network  1288 V5 management node  /            GE port 1     S5720   GE 20
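A physical connection plan like the key above is easy to get wrong by assigning one switch port twice. As a minimal sketch, the following check encodes the key as rows and reports any switch port that appears more than once. The row layout and the helper name duplicate_switch_ports are assumptions for illustration; the port assignments are copied from the key, with the S5720 rows generated in node order (IPMI on GE 1 to GE 10, management on GE 11 to GE 20).

```python
from collections import Counter

# Nodes in the order the key uses for the S5720 switch.
NODES = ["P12X-1", "P12X-2", "P12X-3",
         "XA320C-1", "XA320C-2", "XA320C-3", "XA320C-4",
         "G560 V5", "2488 V5 fat node", "1288 V5 management node"]

# Rows: (network plane, node, node port, switch, switch port).
plan = [
    ("Storage", "P12X-1", "Slot 1-0", "CE8861", "25GE 2/1"),
    ("Storage", "P12X-2", "Slot 1-0", "CE8861", "25GE 2/2"),
    ("Storage", "P12X-3", "Slot 1-0", "CE8861", "25GE 2/3"),
    ("Computing", "XA320C-1", "100GE port 1", "CE8861", "100GE 1/1"),
    ("Computing", "XA320C-2", "100GE port 1", "CE8861", "100GE 1/2"),
    ("Computing", "XA320C-3", "100GE port 1", "CE8861", "100GE 1/3"),
    ("Computing", "XA320C-4", "100GE port 1", "CE8861", "100GE 1/4"),
    ("Computing", "G560 V5", "25GE port 1", "CE8861", "25GE 2/4"),
    ("Computing", "2488 V5 fat node", "25GE port 1", "CE8861", "25GE 2/5"),
]

# IPMI uses S5720 GE 1 to GE 10 and management uses GE 11 to GE 20.
for i, node in enumerate(NODES, start=1):
    plan.append(("IPMI", node, "MGMT", "S5720", f"GE {i}"))
    plan.append(("Management", node, "GE port 1", "S5720", f"GE {i + 10}"))


def duplicate_switch_ports(rows):
    """Return the (switch, port) pairs that are assigned more than once."""
    counts = Counter((switch, port) for _, _, _, switch, port in rows)
    return sorted(pair for pair, n in counts.items() if n > 1)


print(len(plan), "connections; duplicates:", duplicate_switch_ports(plan))
```

The same check can be run on a trainee's filled-in table before cabling, by replacing the rows with the planned assignments.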
Acceptance Test
Background
You are the acceptance engineer of the project. You need to complete the
acceptance of the project after the cluster software configuration and
storage configuration are complete.
Task 1 Testing the Cluster HPL Performance
1. What are the steps for testing the cluster HPL performance?
2. Which field shows the final result of the floating-point computing test?
Key:
1. For details, see the HPC Solution TaiShan Platform CPU Linpack Test Guide.
2. WC00C2R2
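In standard HPL output, each result row starts with the T/V code of the test variant (WC00C2R2 in this key) and ends with the measured Gflops value. As a small sketch, assuming that standard column order (T/V, N, NB, P, Q, Time, Gflops), the snippet below extracts the fields from one result row. The helper name parse_hpl_result is hypothetical, and the sample row is made-up illustrative data, not a measurement from this cluster.

```python
def parse_hpl_result(line):
    """Split one HPL result row into named fields (standard column order assumed)."""
    tv, n, nb, p, q, time_s, gflops = line.split()
    return {
        "T/V": tv,
        "N": int(n),
        "NB": int(nb),
        "P": int(p),
        "Q": int(q),
        "Time": float(time_s),
        "Gflops": float(gflops),
    }


# Made-up sample row for illustration only; real values come from the HPL run.
sample = "WC00C2R2       40000   192     2     2             100.00              4.267e+02"

result = parse_hpl_result(sample)
print(result["T/V"], result["Gflops"])
```

For the actual test procedure and the acceptance thresholds, follow the HPC Solution TaiShan Platform CPU Linpack Test Guide.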
Task 2 Testing the Performance of the File System
What are the steps for testing the file system?
Key:
For details, see the HPC Solution TaiShan Platform IOR Test Guide.
Summary
This course covers the following contents:
1. Background
2. Discussion on HPC
3. Device Connection
4. Acceptance Test
Learn the server device models and basic networking rules by finishing tasks.
References and Tools
Reference documents:
1. HPC Solution V100R001C08 HPL Performance Test Guide
2. HPC Solution Deployment Guide
3. HPC Solution TaiShan Platform OpenHPC Installation and Deployment Guide
4. HPC Solution TaiShan Platform CPU Linpack Test Guide
5. HPC Solution STREAM Test Guide
6. HPC Solution TaiShan Platform IOR Test Guide
For details, see the following links:
https://support.huawei.com/enterprise/en/index.html
https://e.huawei.com/en/