HCIA-Intelligent Computing V1.0 Lab Guide
Issue: 1.0
No part of this document may be reproduced or transmitted in any form or by any means without
prior written consent of Huawei Technologies Co., Ltd.
HUAWEI and other Huawei trademarks are trademarks of Huawei Technologies Co., Ltd.
All other trademarks and trade names mentioned in this document are the property of their
respective holders.
Notice
The purchased products, services and features are stipulated by the contract made between Huawei
and the customer. All or part of the products, services and features described in this document may
not be within the purchase scope or the usage scope. Unless otherwise specified in the contract, all
statements, information, and recommendations in this document are provided "AS IS" without
warranties, guarantees or representations of any kind, either expressed or implied.
The information in this document is subject to change without notice. Every effort has been made in
the preparation of this document to ensure accuracy of the contents, but all statements, information,
and recommendations in this document do not constitute a warranty of any kind, express or implied.
Website: https://e.huawei.com/en
Reference documents:
Software:
BIOS
iBMC
Reference links:
https://support.huawei.com/enterprise/en/doc/EDOC1100019358/
https://e.huawei.com/en
HCIA – Management Software Operation Guide for Trainees Page 5
2.2 Objectives
After the course, the trainees will be able to:
2.4 Tasks
[Task Overview]-Task Flowchart
Background
The monitoring and O&M of the seismic monitoring platform is not intelligent. For
example, faults need to be identified and rectified manually one by one, which results
in high labor and material costs.
Suppose you have a Huawei rack server. Log in to the iBMC web user interface (WebUI),
view the server's alarms and logs, and perform system configuration and management.
Question
How do you perform operations on the iBMC CLI?
Requirements: Screenshot the key steps for viewing information and configuring the
system, and name the screenshots in the "1.1 iBMC Configure-N" format. N indicates the
sequence number of the screenshot. The screenshots for each question are numbered
from 1.
Evaluation criteria:
1.6 Enable power capping and set the smart cooling mode to High performance mode.
1.10 Mount an image file to the server through the remote console.
Background
RAID is configured to reduce errors and improve the performance and reliability of the
storage system. Generally, RAID needs to be configured for a newly purchased server.
Suppose you have a rack server (configured with an LSI SAS3108 RAID controller card).
Restart the server, access the RAID Configuration Utility, and create a RAID 5 array.
Notice:
During the login process, you are prompted to install and run the Java program.
Follow the prompts. You also need to manually add the iBMC address to the
Exception Site List in the Java Control Panel or lower the Java security level.
Data on a hard disk is deleted when the hard disk is added to a RAID array.
Before creating a RAID array, check that the hard disks contain no data or that
the data is no longer required.
Disks of the same type and specifications must be used in a RAID array.
Question
What are the precautions to be observed when you configure RAID 5? What are the
application scenarios of other RAID levels?
Reference: RAID levels and Huawei V5 Server RAID Controller Card User Guide
RAID 0
RAID 1
RAID 5
RAID 6
RAID 1E
RAID 10
RAID 50
RAID 60
Requirements: Screenshot the key steps and name the screenshots in the "1.1 RAID
Configure-N" format. N indicates the sequence number of the screenshot. The
screenshots for each question are numbered from 1.
Evaluation criteria:
Background
Suppose you have a Huawei rack server. Access the BIOS interface and query the
internal information, including the CPU, memory, and disk information of the server.
Then, set the boot mode of the server.
Question
How do you set the server boot mode to Legacy?
Requirements: Screenshot the key steps and name the screenshots in the "1.1 BIOS
Configure-N" format. N indicates the sequence number of the screenshot. The
screenshots for each question are numbered from 1.
Evaluation criteria:
Case: XXX case
Trainee/group: XXX (trainee/group)
Task 1
Task 2
Task 3
Task 4
Total score
Network diagram and data planning.xlsx
Management Software Operation Guide
Page 2 Copyright © 2019 Huawei Technologies Co., Ltd. All rights reserved.
Contents
1. Background
4. BIOS Configuration
Drill Background
Background
Objectives
After completing this course, you will be able to understand and grasp:
Basic functions of the server management software
Application scenarios of different RAID levels
iBMC, RAID, and BIOS operation processes
V5 Rack Server Management Software Deployment
Objectives Forms of Discussion
Task Flowchart
Operations on the iBMC management platform:
View alarm and diagnosis information.
Configure system management settings.
iBMC Management Platform Operations
Objectives:
iBMC functions
Operations on the iBMC Management Platform
Forms of discussion:
Group discussion: 8 minutes
Presentation/group: 3 minutes
Comments: 5 minutes
Operations on the iBMC Management Platform
Background:
The monitoring and O&M of the seismic monitoring platform is not intelligent. For
example, faults need to be identified and rectified manually one by one, which results in
high labor and material costs.
Question:
How do you perform operations on the iBMC command-line interface (CLI)?
[Task Overview]-Task Flowchart
Start End
Task 1: Log in to the iBMC WebUI of a 2288H V5, query the system
information, and fill in the following table.
Processor model
Reference answer
RAID Operations
Background:
RAID is configured to reduce errors and improve the performance and reliability of the
storage system. Generally, RAID needs to be configured for a newly purchased server.
Suppose you have a rack server. Restart the server, access the RAID Configuration Utility,
and create a RAID 5 array.
Question:
What are the precautions to be observed during the configuration of a RAID 5 array? What
are the application scenarios of other RAID levels?
Reference: common RAID types of 2288H V5 servers and Huawei V5 Server RAID Controller Card User
Guide
[Task Overview]-Task Flowchart
Start End
Task 1: Compare RAID levels.
RAID Level   Reliability   Read Performance   Write Performance   Min. Number of Disks   Disk Utilization
RAID 0
RAID 1
RAID 5
RAID 6
RAID 1E
RAID 10
RAID 50
RAID 60
[Reference answer]
RAID Level   Reliability   Read Performance   Write Performance   Min. Number of Disks   Disk Utilization
RAID 0       Low           High               High                2                      100%
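The disk utilization column can be estimated from the number of disks in the array. The following is a minimal sketch, assuming the generic textbook rules for RAID 0, 1, 5, 6, and 10 with identical disks. The minimum disk counts and the names RAID_RULES, usable_capacity, and utilization are illustrative assumptions; a specific RAID controller card may apply different limits, so check the controller card user guide before relying on these values.

```python
# Generic textbook rules (assumption): (minimum disks, disks' worth of usable space).
# A specific RAID controller card may support different minimums.
RAID_RULES = {
    "RAID 0": (2, lambda n: n),       # striping only, no redundancy
    "RAID 1": (2, lambda n: n / 2),   # mirroring
    "RAID 5": (3, lambda n: n - 1),   # one disk's worth of parity
    "RAID 6": (4, lambda n: n - 2),   # two disks' worth of parity
    "RAID 10": (4, lambda n: n / 2),  # striped mirrors
}


def usable_capacity(level, disks, disk_size_tb):
    """Return the usable capacity in TB for an array of identical disks."""
    min_disks, data_disks = RAID_RULES[level]
    if disks < min_disks:
        raise ValueError(f"{level} needs at least {min_disks} disks")
    if level in ("RAID 1", "RAID 10") and disks % 2:
        raise ValueError(f"{level} needs an even number of disks")
    return data_disks(disks) * disk_size_tb


def utilization(level, disks):
    """Return the fraction of raw capacity that remains usable."""
    return usable_capacity(level, disks, 1.0) / disks


if __name__ == "__main__":
    for level, disks in [("RAID 0", 2), ("RAID 1", 2), ("RAID 5", 3),
                         ("RAID 6", 4), ("RAID 10", 4)]:
        print(f"{level}: {disks} disks, {utilization(level, disks):.0%} usable")
```

Under these rules, RAID 5 utilization rises as disks are added, while RAID 1 and RAID 10 stay at half of the raw capacity.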
BIOS Management Platform Operations
Background:
Suppose you have a Huawei rack server. Access the BIOS interface and query the
internal information, including the CPU, memory, and disk information of the
server. Then, set the boot mode of the server.
Question:
How do you set the server boot mode to Legacy?
[Task Overview]-Task Flowchart
Start End
Task 1: Check the disk information and fill in the following table.
SATA controller
Port 0
Port 1
[Reference answer]
Port 0 Enabled
Port 1 Enabled
Task 2: Set the server boot mode to Legacy, write down the operation
procedure, and take a screenshot.
[Reference answer]
1. Log in to the BIOS. For details, see the user guide. Choose Boot > Boot Type, and
press Enter.
2. In the dialog box displayed, choose Legacy.
Summary
Perform initial configuration of V5 rack servers after the study.
Understand the functions and basic working principles of the management
software.
Recommendations
ISSUE: 1.0
Contents
2 Overview
2.1 Course Introduction
Reference links:
https://docs.ansible.com/
https://support-open.huawei.com/en
https://e.huawei.com/en
HCIA - Server Intelligent O&M Guide for Trainees Page 5
2 Overview
2.2 Objectives
Upon completion of this course, you will be able to:
Understand the modes and scenarios of Ansible installation and deployment.
Manage servers in batches using the ad-hoc command of Ansible.
Assume that you are an IT system engineer of company Z, and you need to complete
the following tasks and configuration.
2.4 Tasks
IP address: 192.168.1.100
Host01
Host03
Command
Task 4: Copy the test.sh File from the Control End to the /tmp/
Directory on the Target Host, and Set the Owner and Group of the
File to root with the File Permission rwxr-xr-x
Write the command:
Task 5: Check the uid and gid Information in the /etc/sysctl.conf File
of the Remote Host Group Webservers
Write the command:
Task 7: Enable the HTTP Service for the Remote Host Group
Webservers and Check the Service Status
Write the commands:
Task 8: Create and Delete the /home/f1 File on the Remote Server
Group Webservers
Write the commands:
Case xx
Trainee/Group xx
Assessment point 1
Assessment point 2
Assessment point 3
Assessment point 4
Total score
Server Intelligent O&M Guide Slides
Background
Introduction
To improve work efficiency, eliminate duplicate tasks, and reduce error risks,
company Z requires that the modification of the servers on the live network
be minimized. Therefore, Ansible is selected from the four mainstream O&M
automation tools (Puppet, SaltStack, Chef, and Ansible) to automate O&M
management.
Objectives
Upon completion of this course, you will be able to:
Topology
Host 01
Host 03
Contents
1. Case Background
Installing and Configuring Ansible
Case Study
Discussion Objectives
Task 1: Confirm the environment
Task 2: Install Ansible
Task 3: Install Python and log in to the system using SSH without a password
Task 4: Configure the Controlled Hosts
Form of Discussion
Activity 1: Group discussion
Activity 2: Group presentation
Activity 3: Comments on each other
Task 1: Confirm Service Environment
Server
Managed end
[Reference Answer]
Managed end   CentOS 7.2   192.168.1.101   Yes
Managed end   CentOS 7.2   192.168.1.102   Yes
Managed end   CentOS 7.2   192.168.1.103   Yes
Task 2: Install Ansible Using Yum Commands on the Control End
[Reference Answer]
CentOS (Yum)
2. Install Ansible.
$ sudo yum install -y ansible
Task 3: Install Python and Configure SSH Login Without a Password
[Reference Answer]
1. Install SSH and Python on all nodes using Yum.
$ sudo yum install -y openssh-server python
Task 4: Modify the ansible.cfg Configuration File and Configure the
Controlled Hosts
[Reference Answer]
# vi /etc/ansible/ansible.cfg
[defaults]
inventory = /etc/ansible/hosts
forks = 5
remote_port = 22
host_key_checking = False
timeout = 10
log_path = /var/log/ansible.log
private_key_file = /root/.ssh/id_rsa

[privilege_escalation]
become_user = root

# cat /etc/ansible/hosts
[webservers]
192.168.1.101
192.168.1.102
192.168.1.103
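The inventory above groups three managed hosts under the webservers name that the later ad-hoc commands target. As a rough illustration of that group-to-hosts structure, the sketch below reads the same text with Python's standard configparser module. It is not Ansible's own inventory parser and ignores host variables, ranges, and child groups. The function name hosts_in_group is a hypothetical helper.

```python
import configparser

# Same content as the /etc/ansible/hosts example in this guide.
INVENTORY = """\
[webservers]
192.168.1.101
192.168.1.102
192.168.1.103
"""


def hosts_in_group(inventory_text, group):
    """Return the host entries listed under one inventory group.

    Simplification: treats the inventory as a plain INI file whose
    entries have no values. Real inventories support more syntax.
    """
    parser = configparser.ConfigParser(allow_no_value=True, delimiters=("=",))
    parser.read_string(inventory_text)
    return parser.options(group)


if __name__ == "__main__":
    for host in hosts_in_group(INVENTORY, "webservers"):
        print(host)
```

A command that targets webservers therefore runs against every address returned for that group.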
Solution Architecture Design
Case Study
Discussion Objectives
Task 1: Test the connectivity
Task 2: Check the NIC information
Task 3: Execute the remote script
Task 4: Copy file remotely
Form of Discussion
Activity 1: Group discussion
Activity 2: Group presentation
Activity 3: Comments on each other
Managing Servers in Batches Using the ad-hoc Command
Task 1: Test the Connectivity of All Remote Host Group Webservers
[Reference Answer]
Task 2: Check the Information about eth0 of the Remote Host Group
Webservers
[Reference Answer]
[root@localhost ~]# ansible webservers -m command -a 'ip addr show dev eth0'
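The ad-hoc answers in this section share one shape: a host pattern, a module selected with -m, and module arguments passed with -a. The sketch below only assembles and prints such a command line so the three parts are easy to see. It does not run Ansible, and the helper name adhoc_command is hypothetical.

```python
import shlex


def adhoc_command(pattern, module, args=None):
    """Build an Ansible ad-hoc command line as a shell-quoted string.

    pattern: host or group name from the inventory, for example "webservers"
    module:  module name passed to -m
    args:    optional module argument string passed to -a
    """
    argv = ["ansible", pattern, "-m", module]
    if args:
        argv += ["-a", args]
    # shlex.join quotes any element that contains spaces.
    return shlex.join(argv)


if __name__ == "__main__":
    print(adhoc_command("webservers", "command", "ip addr show dev eth0"))
```

Printing the example reproduces the Task 2 command, which shows that the quoted string after -a is passed to the module as a single argument.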
Task 3: Run the Remote Host Script test.sh
[Reference Answer]
[root@localhost ~]# ansible webservers -m shell -a "/home/test.sh"
Note: The /home/test.sh script must exist on the remote host and have execute
permission.
# more test.sh
echo "Welcome to Huawei Cloud"
# chmod 777 test.sh
Task 4: Copy the test.sh File from the Control End to the /tmp/ Directory on
the Target Host, and Set the Owner and Group of the File to root with the File
Permission rwxr-xr-x
[Reference Answer]
[root@localhost ~]# ansible webservers -m copy -a "src=/home/test.sh dest=/tmp/ owner=root group=root mode=0755"
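The task states the permission as rwxr-xr-x, while the copy module answer passes it as mode=0755. The two notations describe the same permission bits. The following sketch converts between them using only the Python standard library. The function names are hypothetical helpers for illustration.

```python
import stat


def symbolic_to_octal(symbolic):
    """Convert a 9-character string such as 'rwxr-xr-x' to octal text such as '0755'."""
    if len(symbolic) != 9:
        raise ValueError("expected 9 permission characters")
    bits = 0
    for char, expected in zip(symbolic, "rwxrwxrwx"):
        bits <<= 1
        if char == expected:
            bits |= 1
        elif char != "-":
            raise ValueError(f"unexpected character: {char!r}")
    return format(bits, "04o")


def octal_to_symbolic(octal):
    """Convert octal text such as '0755' to a string such as 'rwxr-xr-x'."""
    # stat.filemode() prefixes a file-type character, which is dropped here.
    return stat.filemode(int(octal, 8))[1:]


if __name__ == "__main__":
    print(symbolic_to_octal("rwxr-xr-x"))
    print(octal_to_symbolic("0755"))
```

Each group of three characters maps to one octal digit: owner, group, and others.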
Task 5: Check the uid and gid Information in the /etc/sysctl.conf File of the
Remote Host Group Webservers
[Reference Answer]
Task 6: Install HTTPD on All Remote Host Group Webservers
[Reference Answer]
[root@localhost ~]# ansible webservers -m yum -a "name=httpd state=latest disable_gpg_check=yes enablerepo=epel"
Task 7: Enable the HTTP Service for the Remote Host Group Webservers and
Check the Service Status
[Reference Answer]
#Enable the service:
[root@localhost ~]# ansible webservers -m service -a "name=httpd state=restarted"
Task 8: Create and Delete the /home/f1 File on the Remote Server Group
Webservers
[Reference Answer]
ansible webservers -m file -a 'path=/home/f1 state=touch'
ansible webservers -m file -a 'path=/home/f1 state=absent'
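The two states used above describe a desired end result rather than a one-off action: touch leaves the file present, and absent leaves it missing, whether or not it existed beforehand. The sketch below is a local analogy of that behavior using pathlib. It is not how the file module is implemented, and ensure_state is a hypothetical helper.

```python
from pathlib import Path


def ensure_state(path, state):
    """Bring a local file to the requested state and report whether it exists.

    state="touch":  create the file if missing (or update its timestamp)
    state="absent": remove the file, doing nothing if it is already gone
    """
    target = Path(path)
    if state == "touch":
        target.touch()
    elif state == "absent":
        target.unlink(missing_ok=True)  # requires Python 3.8 or later
    else:
        raise ValueError(f"unsupported state: {state}")
    return target.exists()
```

Calling the function twice with state absent succeeds both times, which mirrors why the delete command can be rerun safely.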
Solution Implementation
Case
Discussion Objectives
Task 1: Deploy Nginx automatically using a playbook
Form of Discussion
Activity 1: Group discussion
Activity 2: Group presentation
Deploying Nginx Using a Playbook
Task 1: Deploy Nginx Automatically Using a Playbook
[Reference Answer]
# main.yml
---
- hosts: webservers
  tasks:
    - name: Add repo
      yum_repository:
        name: nginx
        description: nginx repo
        baseurl: http://nginx.org/packages/centos/7/$basearch/
        gpgcheck: no
        enabled: 1
    - name: Install nginx
      yum:
        name: nginx
        state: latest
    - name: Start nginx
      service:
        name: nginx
        state: started
Summary
Three experiment scenarios:
Installing and Configuring Ansible
Quiz
1. Which of the following are Ansible modules?
A. copy
B. command
C. file
D. yum
More Information
https://docs.ansible.com/
For Trainees
Issue 1.0
Reference documents:
1. https://support.huawei.com/enterprise/en/index.html
2. https://e.huawei.com/en/
Industry Solution Practice Guide for Trainees Page 5
2.2 Objectives
Understand the characteristics and components of the HPC solution.
Understand how to select device models.
Understand how to design the network of a small- and medium-sized HPC cluster.
Understand the delivery process of an HPC basic environment.
Understand the HPC project acceptance process.
2.3 Background
Note: The case in this document is for reference only. The actual configuration may
vary. For details, see the corresponding product documentation.
With the rapid development of computer technology and the national economy, HPC has
become a necessary tool for scientific research and plays an important role in
various basic disciplines and production systems. HPC has been applied in industrial
simulation, teaching and scientific research, energy exploration, weather forecasting,
and other fields.
Based on the project survey, company M decides to deploy an HPC cloud simulation
platform. You are the implementation engineer of this project and need to complete
several basic tasks.
This section describes the acceptance scope of the HPC solution implementation
service, including:
1. Devices involved in the project, such as servers, storage devices, and network
switching devices
2. Software involved in the project, such as OSs, parallel file system software,
application environment software, and cluster management software
According to the HPC solution design and implementation requirements, the Huawei
HPC solution is deployed in equipment room A. The solution provides a complete
service running platform, an HPC cloud simulation platform, centralized management
and scheduling services, and unified storage space. Huawei provides the overall
solution design, software and hardware installation service, commissioning service,
and acceptance service.
2.4 Tasks
You are an engineer. Compare HPC and common computing such as server
virtualization in terms of computing, storage, and networking.
Question
What are the differences between HPC and common computing in terms of
computing, storage, and networking?
1 2
3 4
5 6
7 8
9 10
11 - -
2. Fill in the table with the component names of the Atlas G5500 & G560 V5.
1 2
3 4
3. Fill in the table with the component names of the FusionServer Pro 2488H V5.
1 2
3 4
5 6
7 8
9 10
11 12
FlexIO card 1
FlexIO card 2
Logical diagram:
CE8861 S5720
Switch ports:
S5720: ports 1 to 48 (even numbers in the top row, odd numbers in the bottom row)
CE8861: ports 1 to 8 and ports 1 to 24 (even numbers in the top row, odd numbers in the bottom row)
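Network              Device                       Node        Port
Computing network    2488 V5 fat node             /           25GE port 1
IPMI network         OceanStor 9000               P12X-1      MGMT
IPMI network         OceanStor 9000               P12X-2      MGMT
IPMI network         OceanStor 9000               P12X-3      MGMT
IPMI network         TaiShan X6000                XA320C-1    MGMT
IPMI network         TaiShan X6000                XA320C-2    MGMT
IPMI network         TaiShan X6000                XA320C-3    MGMT
IPMI network         TaiShan X6000                XA320C-4    MGMT
IPMI network         2488 V5 fat node             /           MGMT
IPMI network         1288 V5 management node      /           MGMT
Management network   OceanStor 9000               P12X-1      GE port 1
Management network   OceanStor 9000               P12X-2      GE port 1
Management network   OceanStor 9000               P12X-3      GE port 1
Management network   TaiShan X6000                XA320C-1    GE port 1
Management network   TaiShan X6000                XA320C-2    GE port 1
Management network   TaiShan X6000                XA320C-3    GE port 1
Management network   2488 V5 fat node             /           GE port 1
Management network   1288 V5 management node      /           GE port 1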
2. Which field shows the final result of the floating-point computing test?
XXX Case
Assessment point 1
Assessment point 2
Total score
Industry Solution Practice Guide
HPC Scenario
Contents
1. Background
2. Discussion on HPC
3. Device Connection
4. Acceptance Test
Background
With the rapid development of computer technology and the national economy,
high-performance computing (HPC) has become a necessary tool for scientific research
and is playing an important role in various basic disciplines and production systems.
HPC has been applied in industrial simulation, teaching and scientific research,
energy exploration, weather forecasting, and other fields.
Objectives
Understand the characteristics and components of the HPC solution.
Understand how to select device models.
Understand how to design the network of a small- and medium-sized HPC
cluster.
Understand the delivery process of an HPC basic environment.
Understand the HPC project acceptance process.
Differences Between HPC and Common Computing
Background
You are an engineer. Compare HPC and common computing such as server
virtualization in terms of computing, storage, and networking without considering
the software.
Task 1
What are the differences between HPC and common computing in terms of
computing, storage, and networking?
Key to the HPC Discussion
An HPC system consists of the management network, computing network, and
storage network, including compute nodes, fat nodes, acceleration nodes,
management nodes, login nodes, and parallel file systems.
Three types of compute nodes:
Compute nodes (thin nodes): high-performance blade servers or rack servers
Fat nodes: SMP high-performance servers with multiple processors and large
memory capacity
GPU compute nodes: use GPGPU cards for GPU computing acceleration
Three-plane networking:
1. Computing network: used for message transmission during computing
2. Management network: used for cluster system management
3. Storage network: used for storage or data transmission
Type Characteristics Application Scenario
Type: NFS
Characteristics: Uses a storage-type server to deploy the NFS server; small capacity and
relatively low performance. For example, deploy the NFS server by using RH2288 V3.
Application scenario: Applicable to small projects that do not require high performance.

Type: NAS/unified storage
Characteristics: Directly uses NAS or unified storage to provide services; supports NFS
and CIFS, and provides large capacity and relatively high performance, for example, the
OceanStor V3 unified storage.
Application scenario: Applicable to HPC systems with budgets below CNY2 million and
without expansion plans. Required performance is less than 2 GB/s. Applicable to systems
with Windows clients for accessing the storage.

Type: Lustre storage (parallel storage)
Characteristics: Uses RH2288 servers and OceanStor V3 FC SAN with the Intel Lustre file
system. The system provides high performance and good scalability. The native system
supports only Linux clients.
Application scenario: Applicable to projects with budgets of over CNY2 million for the
HPC system. Required performance is 2 GB/s to 20 GB/s. All nodes accessing the storage in
the cluster are Linux systems.
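The application scenarios above can be read as a set of selection rules. The sketch below encodes only the thresholds stated in the table (CNY2 million, 2 GB/s, 20 GB/s, client operating systems, expansion plans). The class and function names, and the small_low_performance_project flag that stands in for the NFS condition, are hypothetical. Real selection would also weigh factors that the table does not cover.

```python
from dataclasses import dataclass


@dataclass
class StorageNeeds:
    budget_cny: float                    # HPC system budget in CNY
    throughput_gb_s: float               # required storage performance in GB/s
    windows_clients: bool                # True if Windows clients access the storage
    expansion_planned: bool              # True if the system is expected to expand
    small_low_performance_project: bool  # hypothetical flag for the NFS scenario


def candidate_storage_types(needs):
    """Return the storage types whose stated scenario conditions all hold."""
    candidates = []
    if needs.small_low_performance_project:
        candidates.append("NFS")
    if (needs.budget_cny < 2_000_000
            and not needs.expansion_planned
            and needs.throughput_gb_s < 2):
        candidates.append("NAS/unified storage")
    if (needs.budget_cny > 2_000_000
            and 2 <= needs.throughput_gb_s <= 20
            and not needs.windows_clients):
        candidates.append("Lustre storage")
    return candidates


if __name__ == "__main__":
    needs = StorageNeeds(3_000_000, 10, False, True, False)
    print(candidate_storage_types(needs))
```

An empty result means the requirements fall outside every scenario listed in the table, so the choice needs further discussion.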
Type Characteristics
Device Connection
Background
The compute nodes, network devices, and storage devices have been
selected. Some devices have no FlexIO card. Select FlexIO cards and
fill in the physical connection planning table.
Task 1 Identifying Components
Fill in the table with component names corresponding to the numbers in the device rear view.
Step 1:
Rear view of the TaiShan X6000 & XA320C
1 2
3 4
5 6
7 8
9 10
11 - -
Key:
Step 2
Rear view of the Atlas G5500 & G560 V5
1 2
3 4
Key:
Step 3: Rear view of the FusionServer Pro 2488H V5
[Figure: device rear view with callouts 1 to 12. Answer table: blank entries for callouts 1 to 12.]
Key:
7 Management network port
8 Serial port
9 VGA port
10 PCIe slots (slots 3 to 11 from left to right)
11 PSU 1
12 PSU 2
Task 2 Adding Interface Cards
Insert the following two FlexIO cards into the G5500 server and the FusionServer Pro 2488 server, respectively, and provide the schematic diagram.
Card 1: IN200 Intelligent Ethernet NIC, Standard NIC
Card 2: 4 x 10GE or 4 x 25GE FlexIO card
Task 3 Designing Logical Connections
Design the logical connections of the devices by drawing lines.
[Figure: devices to be connected. Switches shown: CE8861, S5720.]
Key:
[Figure: logical connection diagram. Labels shown: CE8861, S5720, Management/IPMI, Computing/Network, P12X-1, P12X-2, P12X-3.]
Task 4 Planning Physical Connections
After the logical connections are designed, plan the physical connections and fill in the table.
Switch ports:
S5720: ports 1 to 48, in two rows (even-numbered ports on top, odd-numbered ports below)
CE8861: two port groups, numbered 1 to 8 and 1 to 24, each in two rows (even-numbered ports on top, odd-numbered ports below)
[Figure: rear view of a storage node]
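... connection planning table ... on the manual.

Network plane        Product                   Device node   Port           Switch   Switch port
Storage network      OceanStor 9000            P12X-1        Slot 1-0
Storage network      OceanStor 9000            P12X-2        Slot 1-0
Storage network      OceanStor 9000            P12X-3        Slot 1-0
Computing network    TaiShan X6000             XA320C-1      100GE port 1
Computing network    TaiShan X6000             XA320C-2      100GE port 1
Computing network    TaiShan X6000             XA320C-3      100GE port 1
Computing network    TaiShan X6000             XA320C-4      100GE port 1
Computing network    Atlas G5500               G560 V5       25GE port 1
Computing network    2488 V5 fat node          /             25GE port 1
IPMI network         OceanStor 9000            P12X-1        MGMT
IPMI network         OceanStor 9000            P12X-2        MGMT
IPMI network         OceanStor 9000            P12X-3        MGMT
IPMI network         TaiShan X6000             XA320C-1      MGMT
IPMI network         TaiShan X6000             XA320C-2      MGMT
IPMI network         TaiShan X6000             XA320C-3      MGMT
IPMI network         TaiShan X6000             XA320C-4      MGMT
IPMI network         Atlas G5500               G560 V5       MGMT
IPMI network         2488 V5 fat node          /             MGMT
IPMI network         1288 V5 management node   /             MGMT
Management network   OceanStor 9000            P12X-1        GE port 1
Management network   OceanStor 9000            P12X-2        GE port 1
Management network   OceanStor 9000            P12X-3        GE port 1
Management network   TaiShan X6000             XA320C-1      GE port 1
Management network   TaiShan X6000             XA320C-2      GE port 1
Management network   TaiShan X6000             XA320C-3      GE port 1
Management network   TaiShan X6000             XA320C-4      GE port 1
Management network   Atlas G5500               G560 V5       GE port 1
Management network   2488 V5 fat node          /             GE port 1
Management network   1288 V5 management node   /             GE port 1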
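Key:

Network plane        Product                   Device node   Port           Switch   Switch port
Storage network      OceanStor 9000            P12X-1        Slot 1-0       CE8861   25GE 2/1
Storage network      OceanStor 9000            P12X-2        Slot 1-0       CE8861   25GE 2/2
Storage network      OceanStor 9000            P12X-3        Slot 1-0       CE8861   25GE 2/3
Computing network    TaiShan X6000             XA320C-1      100GE port 1   CE8861   100GE 1/1
Computing network    TaiShan X6000             XA320C-2      100GE port 1   CE8861   100GE 1/2
Computing network    TaiShan X6000             XA320C-3      100GE port 1   CE8861   100GE 1/3
Computing network    TaiShan X6000             XA320C-4      100GE port 1   CE8861   100GE 1/4
Computing network    Atlas G5500               G560 V5       25GE port 1    CE8861   25GE 2/4
Computing network    2488 V5 fat node          /             25GE port 1    CE8861   25GE 2/5
IPMI network         OceanStor 9000            P12X-1        MGMT           S5720    GE 1
IPMI network         OceanStor 9000            P12X-2        MGMT           S5720    GE 2
IPMI network         OceanStor 9000            P12X-3        MGMT           S5720    GE 3
IPMI network         TaiShan X6000             XA320C-1      MGMT           S5720    GE 4
IPMI network         TaiShan X6000             XA320C-2      MGMT           S5720    GE 5
IPMI network         TaiShan X6000             XA320C-3      MGMT           S5720    GE 6
IPMI network         TaiShan X6000             XA320C-4      MGMT           S5720    GE 7
IPMI network         Atlas G5500               G560 V5       MGMT           S5720    GE 8
IPMI network         2488 V5 fat node          /             MGMT           S5720    GE 9
IPMI network         1288 V5 management node   /             MGMT           S5720    GE 10
Management network   OceanStor 9000            P12X-1        GE port 1      S5720    GE 11
Management network   OceanStor 9000            P12X-2        GE port 1      S5720    GE 12
Management network   OceanStor 9000            P12X-3        GE port 1      S5720    GE 13
Management network   TaiShan X6000             XA320C-1      GE port 1      S5720    GE 14
Management network   TaiShan X6000             XA320C-2      GE port 1      S5720    GE 15
Management network   TaiShan X6000             XA320C-3      GE port 1      S5720    GE 16
Management network   TaiShan X6000             XA320C-4      GE port 1      S5720    GE 17
Management network   Atlas G5500               G560 V5       GE port 1      S5720    GE 18
Management network   2488 V5 fat node          /             GE port 1      S5720    GE 19
Management network   1288 V5 management node   /             GE port 1      S5720    GE 20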
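
A plan like the one in the key can be checked mechanically before cabling. The following Python sketch rebuilds the key as data and looks for switch ports or node ports that are assigned twice. It is a minimal illustration, not a Huawei tool: the function names build_plan and find_conflicts and the tuple layout are assumptions made for this example, and the port assignments are copied from the key.

```python
from collections import Counter

STORAGE_NODES = ["P12X-1", "P12X-2", "P12X-3"]
X6000_NODES = ["XA320C-1", "XA320C-2", "XA320C-3", "XA320C-4"]
OTHER_COMPUTE = ["G560 V5", "2488 V5 fat node"]
ALL_NODES = STORAGE_NODES + X6000_NODES + OTHER_COMPUTE + ["1288 V5 management node"]


def build_plan():
    """Rebuild the key as (plane, node, node_port, switch, switch_port) tuples."""
    plan = []
    # Storage network: storage nodes use CE8861 25GE ports 2/1 to 2/3.
    for i, node in enumerate(STORAGE_NODES, start=1):
        plan.append(("Storage", node, "Slot 1-0", "CE8861", f"25GE 2/{i}"))
    # Computing network: XA320C nodes use CE8861 100GE ports 1/1 to 1/4.
    for i, node in enumerate(X6000_NODES, start=1):
        plan.append(("Computing", node, "100GE port 1", "CE8861", f"100GE 1/{i}"))
    # The G560 V5 and the fat node continue on CE8861 25GE ports 2/4 and 2/5.
    for i, node in enumerate(OTHER_COMPUTE, start=4):
        plan.append(("Computing", node, "25GE port 1", "CE8861", f"25GE 2/{i}"))
    # IPMI network: S5720 GE 1 to GE 10. Management network: GE 11 to GE 20.
    for i, node in enumerate(ALL_NODES, start=1):
        plan.append(("IPMI", node, "MGMT", "S5720", f"GE {i}"))
        plan.append(("Management", node, "GE port 1", "S5720", f"GE {i + 10}"))
    return plan


def find_conflicts(plan):
    """Return switch ports and node ports that appear in more than one link."""
    switch_ports = Counter((switch, port) for _, _, _, switch, port in plan)
    node_ports = Counter((node, port) for _, node, port, _, _ in plan)
    dup_switch = sorted(key for key, count in switch_ports.items() if count > 1)
    dup_node = sorted(key for key, count in node_ports.items() if count > 1)
    return dup_switch, dup_node


if __name__ == "__main__":
    plan = build_plan()
    dup_switch, dup_node = find_conflicts(plan)
    print(f"{len(plan)} links planned")
    print("duplicate switch ports:", dup_switch)
    print("duplicate node ports:", dup_node)
```

The same check can be applied to a trainee's own answer by replacing the output of build_plan with the rows from the filled-in table.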
Acceptance Test
Background
You are the acceptance engineer of the project. You need to complete the
acceptance of the project after the cluster software configuration and
storage configuration are complete.
Task 1 Testing the Cluster HPL Performance
1. What are the steps for testing the cluster HPL performance?
2. Which field shows the final result of the floating-point computing test?
Key:
1. For details, see the HPC Solution TaiShan Platform CPU Linpack Test Guide.
2. WC00C2R2
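
In the standard HPL output, each result row begins with a T/V code such as WC00C2R2, followed by N, NB, P, Q, the run time, and the Gflops value. The following Python sketch shows one way to pull the result rows out of a saved log. It is an illustration under assumptions: the function name parse_hpl_results is hypothetical, and the sample excerpt uses made-up numbers rather than results from this lab.

```python
# Hypothetical excerpt in the layout that HPL prints; the numbers are made up.
SAMPLE_OUTPUT = """\
T/V                N    NB     P     Q               Time                 Gflops
--------------------------------------------------------------------------------
WC00C2R2       40000   192     2     2             215.36              1.981e+02
"""


def parse_hpl_results(text):
    """Return one dict per HPL result row found in the text."""
    results = []
    for line in text.splitlines():
        fields = line.split()
        # A result row has exactly seven columns and a T/V code starting with "W".
        if len(fields) != 7 or not fields[0].startswith("W"):
            continue
        try:
            n, nb, p, q = (int(value) for value in fields[1:5])
            seconds = float(fields[5])
            gflops = float(fields[6])
        except ValueError:
            continue  # not a numeric result row
        results.append({
            "tv": fields[0], "n": n, "nb": nb, "p": p, "q": q,
            "seconds": seconds, "gflops": gflops,
        })
    return results


if __name__ == "__main__":
    for row in parse_hpl_results(SAMPLE_OUTPUT):
        print(row["tv"], row["gflops"], "Gflops")
```

For the actual test procedure and parameters, the referenced Linpack test guide remains the authority.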
Task 2 Testing the Performance of the File System
What are the steps for testing the file system?
Key:
For details, see the HPC Solution TaiShan Platform IOR Test Guide.
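
IOR prints summary lines of the form "Max Write: ... MiB/sec (... MB/sec)" and "Max Read: ... MiB/sec (... MB/sec)" at the end of a run. The following Python sketch extracts these values and compares them with a target bandwidth. It is a minimal example under assumptions: the function names parse_ior_summary and meets_target are hypothetical, the sample numbers are made up, and the 2 GB/s target is used only as an illustrative threshold.

```python
import re

# Hypothetical IOR summary lines; the numbers are made up.
SAMPLE_SUMMARY = """\
Max Write: 1548.32 MiB/sec (1623.53 MB/sec)
Max Read:  2103.77 MiB/sec (2205.96 MB/sec)
"""

SUMMARY_LINE = re.compile(r"Max (Write|Read):\s+([\d.]+) MiB/sec \(([\d.]+) MB/sec\)")


def parse_ior_summary(text):
    """Return {'write': GB/s, 'read': GB/s} taken from the MB/sec figures."""
    return {
        match.group(1).lower(): float(match.group(3)) / 1000.0
        for match in SUMMARY_LINE.finditer(text)
    }


def meets_target(bandwidth_gb_s, target_gb_s):
    """Return, per operation, whether the measured bandwidth reaches the target."""
    return {op: value >= target_gb_s for op, value in bandwidth_gb_s.items()}


if __name__ == "__main__":
    bandwidth = parse_ior_summary(SAMPLE_SUMMARY)
    print(bandwidth)
    print(meets_target(bandwidth, target_gb_s=2.0))
```

The acceptance threshold itself comes from the project requirements, and the test steps come from the referenced IOR test guide.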
Summary
This course covers the following contents:
1. Background
2. Discussion on HPC
3. Device Connection
4. Acceptance Test
By finishing the tasks, you learn the server device models and basic networking rules.
References and Tools
Reference documents:
1. HPC Solution V100R001C08 HPL Performance Test Guide
2. HPC Solution Deployment Guide
3. HPC Solution TaiShan Platform OpenHPC Installation and Deployment Guide
4. HPC Solution TaiShan Platform CPU Linpack Test Guide
5. HPC Solution STREAM Test Guide
6. HPC Solution TaiShan Platform IOR Test Guide
For details, see the following links:
https://support.huawei.com/enterprise/en/index.html
https://e.huawei.com/en/