10.introduction To Data-Parallel Architectures

This document discusses data-parallel architectures. It introduces SIMD (Single Instruction Multiple Data) and describes how one instruction is broadcast to multiple processing elements that each operate on their own data. Connectivity between processing elements is important and can include near-neighbor, tree, pyramid and hypercube topologies. Data-parallel approaches are well-suited for multimedia, image processing, and scientific applications by operating in parallel on large data sets.

Uploaded by

Ashok Ashokbyadav

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

171 views21 pages

10.introduction To Data-Parallel Architectures

Uploaded by

Ashok Ashokbyadav

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 21

10.

Introduction to Data-Parallel architectures

• SIMD {Single Instruction Multiple Data}
• 10.1 Introduction
• 10.2 Connectivity
• 10.3 Alternative architecture

• e.g. add: (c1=a1+b1), (c2=a2+b2), (c3=a3+b3)

TECH
Computer Science
CH01
Introduction
• Single instruction, multiple data
• One instruction stream is broadcast to all processors
• Each processor, also called a processing element (or
PE), is usually simplistic and logically is essentially
an ALU;
– PEs do not store a copy of the program
nor have a program control unit.
• Individual processors can remain idle during
execution of segments of the program (based on a
data test).
• All active processor executes the same instruction
synchronously, but on different data
Cont…
• Technically, on a memory access, all active
processors must access the same location in their
local memory.
This requirement is sometimes relaxed a bit.
• The data items form an array (or vector) and an
instruction can act on the complete array in one cycle.
• Quinn also calls this architecture a processor array.
Some examples include
ILLIAC IV (1974) was the first SIMD computer
The STARAN and MPP (Dr. Batcher architect)
Connection Machine CM2, built by Thinking
Machines).
MasPar (for Massively Parallel) computers.
Data-parallel computation (bit parallel)
Scalar vs. SIMD
Ex:Multimedia Extension Architectures
• At the core of multimedia extensions
– SIMD parallelism
– Variable-sized data fields:
– Vector length = register width / type size
Application of Data-parallel Architectures:
One data entity processed by one PE
Multimedia / Scientific Applications
• Image
– Graphics : 3D games, movies
– Image recognition
– Video encoding/decoding : JPEG, MPEG4
• Sound
– Encoding/decoding: IP phone, MP3
– Speech recognition
– Digital signal processing: Cell phones
• Scientific applications
– Double precision Matrix-Matrix multiplication
(DGEMM)
– Y[] = a*X[] + Y[] (SAXPY)
Connectivity
• Important aspects of the design space of data parallel
computers is the connectivity.

• This connectivity is established between processing

elements(PEs).

• The inter connectivity methods are:

• Near neighbor, tree, pyramid and hypercube—widely
used data parallel connectivity.

• Bus ,crossbar, and multistage networks –used for

functional parallel design
Mapping Problem space into Architectural Space:
Data entity onto PE (1-to-1 mapping)
Near- Neighbor Connectivity

• It is arose in the context of spatial mapping coherent

data on SIMD systems.
• The data is spatially correlated.
• Ex: 8-connected near neighbor case
Near-neighbor connectivity (2-D: Mesh)
Example of a 2-D Processor Interconnection
Network in a Processor Array

Each VLSI chip has 16 processing elements.

Each PE can simultaneously send a value to a neighbor.

PE =
processor
element

13
Trees and Graphs
• Near-Neighbour Connected system is specialized for
image Processing applications.
• Any general problem has a natural expression in the
form of a graph.
• The problems that are represented as hierarchical
structure like database searching, model matching and
expert system can be represented through a tree
connectivity.
• In this many simultaneous processes are executed at a
low level, then progressively fewer at higher level.
• The computations at this level depends on the results
of lower level.
Tree: 2-D hierarchy
The Pyramid
• The basic architectutre of the pyramid is a for
connected mesh.
• Connection between the layers is fixed and every
processor is connected to one element in the layer
above and four elements in the layer below.
• This approach is suitable for basic image processing
applications.
• Consider the data passing in an N ×N mesh.
• To pass an item of data from one corner of the array
to that diagonally opposite need 2N shift operations.
• If it forms a base level of pyramid ,the same operation
can be performed in 2log2N shifts.
Pyramid: 3-D hierarchy
Hypercube: 2^N nodes in N dimension
Hypercube: 4-D
Long and short-range connections
Data-parallel approaches
Principal characteristics of data-parallel systems

UNIT-2 PP FlynnsClassification
No ratings yet
UNIT-2 PP FlynnsClassification
80 pages
Module 2 - Parallel Computing
No ratings yet
Module 2 - Parallel Computing
55 pages
BCSE412L - Parallel Computing 04
No ratings yet
BCSE412L - Parallel Computing 04
9 pages
Parallel Architectures Parallel Architectures: Ever Faster
No ratings yet
Parallel Architectures Parallel Architectures: Ever Faster
11 pages
PDC-architectures
No ratings yet
PDC-architectures
24 pages
Parallel Processing Report
No ratings yet
Parallel Processing Report
9 pages
SIMD Computer Organizations
0% (1)
SIMD Computer Organizations
20 pages
PARALLEL VS DISTRIBUTED COMPUTING
No ratings yet
PARALLEL VS DISTRIBUTED COMPUTING
9 pages
Unit-1 ACA
No ratings yet
Unit-1 ACA
26 pages
Unit 1 - Part - 2
No ratings yet
Unit 1 - Part - 2
30 pages
COA U5 PPT Full
No ratings yet
COA U5 PPT Full
43 pages
15CS72 ACA Module1 Chapter1FinalCopy
No ratings yet
15CS72 ACA Module1 Chapter1FinalCopy
25 pages
Aca UNIT-5
No ratings yet
Aca UNIT-5
10 pages
Unit 1- Part 1
No ratings yet
Unit 1- Part 1
51 pages
CA Classes-221-225
No ratings yet
CA Classes-221-225
5 pages
Parallel Computig Assignment
No ratings yet
Parallel Computig Assignment
15 pages
Unit 1
No ratings yet
Unit 1
22 pages
Computer Achitecture II - Parallel - Computing
No ratings yet
Computer Achitecture II - Parallel - Computing
46 pages
Lecture006. Introduction Systolic Array
No ratings yet
Lecture006. Introduction Systolic Array
36 pages
02 Lecture Flynn IN
No ratings yet
02 Lecture Flynn IN
78 pages
Lecture 5 Network Topologies for Parallel Architectures - Updated
No ratings yet
Lecture 5 Network Topologies for Parallel Architectures - Updated
46 pages
Parallel Algorithms: Peter Harrison and William Knottenbelt
No ratings yet
Parallel Algorithms: Peter Harrison and William Knottenbelt
65 pages
W3C1 Principles of Parallel Computing
No ratings yet
W3C1 Principles of Parallel Computing
28 pages
Data-Parallel Architectures and
No ratings yet
Data-Parallel Architectures and
27 pages
PDA_2
No ratings yet
PDA_2
105 pages
Introduction To Parallel Programming
No ratings yet
Introduction To Parallel Programming
268 pages
Unit 1 - Part - 3
No ratings yet
Unit 1 - Part - 3
29 pages
Lec1 Introduction to Parallel Computing (2)
No ratings yet
Lec1 Introduction to Parallel Computing (2)
40 pages
Parallel Computing Platforms and Memory System Performance: John Mellor-Crummey
No ratings yet
Parallel Computing Platforms and Memory System Performance: John Mellor-Crummey
43 pages
8051 Arch
No ratings yet
8051 Arch
55 pages
01 Intro Parallel Computing
No ratings yet
01 Intro Parallel Computing
40 pages
Module -4 - Parallel Processing
No ratings yet
Module -4 - Parallel Processing
32 pages
Motivation For Parallelism Motivation For Parallelism
No ratings yet
Motivation For Parallelism Motivation For Parallelism
6 pages
CP4253 Map Unit I
No ratings yet
CP4253 Map Unit I
31 pages
KCS 713 Unit 1 Lecture 5
No ratings yet
KCS 713 Unit 1 Lecture 5
32 pages
UNIT-V-Pipeline and Array Processing and Multi Processors
No ratings yet
UNIT-V-Pipeline and Array Processing and Multi Processors
51 pages
atII Bks Lec 2021 28
No ratings yet
atII Bks Lec 2021 28
6 pages
L2
No ratings yet
L2
27 pages
Unit -01 easid
No ratings yet
Unit -01 easid
18 pages
Explicitly Parallel Platforms
No ratings yet
Explicitly Parallel Platforms
90 pages
Notes_FT_HA
No ratings yet
Notes_FT_HA
4 pages
Parallel Computing
No ratings yet
Parallel Computing
32 pages
Introduction To Parallel Processing
No ratings yet
Introduction To Parallel Processing
49 pages
HPA - Notes
No ratings yet
HPA - Notes
5 pages
CPARCH ACTIVITY (1)
No ratings yet
CPARCH ACTIVITY (1)
3 pages
Lecture 10 - SIMD Architecture
No ratings yet
Lecture 10 - SIMD Architecture
27 pages
Systolic Array
No ratings yet
Systolic Array
42 pages
Architecture1 1 (2012)
No ratings yet
Architecture1 1 (2012)
87 pages
F 23
No ratings yet
F 23
20 pages
W 05 Parallel Processing II
No ratings yet
W 05 Parallel Processing II
32 pages
U1-Theory of Parallelism
No ratings yet
U1-Theory of Parallelism
43 pages
Flynn's Taxonomy and SISD SIMD MISD MIMD
86% (14)
Flynn's Taxonomy and SISD SIMD MISD MIMD
7 pages
Associative Computing Models: SIMD Background
No ratings yet
Associative Computing Models: SIMD Background
39 pages
Parallel Computer Models: CEG 4131 Computer Architecture III Miodrag Bolic
No ratings yet
Parallel Computer Models: CEG 4131 Computer Architecture III Miodrag Bolic
27 pages
04 Hardware
No ratings yet
04 Hardware
109 pages
Data Parallel Algorithms
No ratings yet
Data Parallel Algorithms
14 pages
IJCRT2304397
No ratings yet
IJCRT2304397
5 pages
Parallel_computing
No ratings yet
Parallel_computing
32 pages
unit 4
No ratings yet
unit 4
16 pages
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
From Everand
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
MARIO FRANCO
No ratings yet
Which One Is Better - Rolex Forums - Rolex Watch Forum
No ratings yet
Which One Is Better - Rolex Forums - Rolex Watch Forum
1 page
Bahan Pegangan Print
No ratings yet
Bahan Pegangan Print
8 pages
Paper 1 Ancient Civilisations N
No ratings yet
Paper 1 Ancient Civilisations N
250 pages
M. Sc. (11/01-05/02) Final Year Project: CV: Jahanzeb Ahmed
No ratings yet
M. Sc. (11/01-05/02) Final Year Project: CV: Jahanzeb Ahmed
5 pages
Marx Sahlins Weber Essay
No ratings yet
Marx Sahlins Weber Essay
5 pages
Answers To Exercise Questions - Chapter 2
No ratings yet
Answers To Exercise Questions - Chapter 2
7 pages
Edge Computing
No ratings yet
Edge Computing
14 pages
Maxims Handouts
No ratings yet
Maxims Handouts
3 pages
FC 24ETH Datasheet
No ratings yet
FC 24ETH Datasheet
3 pages
1 To 100 Counting in Sanskrit
No ratings yet
1 To 100 Counting in Sanskrit
2 pages
Kalpesh Project 27
No ratings yet
Kalpesh Project 27
35 pages
Estimation &costing ever exam book (1)
No ratings yet
Estimation &costing ever exam book (1)
37 pages
Is GPS 200
No ratings yet
Is GPS 200
226 pages
Terminology: Environmental Science Is An
No ratings yet
Terminology: Environmental Science Is An
3 pages
50 Shades 13
No ratings yet
50 Shades 13
8 pages
Eligibility Criteria For GSPs
No ratings yet
Eligibility Criteria For GSPs
2 pages
Industrial Trainning Report Tamanna
No ratings yet
Industrial Trainning Report Tamanna
9 pages
Week 3_Trims & inprocess inspection-1
No ratings yet
Week 3_Trims & inprocess inspection-1
31 pages
Iv-Infusion-Rate-Calculations and Sample Questions
No ratings yet
Iv-Infusion-Rate-Calculations and Sample Questions
3 pages
Mapua University: Experiment No. 4.B Plastice Limit and Plasticity Index of Soil
No ratings yet
Mapua University: Experiment No. 4.B Plastice Limit and Plasticity Index of Soil
7 pages
BR SPECORDaccesoires V03 en AJK
No ratings yet
BR SPECORDaccesoires V03 en AJK
20 pages
Invoice Nov
No ratings yet
Invoice Nov
1 page
Vigan City Executive Summary 2020
No ratings yet
Vigan City Executive Summary 2020
5 pages
Motocalv Eg: Calibration Utilities
No ratings yet
Motocalv Eg: Calibration Utilities
2 pages
CV322 Assignment#2
No ratings yet
CV322 Assignment#2
8 pages
Module 1 Arts N Humanities
No ratings yet
Module 1 Arts N Humanities
21 pages
Cheek Plumper
No ratings yet
Cheek Plumper
6 pages
Expenditure
No ratings yet
Expenditure
3 pages
Download Complete Residential Construction Academy: Basic Principles for Construction 5th Edition Mark W. Huth PDF for All Chapters
100% (1)
Download Complete Residential Construction Academy: Basic Principles for Construction 5th Edition Mark W. Huth PDF for All Chapters
47 pages
Backer Material For Use With Cold-And Hot-Applied Joint Sealants in Portland-Cement Concrete and Asphalt Joints
No ratings yet
Backer Material For Use With Cold-And Hot-Applied Joint Sealants in Portland-Cement Concrete and Asphalt Joints
4 pages

10.introduction To Data-Parallel Architectures

Uploaded by

10.introduction To Data-Parallel Architectures

Uploaded by

10.

Introduction to Data-Parallel architectures

• e.g. add: (c1=a1+b1), (c2=a2+b2), (c3=a3+b3)

• This connectivity is established between processing

• The inter connectivity methods are:

• Bus ,crossbar, and multistage networks –used for

• It is arose in the context of spatial mapping coherent

Each VLSI chip has 16 processing elements.

You might also like