
High Performance Portable Libraries for Dense Linear Algebra

Introduction

• In this lecture we will cover the following topic:
  – High performance portable dense linear algebra libraries
• The following libraries will be introduced:
  – BLAS
  – LAPACK
  – BLACS
  – PBLAS
  – ScaLAPACK
• This will also be covered:
  – FORTRAN 77+

The Overall Picture

• The libraries form a layered stack (each layer builds on the ones below):
  – ScaLAPACK
  – LAPACK, PBLAS
  – BLAS, BLACS
  – MPI

NETLIB

• All software discussed in this lecture can be downloaded free of
  charge from NETLIB (repository for freely distributable numerical
  software)
  – http://www.netlib.org/
• Collection of all NETLIB links mentioned in the notes:
  – http://www.netlib.org/blas/
  – http://www.netlib.org/atlas/
  – http://www.netlib.org/blas/gemm_based/
  – http://www.netlib.org/lapack/
  – http://www.netlib.org/blacs/
  – http://www.netlib.org/scalapack/ (includes PBLAS)
Crash Course: FORTRAN 77+

• FORTRAN 77+ is used in these notes to refer to the dialect of
  FORTRAN 77 used by the LAPACK and ScaLAPACK developers.
  – Straight FORTRAN 77 is quite arcane and most compilers have
    implemented a set of extensions.
• FORTRAN has been the language of choice for scientific and
  engineering computing for a long time, partly because it:
  – Has extensive compiler support for multi-dimensional arrays
  – Has restrictions in the language that allow aggressive compiler
    optimizations
  – Has language support (in FORTRAN 90 and onwards) for dynamic
    memory management, derived types, object orientation, operator
    overloading, generic interfaces, array expressions, distributed
    arrays (co-arrays), etc.

Fixed Source Format

• FORTRAN 77+ has a strict source format known as the "fixed source
  format" (declared obsolescent in later standards)
• Columns are used for different things:
  – 1: Comment column
  – 2-5: Label columns
  – 6: Continuation column
  – 7-72: Statement columns
  – 73-: Truncated (silently)
• Set your editor to expand tabs to spaces; use 3 as the tabstop
  (two tabs takes you to column 7).
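• For instance, a minimal fixed-format fragment showing each column
  role: a comment line, statements starting in column 7, a continuation
  line with $ in column 6, and a label in columns 2-5:

*     Column 1 marks this line as a comment
      X = 1.0
      Y = X + 2.0 +
     $    3.0
   10 CONTINUE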

IF-THEN-ELSE

• IF-statement:
  – IF( <logical expression> ) <statement>
• IF-construct:
  • IF( <logical expression> ) THEN
      <block>
    [ELSE IF( <logical expression> ) THEN
      <block>]
    [ELSE
      <block>]
    END IF

GO TO, CONTINUE and Labels

• Labels
  – Integers from 1 to 9999
  – Placed in columns 2 to 5
  – Used as targets for GO TO statements and in DO loops
• GO TO-statement:
  – GO TO <label>
  – Transfers control to the statement labeled with <label>
• CONTINUE-statement:
  – CONTINUE
  – A do-nothing statement often used as a target statement and as the
    DO loop end statement.
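• A small sketch (illustrative values) combining the IF-construct, a
  label, GO TO, and CONTINUE:

      IF( N .LE. 0 ) GO TO 20
      IF( N .GT. 100 ) THEN
         N = 100
      ELSE
         N = N + 1
      END IF
   20 CONTINUE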

DO

• DO-construct:
  – DO <label> <var> = <low>, <high>[, <step>]
      <block>
    <label> CONTINUE
  – Example:
    • DO 10 J = 1, M, 2
         ...
      10 CONTINUE
  – New syntax:
    • DO J = 1, M, 2
         ...
      END DO

PROGRAM

• In FORTRAN you do not have a special function called MAIN, instead
  you have the PROGRAM construct:
  – PROGRAM [name]
      <declarations>
      <statements>
    END [PROGRAM name]
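• Putting the two together, a minimal complete program (illustrative)
  that sums the odd integers up to M:

      PROGRAM SUMODD
      INTEGER J, M, S
      M = 10
      S = 0
      DO 10 J = 1, M, 2
         S = S + J
   10 CONTINUE
      WRITE(*,*) 'SUM = ', S
      END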

SUBROUTINEs and FUNCTIONs

• SUBROUTINEs (think of C functions returning void)
  – CALL mysub(<arglist>)
• FUNCTIONs (think of C functions returning non-void)
  – <lval> = myfunc(<arglist>)
• Declaring a SUBROUTINE:
  – SUBROUTINE <name>(<dummy arglist>)
      <dummy argument type declarations>
    END [SUBROUTINE name]
• Declaring a FUNCTION:
  – <type> FUNCTION <name>(<dummy arglist>)
      <dummy argument type declarations>
    END [FUNCTION name]
  – Example:
    • INTEGER FUNCTION MAX(a, b)
      INTEGER a, b
      MAX = a
      IF( b .GT. a ) MAX = b
      END

Arithmetic Operators

  FORTRAN   C/Java
  +         +
  -         -
  *         *
  /         /
  **        N/A
  N/A       +=
  N/A       ++
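• The slides show a FUNCTION example; a corresponding SUBROUTINE sketch
  (hypothetical names) relies on FORTRAN's pass-by-reference semantics:

      SUBROUTINE SWAP(A, B)
*     Exchanges the values of the two INTEGER arguments.
      INTEGER A, B, T
      T = A
      A = B
      B = T
      END

*     Called as:  CALL SWAP(I, J)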
Logical Operators

  FORTRAN   C/Java
  .GT.      >
  .LT.      <
  .LE.      <=
  .GE.      >=
  .EQ.      ==
  .NE.      !=
  .EQV.     (logical ==)
  .NEQV.    (logical !=, exclusive or)
  .AND.     &&
  .OR.      ||
  .NOT.     !

Data Types

• INTEGER
  – Signed 32-bit (usually) integer
• LOGICAL
  – .TRUE. or .FALSE.
• CHARACTER(<length>) or CHARACTER (just one character)
  – 'string' or "string"
• REAL
  – Single precision IEEE (usually) floating point, f = 5E+0
• DOUBLE PRECISION
  – Double precision IEEE (usually) floating point, d = 5D+0
• COMPLEX
  – Single precision IEEE (usually) complex number, c = (r, i)
• COMPLEX*16
  – Double precision IEEE (usually) complex number, c = (r, i)
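• A small illustrative set of declarations and assignments using these
  types (values chosen arbitrarily):

      INTEGER          N
      LOGICAL          DONE
      CHARACTER*6      NAME
      REAL             F
      DOUBLE PRECISION D
      COMPLEX          C
      COMPLEX*16       Z
      N    = 42
      DONE = .FALSE.
      NAME = 'MATRIX'
      F    = 5E+0
      D    = 5D+0
      C    = (1.0, 2.0)
      Z    = (1D0, 2D0)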

Arrays (Matrices and Vectors)

• Declaring a vector of 50 INTEGERs
  – INTEGER vec(50)
• Declaring a 25 x 47 matrix of INTEGERs
  – INTEGER mtx(25, 47)
• Indexing starts from 1 (unless explicitly stated in the declaration)
• Indexing the top left element of the matrix:
  – mtx(1, 1)
• Indexing the bottom right element of the matrix:
  – mtx(25, 47)

Automatic Arrays

• The size of the array is either known at compile time or determined
  by dummy arguments, and the array is not a dummy argument itself.
• Storage will be allocated (think of it as being allocated on the
  stack) at runtime and deallocated automatically when the variable
  falls out of scope.
  – Example:
    • SUBROUTINE auto(N)
      INTEGER A(N)
      END
Assumed Shape Arrays

• The shape (extent of all dimensions) need not be known at compile
  time.
• An array where the extent of one or more dimensions is determined by
  dummy arguments is referred to as an assumed shape array.
  – Useful for passing arrays as arguments to subroutines.
  – Example:
    • SUBROUTINE mysub(A, LDA, M, N)
      INTEGER LDA, M, N
      REAL A(LDA, N)
      END

Assumed Size Arrays

• The extent of the last dimension of a FORTRAN array need not be known
  at compile time (or at runtime for that matter) to generate indexing
  code (the first dimension in the case of C).
• An array declared with unknown last dimension extent is referred to
  as an assumed size array.
  – REAL A(LDA, *)
• Indexing code: A(i, j) -> A + (i-1) + (j-1)*LDA
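• As an illustration (hypothetical routine name), the LDA convention is
  what lets a callee address a submatrix of a larger array:

      SUBROUTINE SCALE2(A, LDA, M, N, S)
*     Scales the M-by-N submatrix stored in A, with leading
*     dimension LDA, by the scalar S.
      INTEGER LDA, M, N, I, J
      REAL A(LDA, *), S
      DO 20 J = 1, N
         DO 10 I = 1, M
            A(I, J) = S * A(I, J)
   10    CONTINUE
   20 CONTINUE
      END

*     The caller can pass a corner of a larger 10 x 10 array B:
*     REAL B(10, 10)
*     CALL SCALE2(B(2, 3), 10, 4, 5, 2.0)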

Comments

• Comment lines are created by putting (almost) any character (usually
  * or c) in the first column:
  – Example:
    • c This is a comment
      * This is also a comment
      A = 1 c This is not a comment

Continuation (long statements)

• Long statements (going beyond column 72) can be broken into several
  lines by placing (almost) any character (usually numbers, $, &, +) in
  the continuation column (column 6)
  – Example:
    • A(1, 2) = longvariablename +
      $          anotherlongvariable
FORTRAN 77+/C "Interoperability"

• Calling FORTRAN 77+ from C:
  – These are the usual type relationships:
    • LOGICAL            (?)
      INTEGER            int
      CHARACTER          char
      REAL               float
      DOUBLE PRECISION   double
      COMPLEX            float[2]
      COMPLEX*16         double[2]
  – Everything in FORTRAN is passed by reference
    • This is usually implemented by passing a pointer.
    • INTEGER            int*
      DOUBLE PRECISION   double*
      CHARACTER          char*
  – Symbols are usually lower case with an added underscore:
    • SUBROUTINE MySUB(...) -> mysub_
  – Symbols with an underscore sometimes get an extra underscore:
    • SUBROUTINE My_SUB(...) -> my_sub__

Other things to know about FORTRAN

• FORTRAN is case insensitive
• FORTRAN passes everything by reference
• FORTRAN 77 has no type checking of arguments
• FORTRAN 77 has no support for recursive subroutines or functions
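• A hedged sketch of what this means in practice: the FORTRAN routine
  below would typically be callable from C through a prototype like the
  one shown in the comments (symbol name and calling convention are
  compiler dependent):

      SUBROUTINE ADDONE(N, X)
*     Adds 1.0 to each of the N elements of X.
      INTEGER N
      DOUBLE PRECISION X(*)
      INTEGER I
      DO 10 I = 1, N
         X(I) = X(I) + 1D0
   10 CONTINUE
      END

*     Typical matching C prototype and call (compiler dependent):
*       void addone_(int *n, double *x);
*       addone_(&n, x);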

Storage Formats used by the Libraries

• General matrices
  – Column Major
• Symmetric and triangular matrices
  – Column Major, Packed
• Band matrices
  – Diagonal Storage
• Tridiagonal matrices
  – Diagonal Storage

Full Storage Format

  Matrix indices                Memory placement
  11 12 13 14 15 16 17 18 19     0  9 18 27 36 45 54 63 72
  21 22 23 24 25 26 27 28 29     1 10 19 28 37 46 55 64 73
  31 32 33 34 35 36 37 38 39     2 11 20 29 38 47 56 65 74
  41 42 43 44 45 46 47 48 49     3 12 21 30 39 48 57 66 75
  51 52 53 54 55 56 57 58 59     4 13 22 31 40 49 58 67 76
  61 62 63 64 65 66 67 68 69     5 14 23 32 41 50 59 68 77
  71 72 73 74 75 76 77 78 79     6 15 24 33 42 51 60 69 78
  81 82 83 84 85 86 87 88 89     7 16 25 34 43 52 61 70 79
  91 92 93 94 95 96 97 98 99     8 17 26 35 44 53 62 71 80

• Element (i, j) of this 9 x 9 matrix is stored at offset
  (i-1) + (j-1)*9 in column-major order.
Standard Packed Storage Format

  Matrix indices               Memory placement
  11  *  *  *  *  *  *  *  *    0  *  *  *  *  *  *  *  *
  21 22  *  *  *  *  *  *  *    1  9  *  *  *  *  *  *  *
  31 32 33  *  *  *  *  *  *    2 10 17  *  *  *  *  *  *
  41 42 43 44  *  *  *  *  *    3 11 18 24  *  *  *  *  *
  51 52 53 54 55  *  *  *  *    4 12 19 25 30  *  *  *  *
  61 62 63 64 65 66  *  *  *    5 13 20 26 31 35  *  *  *
  71 72 73 74 75 76 77  *  *    6 14 21 27 32 36 39  *  *
  81 82 83 84 85 86 87 88  *    7 15 22 28 33 37 40 42  *
  91 92 93 94 95 96 97 98 99    8 16 23 29 34 38 41 43 44

Rectangular Full Packed Storage Format

  Matrix indices     Memory placement
  11 66 76 86 96      0  9 18 27 36
  21 22 77 87 97      1 10 19 28 37
  31 32 33 88 98      2 11 20 29 38
  41 42 43 44 99      3 12 21 30 39
  51 52 53 54 55      4 13 22 31 40
  61 62 63 64 65      5 14 23 32 41
  71 72 73 74 75      6 15 24 33 42
  81 82 83 84 85      7 16 25 34 43
  91 92 93 94 95      8 17 26 35 44

Banded Storage Format

  Full matrix indices
  11 12  *  *  *  *  *  *  *
  21 22 23  *  *  *  *  *  *
  31 32 33 34  *  *  *  *  *
   * 42 43 44 45  *  *  *  *
   *  * 53 54 55 56  *  *  *
   *  *  * 64 65 66 67  *  *
   *  *  *  * 75 76 77 78  *
   *  *  *  *  * 86 87 88 89
   *  *  *  *  *  * 97 98 99

  Banded storage (diagonals stored as rows)
  Matrix indices                Memory placement
   * 12 23 34 45 56 67 78 89     0  9 18 27 36 45 54 63 72
  11 22 33 44 55 66 77 88 99     1 10 19 28 37 46 55 64 73
  21 32 43 54 65 76 87 98  *     2 11 20 29 38 47 56 65 74
  31 42 53 64 75 86 97  *  *     3 12 21 30 39 48 57 66 75

BLAS

• Basic Linear Algebra Subprograms (BLAS)
  – http://www.netlib.org/blas/               Reference implementation
  – http://www.netlib.org/atlas/              Auto-tuning HPC implementation (ATLAS)
  – http://www.netlib.org/blas/gemm_based/    GEMM-based BLAS by Kågström et al.
  – http://www.tacc.utexas.edu/resources/software/   GotoBLAS
• Interfaces:
  – FORTRAN (official)
  – C, C++, Java, ... (unofficial)
• Language:
  – C, assembler, FORTRAN, ... (depends on vendor)
BLAS - Content

• BLAS contains subroutines and functions for a number of basic linear
  algebra operations:
  – Dot product
  – Givens rotation generation and application
  – Vector updates
  – Matrix-vector product update
  – Triangular system solve (with single or multiple right hand sides)
  – Matrix-matrix product update
  – ...
• The routines operate on various storage formats and on four data
  types (single, double, complex, double complex).

Coding Conventions

• _XXYY
  – _: Data type
    • S, D, C, or Z
  – XX: Type of matrix
    • GE, GB: GEneral, General Banded
    • HE, HB, HP: HErmitian, Hermitian Banded, Hermitian Packed
    • SY, SB, SP: SYmmetric, Symmetric Banded, Symmetric Packed
    • TR, TB, TP: TRiangular, Triangular Banded, Triangular Packed
  – YY: Operation
    • S: "Solve"
    • M: "Matrix"
    • V: "Vector"
    • R: Rank-1
    • R2: Rank-2
    • RK: Rank-k
    • R2K: Rank-2k
• Example:
  – DTRSM:
    • Double precision
    • TRiangular
    • Solve
    • Multiple right hand sides
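• Following the same scheme, DGEMM is the double precision GEneral
  Matrix-Matrix product. A minimal sketch of a call against the
  reference BLAS interface (values chosen arbitrarily):

      PROGRAM GEMMEX
*     Computes C := ALPHA*A*B + BETA*C with DGEMM.
      INTEGER          N
      PARAMETER        ( N = 2 )
      DOUBLE PRECISION A(N,N), B(N,N), C(N,N)
      DATA A / 1D0, 2D0, 3D0, 4D0 /
      DATA B / 1D0, 0D0, 0D0, 1D0 /
      DATA C / 0D0, 0D0, 0D0, 0D0 /
      CALL DGEMM( 'N', 'N', N, N, N, 1D0, A, N, B, N, 0D0, C, N )
      WRITE(*,*) C
      END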

Memory Traffic - Limitations

• Memory bandwidth and latency cannot match the high performance of
  floating point computations on the chip.
• Solution:
  – Exploit caches through data locality in space and time
• The solution requires:
  – An operation that has much inherent locality
• Metric for estimating inherent locality in linear algebra:
  – (number of floating point operations) /
    (number of memory locations referenced)
  – i.e., flop/memref

Locality - Examples

• Example of poor inherent locality: AXPY (a*x + y)
  – 2 vector loads (x, y)
  – 1 vector store (y)
  – 2 vector operations (*, +)
  – flop/memref = 2/3
• Example of good inherent locality: GEMM (a*A*B + b*C)
  – ~2*N^3 flops
  – ~3*N^2 loads
  – ~1*N^2 stores
  – flop/memref ~ N/2
Level 1, 2, 3

• Level-1 BLAS: Vector operations (~1 flop/memref)
  – _dot, _axpy, _swap, _copy, _scal, ...
• Level-2 BLAS: Matrix-Vector operations (~1 flop/memref)
  – _gemv, _symv, _trsv, ...
• Level-3 BLAS: Matrix-Matrix operations (~N flop/memref)
  – _gemm, _syrk, _trsm, ...

LAPACK

• Linear Algebra PACKage (LAPACK)
  – http://www.netlib.org/lapack/         Official LAPACK releases
  – http://www.netlib.org/lapack/lanws/   Publications related to LAPACK and DLA
• Some vendors provide their own optimized LAPACK routines as well as
  BLAS routines:
  – IBM: ESSL (Proprietary)
  – AMD: ACML (Free?)
  – Intel: MKL (Proprietary?)
  – Cray: libsci (Proprietary?)
• Interfaces:
  – FORTRAN (official)
  – C, C++, Java, ... (unofficial)
• Language:
  – FORTRAN 77+

LAPACK - Content

• Compared with BLAS, the high level algorithms and the tricky
  numerical algorithms go into LAPACK.
  – Factorizing matrices
    • LU, Cholesky, QR, QL, RQ, LQ, ...
  – Applying factored-form orthogonal matrices
  – Solving linear equations
  – Solving linear least squares problems
  – Decomposing matrices
    • SVD, Schur, ...
  – Computing eigenvalues and eigenvectors
    • Symmetric, non-symmetric, ...
  – Error bounds, condition estimation

Workspace Management

• Many routines in LAPACK require auxiliary workspace to function
  and/or to run faster.
• Users must provide this storage.
• Routines take workspace via their arguments, typically:
  – WORK: Workspace
  – LWORK: Length of workspace
• Routines requiring workspace allow workspace queries.
  – Workspace query:
    • Call with LWORK = -1
    • WORK(1) contains the required workspace on return
    • Cast to INTEGER: INT(WORK(1))
  – If you do a workspace query the routine will not modify any of its
    other arguments.
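• As an illustration, the usual two-call pattern with a workspace query
  (a sketch using DGEQRF, which appears in the examples below; the WORK
  array must be declared at least LWORK elements long):

*     First call: workspace query only (LWORK = -1).
      LWORK = -1
      CALL DGEQRF( M, N, A, LDA, TAU, WORK, LWORK, INFO )
*     The optimal workspace size is returned in WORK(1).
      LWORK = INT( WORK(1) )
*     Second call: the actual factorization.
      CALL DGEQRF( M, N, A, LDA, TAU, WORK, LWORK, INFO )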
Error Reporting

• LAPACK routines have an extra INTEGER argument at the end of their
  argument lists: INFO
  – The value of INFO tells what went wrong (if anything):
    • 0: Success
    • < 0: Argument number -INFO contained an illegal value (fatal,
      programming error)
    • > 0: Something went wrong during the computation (exact
      interpretation is routine specific)
  – Example: DGETRF (LU factorization)
    • INFO > 0: U(INFO, INFO) is exactly zero

LAPACK - Examples

• Solving a linear system after LU factorization
  – DGETRS( TRANS, N, NRHS, A, LDA, IPIV, B, LDB, INFO )
• Computing a QR factorization
  – DGEQRF( M, N, A, LDA, TAU, WORK, LWORK, INFO )
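• A minimal sketch of solving A*x = b with DGETRF and DGETRS (error
  handling reduced to checking INFO; the data values are arbitrary):

      PROGRAM SOLVE
      INTEGER          N, NRHS, INFO
      PARAMETER        ( N = 3, NRHS = 1 )
      INTEGER          IPIV(N)
      DOUBLE PRECISION A(N,N), B(N,NRHS)
      DATA A / 4D0, 1D0, 0D0,
     $         1D0, 4D0, 1D0,
     $         0D0, 1D0, 4D0 /
      DATA B / 1D0, 2D0, 3D0 /
*     LU factorization with partial pivoting: A = P*L*U.
      CALL DGETRF( N, N, A, N, IPIV, INFO )
      IF( INFO .NE. 0 ) STOP 'DGETRF FAILED'
*     Solve using the factors; the solution overwrites B.
      CALL DGETRS( 'N', N, NRHS, A, N, IPIV, B, N, INFO )
      IF( INFO .NE. 0 ) STOP 'DGETRS FAILED'
      WRITE(*,*) B
      END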

BLACS

• Basic Linear Algebra Communication Subprograms (BLACS)
  – http://www.netlib.org/blacs/   Official BLACS releases
• Purpose:
  – Communication of submatrices appropriate for dense linear algebra
    algorithms (e.g., ScaLAPACK)
• Objection:
  – "I know MPI inside out, why should I learn BLACS?"
• Answer:
  – It will hopefully be apparent at the end of this segment.
• Interfaces:
  – FORTRAN, C (official)
• Language:
  – C

2D-Grid, Scope, Context

• Processes are arranged in a logical 2D-grid.
• Each process is a member of three scopes:
  – 'All': All processes in the grid
  – 'Row': All processes on the same row of the grid
  – 'Column': All processes on the same column of the grid
• BLACS communication is tied to a context (think of MPI
  communicators), which is an integer.
Submatrix Communication

• The BLACS unit of communication is a submatrix of some specified size
  and shape.
• Two types of submatrices:
  – General submatrices:
    • Parameters: M, N, A, LDA
  – Trapezoidal submatrices (generalization of triangular):
    • Parameters: M, N, A, LDA, UPLO, DIAG
• Packing of matrices is hidden from the user
• Types supported:
  – I: Integer
  – S: Single precision
  – D: Double precision
  – C: Complex single precision
  – Z: Complex double precision

Point-to-Point

• Send:
  – xGESD2D(CTXT, M, N, A, LDA, RDST, CDST)
  – xTRSD2D(CTXT, UPLO, DIAG, M, N, A, LDA, RDST, CDST)
• Receive:
  – xGERV2D(CTXT, M, N, A, LDA, RSRC, CSRC)
  – xTRRV2D(CTXT, UPLO, DIAG, M, N, A, LDA, RSRC, CSRC)
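• A sketch of sending a 2 x 2 double precision block from the process at
  grid position (0, 0) to (1, 0), assuming the grid and context have
  been set up as shown later in the BLACS setup slide:

      IF( MYROW .EQ. 0 .AND. MYCOL .EQ. 0 ) THEN
*        Send the leading 2 x 2 block of A to process (1, 0).
         CALL DGESD2D( CTXT, 2, 2, A, LDA, 1, 0 )
      ELSE IF( MYROW .EQ. 1 .AND. MYCOL .EQ. 0 ) THEN
*        Receive it from process (0, 0).
         CALL DGERV2D( CTXT, 2, 2, A, LDA, 0, 0 )
      END IF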

Collectives

• Broadcast (send):
  – xGEBS2D(CTXT, SCOPE, TOP, M, N, A, LDA)
  – xTRBS2D(CTXT, SCOPE, TOP, UPLO, DIAG, M, N, A, LDA)
• Broadcast (receive):
  – xGEBR2D(CTXT, SCOPE, TOP, M, N, A, LDA, RSRC, CSRC)
  – xTRBR2D(CTXT, SCOPE, TOP, UPLO, DIAG, M, N, A, LDA, RSRC, CSRC)
• Combine operations (SUM, MAX, MIN):
  – xGSUM2D(CTXT, SCOPE, TOP, M, N, A, LDA, RDST, CDST)
  – xGAMX2D(CTXT, SCOPE, TOP, M, N, A, LDA, RA, CA, RCFLAG, RDST, CDST)
  – xGAMN2D(CTXT, SCOPE, TOP, M, N, A, LDA, RA, CA, RCFLAG, RDST, CDST)

Collectives: Topology

• Topologies (TOP) specify the communication pattern:
  – 'I': Increasing ring
  – 'D': Decreasing ring
  – 'S': Split ring
  – 'M': Multi-ring
  – '1': 1-tree
  – 'B': Bidirectional exchange
  – ' ': Default (may use MPI_Bcast)
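• A sketch of broadcasting an M x N block along a process row with the
  default topology (the root is the process in grid column 0):

      IF( MYCOL .EQ. 0 ) THEN
*        Root of the row broadcast: send A to the rest of the row.
         CALL DGEBS2D( CTXT, 'Row', ' ', M, N, A, LDA )
      ELSE
*        The other processes on the row receive from column 0.
         CALL DGEBR2D( CTXT, 'Row', ' ', M, N, A, LDA, MYROW, 0 )
      END IF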
BLACS - Setup (FORTRAN)

• Initializing BLACS:
  – CALL BLACS_PINFO(ME, NP)
• Initializing a context:
  – CALL BLACS_GET(0, 0, CTXT)
    CALL BLACS_GRIDINIT(CTXT, 'Row', P, Q)
    CALL BLACS_GRIDINFO(CTXT, P, Q, MYROW, MYCOL)
• Getting someone's rank from coordinates:
  – RANK = BLACS_PNUM(CTXT, ROW, COL)
• Getting someone's coordinates from rank:
  – CALL BLACS_PCOORD(CTXT, RANK, ROW, COL)
• Exiting BLACS:
  – CALL BLACS_EXIT(0)

PBLAS

• Parallel BLAS
  – http://www.netlib.org/scalapack/   The PBLAS reference implementation
                                       is part of ScaLAPACK
• Interfaces:
  – FORTRAN
  – C?
• Language:
  – C
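• Putting the calls together, a minimal sketch of a program that sets up
  a 2 x 2 grid and shuts down again (grid dimensions hard coded for
  illustration):

      PROGRAM HELLO
      INTEGER ME, NP, CTXT, P, Q, MYROW, MYCOL
*     Find out how many processes there are.
      CALL BLACS_PINFO( ME, NP )
*     Get a default system context and map the processes onto a
*     2 x 2 grid in row-major order.
      P = 2
      Q = 2
      CALL BLACS_GET( 0, 0, CTXT )
      CALL BLACS_GRIDINIT( CTXT, 'Row', P, Q )
      CALL BLACS_GRIDINFO( CTXT, P, Q, MYROW, MYCOL )
      WRITE(*,*) 'PROCESS', ME, 'AT GRID POSITION', MYROW, MYCOL
      CALL BLACS_EXIT( 0 )
      END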

2D Block Cyclic Distribution

• PBLAS operates on data distributed using the 2D block cyclic
  distribution.
• Recall (5 x 5 matrix, 2 x 2 blocks, 2 x 2 process grid):

  Global 5 x 5 matrix     Local parts per process
  11 12 13 14 15          P(0,0): 11 12 15     P(0,1): 13 14
  21 22 23 24 25                  21 22 25             23 24
  31 32 33 34 35                  51 52 55             53 54
  41 42 43 44 45          P(1,0): 31 32 35     P(1,1): 33 34
  51 52 53 54 55                  41 42 45             43 44

Matrix Descriptors

• Descriptors are used in PBLAS and ScaLAPACK to encapsulate
  information about a distributed matrix.
• A descriptor is a 9-item integer vector:
  – INTEGER DESCA(9)
  – DESCA(1): (DTYPE) Descriptor type (1 for dense matrices)
    DESCA(2): (CTXT)  BLACS context
    DESCA(3): (M)     Number of rows in the global matrix
    DESCA(4): (N)     Number of columns in the global matrix
    DESCA(5): (MB)    Row blocking factor
    DESCA(6): (NB)    Column blocking factor
    DESCA(7): (RSRC)  Row index of the owner of A(1, 1)
    DESCA(8): (CSRC)  Column index of the owner of A(1, 1)
    DESCA(9): (LLD)   Leading dimension of the local matrix
PBLAS - Example

• Parallel version of DGEMM:
  – CALL PDGEMM( TRANSA, TRANSB,
                 M, N, K,
                 ALPHA, A, IA, JA, DESC_A,
                        B, IB, JB, DESC_B,
                 BETA,  C, IC, JC, DESC_C )
• Notice:
  – PBLAS has interfaces that take descriptions of submatrices
    explicitly
  – BLAS, on the other hand, takes submatrices implicitly

ScaLAPACK

• SCAlable LAPACK (distributed memory)
  – http://www.netlib.org/scalapack/   Official ScaLAPACK releases

ScaLAPACK - Content

• Most of LAPACK
• No support for band and packed matrices
• Missing some more advanced algorithms
  – SVD and QR-with-pivoting least squares
  – Generalized least squares
  – Non-symmetric eigenvalue problems
  – D&C SVD
  – ...

ScaLAPACK - Coding Conventions

• Symbols are similar to LAPACK (just add a P prefix)
• Submatrices are referenced explicitly in the interface:
  – A(I, J), LDA        LAPACK submatrix reference
  – A, I, J, DESCA      ScaLAPACK submatrix reference
Utilities: DESCINIT

• SUBROUTINE DESCINIT(DESC, M, N, MB, NB, RSRC, CSRC, CTXT, LLD, INFO)
• Initializes all elements of a descriptor.
• Arguments:
  – DESC         Descriptor to initialize (output)
  – M, N         Size of the global matrix
  – MB, NB       Blocking factors
  – RSRC, CSRC   Coordinates of the owner of the (1, 1) matrix element
  – CTXT         BLACS context
  – LLD          Leading dimension of the local matrix (use NUMROC to
                 find it)
  – INFO         Error reporting, 0: success (output)

Utilities: NUMROC

• INTEGER FUNCTION NUMROC(N, NB, ME, SRC, NP)
• Finds the number of rows (or columns) mapped to a specific grid row
  (or column).
• Arguments:
  – N     Extent of the matrix dimension
  – NB    Blocking factor in the matrix dimension
  – ME    Row (or column) index of the processor of interest
  – SRC   Row (or column) index of the source
  – NP    Number of processes in the grid dimension
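• A sketch of how the two utilities are typically combined to build a
  descriptor for an M x N matrix with MB x NB blocks (MYROW, NPROW and
  CTXT are assumed to come from the BLACS setup shown earlier):

      INTEGER          M, N, MB, NB, LLD, INFO
      INTEGER          DESCA(9)
      INTEGER          NUMROC
      EXTERNAL         NUMROC
      M  = 100
      N  = 100
      MB = 8
      NB = 8
*     Number of local rows owned by this process row; the local
*     leading dimension must be at least this large.
      LLD = MAX( 1, NUMROC( M, MB, MYROW, 0, NPROW ) )
*     Fill in the 9-element descriptor for the distributed matrix.
      CALL DESCINIT( DESCA, M, N, MB, NB, 0, 0, CTXT, LLD, INFO )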

Utilities: INFOG2L
• SUBROUTINE INFOG2L(GRINDX, GCINDX, DESC, NPROW, NPCOL,
MYROW, MYCOL, LRINDX, LCINDX, RSRC, CSRC)
• Given a global matrix element (GRINDX, GCINDX), returns
the corresponding local matrix element (LRINDX, LCINDX)
and coordinates of processor that owns that element
(RSRC, CSRC).
• Arguments:
– GRINDX, GCINDX Global matrix element
– DESC Descriptor of matrix
– NPROW, NPCOL Grid size
– MYROW, MYCOL My coordinates
– LRINDX, LCINDX Local matrix element (output)
– RSRC, CSRC Owner of element (output)
