1.1 Teradata Architecture
1.1 Teradata Architecture
A
Swapnil Mahalle (176191)
[email protected]
What is Teradata?
subject areas
Many concurrent users
Many concurrent queries, including ad-hoc
queries
Large quantity of tables
Hundreds of gigabytes (and terabytes) of detail
data
Historical data stored (months or years)
Customer Relationship
Management
Data marts
Unique Features
Scalability
"Linear scalability" means that as you add components to the system, the
performance increase is linear.
Hardware
SMP: Symmetric Multiprocessing Platform
MPP: Massively parallel processing
systems
Complexity
Teradata is adept at complex data models that
satisfy the information needs throughout an
enterprise. It has the ability to perform large
aggregations during query run time and can
perform up to 64 joins in a single query.
Concurrent Users
Teradata can handle the most concurrent users,
who are often running multiple, complex
queries.
Unconditional Parallelism
DATAWAREHOUSING
Evolution
Various Stages of DW
Environments
Data Marts
Data Marts
A Teradata System
Node Components
Software Components
TPA, PDE, OS
The Parallel Database Extensions (PDE) software layer was added to the
operating system by NCR to support the parallel software environment.
AMP
PE
Channel Driver
Teradata Gateway
Parsing Engine
Session Control
Parser
Optimizer
Dispatcher
AMP
The AMP is a vproc that controls its portion of the data on the system. The
AMPs work in parallel, each AMP managing the data rows stored on its
vdisk.
Data Distribution
When data is loaded, inserted, and updated, the AMP:
Data Access
BYNET
Features:
Scalable
High performance
Fault tolerant
Load balanced
Point-to-point
Multicast
Broadcast
Point-to-Point Messages
Communication Between
Vprocs
Multicast Messages
Broadcast Messages
Cliques
Database
Databases
Tables
Views
Macros
Triggers
Stored procedures
USER
Teradata Objects
Tables: A table in a relational database management system is a twodimensional structure made up of columns and physical rows stored in data
blocks on the disk drives.
Views: A view is like a "window" into tables that allows multiple users to look at
portions of the same base data. A view may access one or more tables, and
may show only a subset of columns from the table(s).
Macros: Macros are pre-defined, stored sets of one or more SQL
commands and/or report-formatting (BTEQ) commands. Macros can also
contain comments.
Triggers: A trigger is a set of SQL statements usually associated with a column
or table that are programmed to be run (or "fired") when specified changes are
made to the column or table. The pre-defined change is known as a triggering
event, which causes the SQL statements to be processed.
Stored Procedures: A stored procedure is a pre-defined set of statements
invoked through a single CALL statement in SQL. While a stored procedure
may seem like a macro, it is different in that it can contain:
Teradata SQL data manipulation statements (non-procedural)
Procedural statements (in Teradata, referred to as Stored Procedure Language)
Space Management
Perm Space:
Space Management
Spool Space:
Space Management
Temporary Space:
The amount of space taken by global temporary table
during query processing.
Perm space ,not yet occupied is used as Temp space in
Teradata.
The result or inserted data remains available only
through the session.
Temp space is freed upon session completion.
Data Dictionary
Disk space
Access authorizations
Ownership
Data definitions
Data Protection
LOCKS
Exclusive, Write, Read and Access
RAID: Redundant Array of Inexpensive Disks
FALLBACK
JOURNALS
Permanent and Recovery
Locks
Types :
Exclusive Locks
Write
Read
Access
FALLBACK
JOURNALS
Permanent Journals
Optional,
Recovery Journals
An
THANK YOU
Swapnil Mahalle (176191)
[email protected]