Introduction

Uploaded by

danukrishnan003
INTRODUCTION TO TALEND

Talend Architecture
INTRODUCTION
• Talend is an ETL tool.
• ETL stands for extract, transform, load.
• Talend is a software integration platform that provides solutions for
data integration, data quality, data management, data preparation,
and big data.
• Talend was founded by Fabrice Bonan and Bertrand Diard to address a
gap they saw in the enterprise information world.
• Talend Open Studio version 1.0 was launched in 2006.
Talend
• Talend is an open-source software integration platform that offers
solutions for data integration, data management, big data, data
quality, and data preparation.
• Talend makes the ETL process easier and more cost-effective.
• Talend is among the more capable tools on the market for ETL data
integration, cloud integration, and big data integration.
• It is well suited to big data work because it provides plugins for
integrating with big data platforms efficiently.
• Talend provides a unified repository for storing and reusing
metadata.
• Talend is available in both open-source and premium versions.
Talend
• Talend's data integration can combine data from various sources into a
single, unified view, which is one of its most useful capabilities.
• Talend's first product was Talend Open Studio, launched in 2006.
• It is now known as Talend Open Studio for Data Integration.
• Talend has since released a wide range of products that are commonly used
in the market.
• Talend helps organizations make decisions in real time and become more
data-driven.
• Talend is recognized as a next-generation leader in cloud and big data
integration software: with Talend, data becomes more accessible, its
quality improves, and it can be moved quickly to target systems.
Talend
• Talend speeds up development and deployment by automating tasks.
• Talend costs less because its open-source edition can be downloaded
free of charge.
• Talend provides a unified platform that meets all of our needs on a
common foundation.
• Talend is backed by a large community. Because it is an open-source
tool, the community is the preferred place for Talend users and
community members to share doubts, questions, and experiences.
Talend-Data Integration
• Most organizations collect data from multiple places and store it separately.
Data integration is the process of bringing that data together.
• When the organization needs to make a decision, it takes the data from the
different sources, puts it into a unified view, and then analyzes it to get the
result.
• Talend Data Integration is an open-source ETL (extract, transform, load) tool,
and it can also be used for ETL testing.
• Talend Data Integration has an open, scalable architecture, which allows a
faster response to business requests.
• With Talend Data Integration, users can run ETL tasks on remote servers that
run different operating systems.
• It can integrate data with other data warehouses; in other words, it
synchronizes data between systems.
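
The idea of pulling data from separate sources into one unified view can be illustrated with a minimal Python sketch. This is not Talend code and does not use any Talend API; the sample data, the table name customer_view, and all function names are hypothetical and chosen only to show the extract, transform, and load steps.

```python
import csv
import io
import sqlite3

# Two hypothetical sources that hold customer data separately.
CRM_CSV = "id,name,country\n1,Asha,in\n2,Bruno,fr\n"
BILLING_CSV = "customer_id,amount\n1,120.50\n2,80.00\n1,30.00\n"


def extract(text):
    """Extract: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(customers, payments):
    """Transform: upper-case country codes and total payments per customer."""
    totals = {}
    for p in payments:
        cid = int(p["customer_id"])
        totals[cid] = totals.get(cid, 0.0) + float(p["amount"])
    return [
        (int(c["id"]), c["name"], c["country"].upper(),
         totals.get(int(c["id"]), 0.0))
        for c in customers
    ]


def load(rows, conn):
    """Load: write the unified rows into a single target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS customer_view "
        "(id INTEGER PRIMARY KEY, name TEXT, country TEXT, total_paid REAL)"
    )
    conn.executemany("INSERT INTO customer_view VALUES (?, ?, ?, ?)", rows)
    conn.commit()


def run_etl():
    """Run the three steps against an in-memory SQLite target."""
    conn = sqlite3.connect(":memory:")
    rows = transform(extract(CRM_CSV), extract(BILLING_CSV))
    load(rows, conn)
    return conn.execute("SELECT * FROM customer_view ORDER BY id").fetchall()


if __name__ == "__main__":
    for row in run_etl():
        print(row)
```

In a real Talend job the same three steps would be built from components in the Studio rather than written by hand; the sketch only shows what a "unified view" means in practice.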
Architecture
• The architecture model of Talend Open Studio identifies the Talend
data integration functions, their interactions, and the corresponding
IT needs.
The architecture is divided into the following functional blocks:
• Clients
• Servers
• Databases
• Repositories
• Execution Servers
Clients

• The client block is used for building and monitoring Talend Jobs.
• The client block includes one or more Talend Studio instances and web
browsers, which can be on the same or different machines.
• Talend Studio lets us work on any project for which we have
authorization.
• From a web browser, we can connect to the remote Talend Administration
Center over a secured HTTP connection.
• From the Studio, we can carry out data integration processes
regardless of data volume and process complexity.
Servers

• The server block is used for administration, management, and
monitoring.
• The server block contains a web-based application, the Talend
Administration Center, which enables the management and
administration of all projects.
• Administration metadata, for example user accounts, access rights,
and project authorizations, is stored in the administration database.
• Project items such as jobs, business models, and routines are stored
on an SVN or Git server.
Databases

• The databases are used to store metadata and configuration
information.
• The administration, audit, and monitoring databases belong to the
databases block.
• The administration database is used to manage user accounts,
access rights, project authorizations, and so on.
• The audit database is used to evaluate different aspects of the jobs
implemented in a project developed in Talend Studio.
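
As a rough illustration of what an administration database holds, the following Python sketch models user accounts, projects, and project authorizations in SQLite. The schema, table names, and sample logins are assumptions made for this example only; they do not reflect the actual schema used by the Talend Administration Center.

```python
import sqlite3


def build_admin_db():
    """Create a hypothetical administration database in memory."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE users (login TEXT PRIMARY KEY);
        CREATE TABLE projects (name TEXT PRIMARY KEY);
        CREATE TABLE authorizations (
            login   TEXT REFERENCES users(login),
            project TEXT REFERENCES projects(name),
            access  TEXT CHECK (access IN ('read', 'write')),
            PRIMARY KEY (login, project)
        );
        INSERT INTO users VALUES ('dev1'), ('analyst1');
        INSERT INTO projects VALUES ('sales_dw');
        INSERT INTO authorizations VALUES ('dev1', 'sales_dw', 'write');
        """
    )
    return conn


def can_open(conn, login, project):
    """Return True if the user has any authorization on the project."""
    row = conn.execute(
        "SELECT 1 FROM authorizations WHERE login = ? AND project = ?",
        (login, project),
    ).fetchone()
    return row is not None


if __name__ == "__main__":
    db = build_admin_db()
    print(can_open(db, "dev1", "sales_dw"))
    print(can_open(db, "analyst1", "sales_dw"))
```

The point of the sketch is the separation of concerns: accounts, projects, and the rights linking them are stored centrally, so a client only works on a project when a matching authorization row exists.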
Repositories

• The repositories block hosts project metadata and binaries.
• The SVN or Git server and the Nexus repository belong to the
repositories block.
• The SVN or Git server centralizes all project items, such as jobs and
business models. These items are shared between end users and are
accessible from Talend Studio, where they are developed.
• The items are also accessible from the Talend Administration Center,
which is used to publish, deploy, and monitor them.
• The Nexus repository stores software updates that are available for
download, as well as jobs published from Talend Studio that are ready
to be deployed and executed.
Execution Servers
• The execution servers are used for deploying and launching jobs.
• The execution servers block contains one or more execution
servers deployed inside our information system.
• Talend jobs are deployed to the execution servers through the
Administration Center's Job Conductor, to be executed at a scheduled
time or date, or when an event occurs.
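
The scheduling idea described above, where a job runs at a set time or date or when an event occurs, can be sketched in Python as follows. This is a conceptual model only, not the Job Conductor's implementation or API; the Task class, the due_tasks function, and the job, server, and event names are all hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass
class Task:
    """A job assigned to an execution server with one kind of trigger."""
    job_name: str
    server: str
    run_at: Optional[datetime] = None   # time/date trigger (assumed model)
    on_event: Optional[str] = None      # event trigger (assumed model)


def due_tasks(tasks, now, events=()):
    """Return the tasks whose time has arrived or whose event has fired."""
    due = []
    for task in tasks:
        time_due = task.run_at is not None and task.run_at <= now
        event_due = task.on_event is not None and task.on_event in events
        if time_due or event_due:
            due.append(task)
    return due


if __name__ == "__main__":
    tasks = [
        Task("load_sales", "exec01", run_at=datetime(2024, 1, 1, 2, 0)),
        Task("import_file", "exec02", on_event="file_arrived"),
    ]
    now = datetime(2024, 1, 1, 3, 0)
    for task in due_tasks(tasks, now, events={"file_arrived"}):
        print(f"launch {task.job_name} on {task.server}")
```

Keeping the trigger separate from the job itself is what lets the same published job be scheduled on different servers or under different conditions.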
