0% found this document useful (0 votes)
27 views

Apache Hadoop Apache Spark

Amazon EMR is a managed cluster platform that simplifies running big data frameworks like Apache Hadoop and Apache Spark on AWS to process and analyze large amounts of data. It uses frameworks and open-source projects like Apache Hive and Apache Pig to process data for analytics and business intelligence workloads, and can move large amounts of data between AWS services like S3 and DynamoDB. Amazon Elastic Transcoder converts media files stored in S3 into formats for different devices. Jobs transcode individual files, pipelines manage jobs in queues, presets define common format settings, and notifications report job status via SNS.

Uploaded by

Pravin Poudel
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
27 views

Apache Hadoop Apache Spark

Amazon EMR is a managed cluster platform that simplifies running big data frameworks like Apache Hadoop and Apache Spark on AWS to process and analyze large amounts of data. It uses frameworks and open-source projects like Apache Hive and Apache Pig to process data for analytics and business intelligence workloads, and can move large amounts of data between AWS services like S3 and DynamoDB. Amazon Elastic Transcoder converts media files stored in S3 into formats for different devices. Jobs transcode individual files, pipelines manage jobs in queues, presets define common format settings, and notifications report job status via SNS.

Uploaded by

Pravin Poudel
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Amazon EMR is a managed cluster platform that simplifies running big data

frameworks, such as Apache Hadoop and Apache Spark, on AWS to process and

analyze vast amounts of data. By using these frameworks and related open-source

projects, such as Apache Hive and Apache Pig, you can process data for analytics

purposes and business intelligence workloads. Additionally, you can use Amazon EMR

to transform and move large amounts of data into and out of other AWS data stores

and databases, such as Amazon Simple Storage Service (Amazon S3) and Amazon

DynamoDB.

Amazon Elastic Transcoder lets you convert media files that you have stored in

Amazon S3 into media files in the formats required by consumer playback devices.

For example, you can convert large, high-quality digital media files into formats that

users can play back on mobile devices, tablets, web browsers, and connected

televisions.

Elastic Transcoder has four components:

 Jobs do the work of transcoding. Each job converts one file into up to 30

formats. For example, if you want to convert a media file into six different formats, you

can create files in all six formats by creating a single job.

 Pipelines are queues that manage your transcoding jobs. When you create a

job, you specify which pipeline you want to add the job to.

 If you configure a job to transcode into more than one format, Elastic Transcoder

creates the files for each format in the order in which you specify the formats in the job.
-A pipeline can process more than one job simultaneously, and jobs don't necessarily

complete in the order in which you create them.

-Pipelines and jobs are associated with specific regions.

-You can temporarily stop processing jobs by pausing the pipeline.

 Presets are templates that contain most of the settings for transcoding media

files from one format to another. Elastic Transcoder includes some default presets for

common formats, for example, several iPod and iPhone versions.

 Preset are customizable

Notifications let you optionally configure Elastic Transcoder and Amazon Simple

Notification Service (SNS) to keep you apprised of the status of a job.

You might also like