0% found this document useful (0 votes)
2 views

17 Assignment 8 ADF Data Flow 5 Question

The document outlines the creation of multiple data processing pipelines involving customer and product data. It includes tasks such as copying customer data from a CSV to SQL for even customer IDs, joining customer and address files into a JSON, filtering and saving customer data from SQL and CSV as a Parquet file, and calculating the highest list price of products while excluding blue ones, saving results in specified CSV folders. The document references specific customer and product data files for these operations.

Uploaded by

akhilesh
Copyright
© © All Rights Reserved
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views

17 Assignment 8 ADF Data Flow 5 Question

The document outlines the creation of multiple data processing pipelines involving customer and product data. It includes tasks such as copying customer data from a CSV to SQL for even customer IDs, joining customer and address files into a JSON, filtering and saving customer data from SQL and CSV as a Parquet file, and calculating the highest list price of products while excluding blue ones, saving results in specified CSV folders. The document references specific customer and product data files for these operations.

Uploaded by

akhilesh
Copyright
© © All Rights Reserved
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
You are on page 1/ 1

Create a pipeline to copy the customer data from csv file to SQL where the customer

id is an even number.
(Attaching the Customer File)

Create a pipeline to join the two files (Customer, Customer Address) based on
customer id and save the result as a JSON file.

Create a pipeline to read the Customer table data from SQL and CustomerAddress data
from CSV, join both of them,
and then save the result where customer id> 1000 & Customer id <2000 in ascending
order as a Parquet file.

create a pipeline to read the Product CSV file, and calculate the highest listPrice
of any product under each productcategory.
Ensure that product shouldn't be of blue in color and save the result as CSV file
inside ProductResult folder.

create a pipeline to read the Product CSV file, and calculate the highest listPrice
of any product under each productcategory.
Ensure that product shouldn't be of blue in color and save the result as a SINGLE
CSV file inside ProductSingleResult folder.

SalesLT.Customer.txt
SalesLT.CustomerAddress.txt
SalesLT.Product.txt

You might also like