BDA Regular Paper Solution


Birla Institute of Technology & Science, Pilani

Work-Integrated Learning Programmes Division


Semester 1- 2024-2025
Mid-Semester Regular (EC-2)

Course No. : BA ZG525


Course Title : BIG DATA ANALYTICS
Nature of Exam : Closed Book
Weightage : 30% No. of Pages = 2
Duration : 2 Hours No. of Questions = 7
Date of Exam : 22 September, 2024 - 01:00 PM
Note:
1. Please follow all the Instructions to Candidates given on the cover page of the answer book.
2. All parts of a question should be answered consecutively. Each answer should start from a fresh page.
3. Assumptions made if any, should be stated clearly at the beginning of your answer.

Q.1 Case Study:


XYZ Corporation, a multinational company, handles various types of data across its operations. The
data management team has identified several key data sources:
(a) Detailed customer profiles stored in their internal systems.
(b) A large volume of customer feedback gathered from social media platforms.
(c) Daily financial transactions recorded by the sales department.
(d) Data exchanged between systems using XML files.
Given the nature of the data sources above, identify the type of structure each data source likely
follows. Discuss the specific challenges XYZ Corporation might encounter when managing and
analyzing each type of data. [4 Marks]

Solution:
Data Sources and their Likely Structures:
(a) Detailed customer profiles stored in internal systems
Data Structure: Structured Data
Explanation: Customer profiles typically follow a well-defined schema, stored in relational
databases (e.g., SQL), with clear columns such as name, address, purchase history, etc.
Challenges:
i. Scalability: As the company grows, managing large volumes of structured data may
become cumbersome.
ii. Integration: Integrating structured customer data with unstructured data from other
sources may be challenging.
iii. Privacy & Security: Ensuring customer data protection and compliance with data
privacy laws (e.g., GDPR) is critical. 1 Mark
(b) Customer feedback from social media platforms
Data Structure: Unstructured Data
Explanation: Social media data (e.g., tweets, comments) is unstructured and does not
follow a predefined schema. It may contain text, images, videos, etc.
Challenges:
i. Data Volume: Handling large volumes of unstructured data is difficult, especially
for sentiment analysis.
ii. Data Cleaning: Extracting relevant information from noisy, informal text.
iii. Real-time Analysis: Processing feedback in real-time for timely decision-making
can be resource-intensive. 1 Mark
(c) Daily financial transactions recorded by the sales department
Data Structure: Structured Data
Explanation: Financial transactions are typically well-structured and stored in transactional
databases, with clear fields like transaction ID, date, amount, and customer ID.
Challenges:
i. Data Consistency: Ensuring data consistency across multiple branches or
departments.
ii. Real-time Processing: Processing financial data in real-time for monitoring and
fraud detection.
iii. Compliance: Adhering to financial regulations and auditing standards. 1 Mark

(d) Data exchanged between systems using XML files


Data Structure: Semi-structured Data
Explanation: XML data is semi-structured as it follows a loose schema defined by tags,
making it more flexible than fully structured data but more organized than unstructured
data.
Challenges:
i. Parsing Complexity: Parsing large XML files can be computationally expensive.
ii. Integration: Combining semi-structured XML data with structured databases or
unstructured sources may require complex transformations.
iii. Data Validation: Ensuring data accuracy and integrity when transferring data
between systems. 1 Mark
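
As an illustration of the parsing challenge in (d), here is a minimal sketch in Python using the standard-library xml.etree module. The customer record shown is a hypothetical example of the kind of data XYZ Corporation might exchange between systems, not taken from the case study.

import xml.etree.ElementTree as ET

# Hypothetical customer record exchanged between systems; the fields are illustrative only.
xml_doc = """
<customer id="501">
    <name>John Doe</name>
    <feedback>Great service</feedback>
</customer>
"""

# The tags act as a loose, self-describing schema, so the record can be parsed
# without a fixed relational layout.
root = ET.fromstring(xml_doc)
print(root.get("id"), root.findtext("name"), root.findtext("feedback"))

For very large XML files, an incremental parser such as ET.iterparse avoids loading the whole document into memory, which is one way to address the parsing-complexity challenge noted above.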

Q.2 Case Study:


ABC Financial Services, a leader in the financial sector, relies heavily on big data to enhance its
operations and customer service. The company focuses on the 5 V's of big data—Volume, Velocity,
Variety, Veracity, and Value—to manage and analyze the vast amounts of data generated daily.
Each of these aspects presents unique challenges that must be addressed to successfully leverage
big data for better decision-making and service delivery.
Discuss how ABC Financial Services can effectively manage the challenges associated with each
of the 5 V's of big data. Provide specific strategies or technologies that the company might employ
to overcome these challenges and maximize the value derived from its big data initiatives.
[4 marks]
Solution:
ABC Financial Services can effectively manage the challenges associated with the 5 V's of big data
by employing specific strategies and technologies:
1. Volume:
o Challenge: The large scale of data generated daily by financial transactions, customer
interactions, and market fluctuations is overwhelming for traditional systems.
o Strategy: Implement distributed storage systems like Hadoop Distributed File System
(HDFS) and cloud-based solutions (e.g., AWS S3) to store massive datasets. Leverage
NoSQL databases (e.g., Cassandra, MongoDB) to handle the scalability of data storage.
o Benefit: This ensures efficient storage and retrieval of vast datasets, enabling better
analysis without performance bottlenecks.
2. Velocity:
o Challenge: Financial data is generated at high speeds (real-time trading, transactions,
and customer interactions). Handling real-time data streaming is crucial to providing
timely insights and decisions.
o Strategy: Use stream processing platforms like Apache Kafka and Apache Flink to
process data in real time. Implement real-time analytics tools like Apache Spark for
instantaneous decision-making and fraud detection (a minimal streaming sketch follows
this answer).
o Benefit: These technologies allow ABC Financial to manage high-speed data
effectively and respond in real-time, improving customer service and fraud prevention.
3. Variety:
o Challenge: Financial data comes in different formats, such as structured data
(transaction records), semi-structured data (XML files), and unstructured data
(customer feedback from social media).
o Strategy: Utilize data integration platforms and tools like Apache NiFi or Talend to
ingest, process, and transform data from multiple formats into a unified system. For
unstructured data, deploy Natural Language Processing (NLP) tools and text mining
techniques to analyze social media data.
o Benefit: Managing multiple data types enables holistic analysis, including customer
sentiment analysis and pattern recognition from structured financial data.
4. Veracity:
o Challenge: Ensuring the accuracy and reliability of data is a critical issue in the
financial sector, as incorrect data can lead to poor decision-making.
o Strategy: Implement data quality management tools, such as Apache Griffin or Talend
Data Quality, to monitor and improve data accuracy. Use data validation techniques
and establish governance policies for data entry and integrity.
o Benefit: Enhanced data veracity improves trust in analytical outcomes, reducing errors
in financial reporting, risk assessment, and customer service.
5. Value:
o Challenge: Extracting actionable insights from big data is essential to gain a
competitive advantage, but it requires the right tools and expertise.
o Strategy: Use machine learning platforms like TensorFlow or H2O.ai to derive
predictive insights from data. Implement BI tools (e.g., Tableau, Power BI) to visualize
data and make it accessible to decision-makers.
o Benefit: Maximizing data value leads to better decision-making, personalized customer
experiences, and optimized operational efficiency.
1 X 4 = 4 Marks
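
To make the Velocity strategy concrete, the following is a minimal sketch of real-time processing with Spark Structured Streaming reading from Kafka, as mentioned in point 2 above. The topic name "transactions", the broker address, and the one-minute window are hypothetical, and the Kafka source additionally assumes the spark-sql-kafka connector package is available on the cluster.

from pyspark.sql import SparkSession
from pyspark.sql.functions import col, window

# Hypothetical sketch: count incoming transaction events per one-minute window.
spark = SparkSession.builder.appName("TxnVelocitySketch").getOrCreate()

# Read a stream from a Kafka topic (topic name and broker address are placeholders).
events = (spark.readStream.format("kafka")
          .option("kafka.bootstrap.servers", "broker:9092")
          .option("subscribe", "transactions")
          .load())

# Each Kafka record arrives with a binary value and a timestamp column.
counts = (events.selectExpr("CAST(value AS STRING) AS txn", "timestamp")
          .groupBy(window(col("timestamp"), "1 minute"))
          .count())

# Write running counts to the console; in production this could feed a fraud-detection sink.
query = (counts.writeStream
         .outputMode("update")
         .format("console")
         .start())
query.awaitTermination()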
Q.3 In a healthcare analytics project, a hospital is using big data analytics to improve patient outcomes.
The project involves four core components: Data Collection, Processing, Modeling, and Decision-
Making. Here are the details:
1. Data Collection: The hospital collects patient data from various sources, including
electronic health records (EHRs), wearable devices, and laboratory results. Each day, the
hospital collects 5 terabytes (TB) of data.
2. Processing: The processing system can handle 100 terabytes of data per month, efficiently
cleaning and transforming it for analysis.
3. Modeling: After processing, the data is used to build predictive models. Each model
requires 2 terabytes of processed data to train effectively. The hospital plans to train 15
models per month.
4. Decision-Making: The insights from the models are used for decision-making. On average,
each model generates 200 actionable insights.
Given these parameters, calculate the following:
a) How many days of data collection will the hospital need to fully utilize its monthly processing
capacity? [ 2Marks]
b) How much processed data is used per month for modeling? [ 2Marks]
c) How many actionable insights are generated per month based on the current modeling plan?
[ 2Marks]
Solution:
a) How many days of data collection will the hospital need to fully utilize its monthly
processing capacity?

Data collected per day = 5 TB


Processing capacity per month = 100 TB
To find the number of days required to collect 100 TB:

Days required = Processing capacity per month / Data collected per day
= 100 TB / 5 TB per day = 20 days
Answer: The hospital will need 20 days of data collection to fully utilize its monthly
processing capacity. 2 Marks

b) How much processed data is used per month for modeling?

Data required per model = 2 TB


Number of models trained per month = 15
Total data used for modeling per month:
Total data used = Data required per model × Number of models trained per month
= 2 TB × 15 = 30 TB
Answer: 30 TB of processed data is used per month for modeling. 2 Marks

c) How many actionable insights are generated per month based on the current modeling
plan?
Actionable insights per model = 200
Number of models trained per month = 15
Total actionable insights generated per month:
Total insights = Actionable insights per model × Number of models
= 200 × 15 = 3,000
Answer: 3,000 actionable insights are generated per month based on the current modeling
plan. 2 Marks
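
The three results above can be reproduced with a short calculation; this is just a sketch of the arithmetic, with all figures taken directly from the question.

data_per_day_tb = 5          # TB collected per day
capacity_tb = 100            # TB processed per month
data_per_model_tb = 2        # TB of processed data needed per model
models_per_month = 15
insights_per_model = 200

days_needed = capacity_tb / data_per_day_tb                  # (a) 20 days
modeling_data_tb = data_per_model_tb * models_per_month      # (b) 30 TB
insights_per_month = insights_per_model * models_per_month   # (c) 3,000 insights
print(days_needed, modeling_data_tb, insights_per_month)     # 20.0 30 3000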
Q.4 State whether the following statements are True or False with proper justification.
Answers without proper justification will not be given any marks. [ 3Marks]
a) In a MapReduce job, the 'Reduce' tasks are responsible for dividing the input data into
smaller chunks before the 'Map' tasks process them.
b) MapReduce lacks fault tolerance, so if a node fails, the entire job will fail without any task
reassignment.
c) YARN statically allocates resources to applications, which can lead to inefficient cluster
utilization and job execution.
Solution:
a) In a MapReduce job, the 'Reduce' tasks are responsible for dividing the input data into
smaller chunks before the 'Map' tasks process them.
• Answer: False
• Justification: In a MapReduce job, the 'Map' tasks are responsible for processing the input
data by dividing it into smaller chunks. The 'Reduce' tasks occur after the 'Map' phase, and
their job is to aggregate and summarize the intermediate output produced by the 'Map'
tasks. The division of input data into smaller chunks (splitting) is handled before the 'Map'
tasks execute, not by the 'Reduce' tasks. 1 Mark
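
To make the division of labour between the two phases concrete, here is a minimal word-count sketch written for Hadoop Streaming in Python; the file names mapper.py and reducer.py are illustrative. The framework splits the input and feeds records to the mapper, while the reducer only aggregates the sorted intermediate pairs.

# mapper.py (illustrative): reads records from its input split on stdin, emits (word, 1) pairs.
import sys

for line in sys.stdin:
    for word in line.strip().split():
        print(f"{word}\t1")

# reducer.py (illustrative): receives pairs sorted by key and aggregates the counts.
import sys

current_word, current_count = None, 0
for line in sys.stdin:
    word, count = line.rstrip("\n").split("\t")
    if word == current_word:
        current_count += int(count)
    else:
        if current_word is not None:
            print(f"{current_word}\t{current_count}")
        current_word, current_count = word, int(count)
if current_word is not None:
    print(f"{current_word}\t{current_count}")

On a cluster, these two scripts would typically be submitted through the Hadoop Streaming jar with the -mapper and -reducer options; the exact jar path depends on the installation.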
b) MapReduce lacks fault tolerance, so if a node fails, the entire job will fail without any task
reassignment.
• Answer: False
• Justification: MapReduce provides built-in fault tolerance. If a node fails during the
execution of a job, the task that was running on that node is automatically reassigned to
another available node. This ensures that a node failure does not result in the failure of the
entire job, and MapReduce can continue to execute by rerunning failed tasks on other nodes
in the cluster. 1 Mark
c) YARN statically allocates resources to applications, which can lead to inefficient cluster
utilization and job execution.
• Answer: False
• Justification: YARN (Yet Another Resource Negotiator) dynamically allocates resources
to applications based on their requirements and the availability of resources in the cluster.
This dynamic allocation enables efficient cluster utilization by adjusting resource
allocation as needed during job execution, ensuring that resources are used effectively and
jobs are completed in a timely manner. 1 Mark
Q.5 You are a Hadoop administrator at a large organization that has recently adopted Hadoop for storing
and analysing its big data. The organization is planning to conduct a data science competition among
its employees to encourage innovative data analysis ideas. As the Hadoop administrator, your task
is to set up the necessary infrastructure for the competition. The competition requires participants to
store their data on HDFS, run MapReduce jobs to process the data, and present their results to the
judges. You need to create a directory on HDFS for each participant, set appropriate permissions to
ensure privacy, and provide the necessary commands for uploading, downloading, and managing
the data.
Your task is to write commands for following:
a) Verify Hadoop version and Hadoop daemon components are active. [0.5Mark]
b) Create a new directory with RollNumber as name in the root directory of the HDFS file system.
[0.5Mark]
c) Create 3 different files named as sample.txt, sample2.csv, sample3.tsv [1Mark]
d) Uploads all created files from the local file system to HDFS. [1Mark]
e) Lists the contents of a directory in HDFS. [0.5Mark]
f) Downloads a file named sample3.tsv from HDFS to the local file system. [1Marks]
g) Removes a file named sample.txt from HDFS. [0.5Mark]
h) Change the permissions of a file named sample2.csv in HDFS. [1Marks]

Solution
a) Verify Hadoop version and Hadoop daemon components are active. [0.5Mark]
# Command to verify Hadoop version
hadoop version

# Command to verify that Hadoop daemons (NameNode, DataNode, etc.) are active
jps
b) Create a new directory with RollNumber as name in the root directory of the HDFS file system.
[0.5Mark]
hadoop fs -mkdir /RollNumber
c) Create 3 different files named as sample.txt, sample2.csv, sample3.tsv [1Mark]
touch sample.txt
touch sample2.csv
touch sample3.tsv
d) Uploads all created files from the local file system to HDFS. [1Mark]
hadoop fs -put sample.txt sample2.csv sample3.tsv /RollNumber/
e) Lists the contents of a directory in HDFS. [0.5Mark]
hadoop fs -ls /RollNumber
f) Downloads a file named sample3.tsv from HDFS to the local file system. [1Marks]
hadoop fs -get /RollNumber/sample3.tsv ./
g) Removes a file named sample.txt from HDFS. [0.5Mark]
hadoop fs -rm /RollNumber/sample.txt
h) Change the permissions of a file named sample2.csv in HDFS. [1Marks]
hadoop fs -chmod 700 /RollNumber/sample2.csv
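
Since the scenario calls for one private directory per participant, the individual commands above could also be scripted. The following is a minimal sketch in Python; the roll numbers are placeholders, and it assumes the hadoop client is on the PATH and that the script runs with sufficient HDFS privileges.

import subprocess

# Hypothetical list of participant roll numbers.
participants = ["2024A0001", "2024A0002", "2024A0003"]

for roll in participants:
    path = f"/{roll}"
    # Create the participant's directory in the HDFS root (-p: no error if it already exists).
    subprocess.run(["hadoop", "fs", "-mkdir", "-p", path], check=True)
    # Restrict the directory to its owner only, for privacy during the competition.
    subprocess.run(["hadoop", "fs", "-chmod", "700", path], check=True)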
Q.6 Scenario:
A bank named XYZ Bank wants to create a new table in their Hive data warehouse to store customer
transaction details. The table should include the following fields: transaction_id (INT), customer_id
(INT), transaction_amount (DOUBLE), and transaction_date (STRING). After creating the table,
the bank needs to insert the following transaction data into the table:
• Transaction 1: transaction_id = 101, customer_id = 501, transaction_amount = 1500.75,
transaction_date = '2024-09-01'
• Transaction 2: transaction_id = 102, customer_id = 502, transaction_amount = 2500.00,
transaction_date = '2024-09-02'
Write the HiveQL commands to:
a) Create the customer_transactions table with the specified fields. [2Marks]
b) Insert the two transactions into the customer_transactions table. [2Marks]

Solution
a) Create the customer_transactions table with the specified fields. [2Marks]
CREATE TABLE customer_transactions (
transaction_id INT,
customer_id INT,
transaction_amount DOUBLE,
transaction_date STRING );

b) Insert the two transactions into the customer_transactions table. [2Marks]


INSERT INTO customer_transactions VALUES
(101, 501, 1500.75, '2024-09-01'),
(102, 502, 2500.00, '2024-09-02');

Q.7 Scenario:
A university named BITS Pilani University maintains a MongoDB collection called students to store
information about their students. Each document in the collection contains the following fields:
student_id, name, major, and year_of_enrollment. The university wants to:
• Insert a new student record into the collection.
• Update the major of an existing student.
• Retrieve the details of a specific student by their student_id.
• Delete a student record from the collection.
Write the MongoDB commands to:
a) Insert a new student with student_id = 1001, name = "John Doe", major = "Computer Science",
and year_of_enrollment = 2022. [1.5Marks]
b) Update the major of the student with student_id = 1001 to "Data Science". [1.5Marks]

Solution:
a) Insert a new student with student_id = 1001, name = "John Doe", major =
"Computer Science", and year_of_enrollment = 2022.
[1.5Marks]
db.students.insertOne({
student_id: 1001,
name: "John Doe",
major: "Computer Science",
year_of_enrollment: 2022
});

b) Update the major of the student with student_id = 1001 to "Data Science".
[1.5Marks]
db.students.updateOne(
{ student_id: 1001 },
{ $set: { major: "Data Science" } } );

******
