First round
1. Introduction
2. Product table:
1. It has columns Product_id, Department, Date and Sales_Amount. Find yesterday's sales + today's sales for each product in each department (department-wise sales amount, today's + yesterday's sales, per product).
2. Rank the products within each Department ordered by Sales_Amount, and write the same query in PySpark. (A sketch for both parts follows this item.)
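A minimal PySpark sketch for both parts, assuming the data is registered as a table named product (hypothetical name):

    from pyspark.sql import SparkSession, functions as F
    from pyspark.sql.window import Window

    spark = SparkSession.builder.getOrCreate()
    product_df = spark.table("product")  # hypothetical table name

    # Part 1: today's + yesterday's sales per product per department.
    # lag() pulls the previous day's Sales_Amount within each
    # (Department, Product_id) partition ordered by Date.
    day_win = Window.partitionBy("Department", "Product_id").orderBy("Date")
    two_day = product_df.withColumn(
        "two_day_sales",
        F.col("Sales_Amount") + F.coalesce(F.lag("Sales_Amount").over(day_win), F.lit(0)),
    )

    # Part 2: rank products within each department by Sales_Amount (highest first).
    rank_win = Window.partitionBy("Department").orderBy(F.col("Sales_Amount").desc())
    ranked = product_df.withColumn("sales_rank", F.rank().over(rank_win))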
3. Lakehouse architecture
4. Unity Catalog (released five months back)
5. Consider you have one CSV file at an ADLS location. You are reading that CSV file, but it contains some bad records; how do you handle the bad records? (A sketch follows this item.)
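One common approach, sketched with a hypothetical path and schema: read in PERMISSIVE mode and route unparseable rows into a corrupt-record column (on Databricks, the badRecordsPath option is an alternative).

    from pyspark.sql import SparkSession, functions as F
    from pyspark.sql.types import StructType, StructField, StringType, DoubleType

    spark = SparkSession.builder.getOrCreate()

    schema = StructType([
        StructField("id", StringType()),
        StructField("amount", DoubleType()),
        StructField("_corrupt_record", StringType()),  # bad rows land here
    ])

    df = (spark.read
          .schema(schema)
          .option("mode", "PERMISSIVE")
          .option("columnNameOfCorruptRecord", "_corrupt_record")
          .csv("abfss://container@account.dfs.core.windows.net/path/file.csv"))

    df.cache()  # avoids the restriction on querying only the corrupt-record column
    bad_rows = df.filter(F.col("_corrupt_record").isNotNull())
    good_rows = df.filter(F.col("_corrupt_record").isNull()).drop("_corrupt_record")

    # Databricks-only alternative: dump bad rows to a side location instead.
    # spark.read.option("badRecordsPath", "abfss://.../badRecords").csv(...)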
6. You are reading a file with inferSchema enabled, then filtering, then grouping, and finally calling collect. Can you tell how many jobs and how many stages are created?
spark.read.csv("path of file", inferSchema=True).filter(some_condition).groupBy("column").count().collect()
(Note: groupBy returns GroupedData, so an aggregation such as count() is required before collect().)
7. Wide transformation and narrow transformation
8. Optimization techniques to reduce data shuffling in a single DataFrame
9. Difference between cache and persist
10. Storage levels available with persist
11. Difference between serialized and deserialized in-memory storage
12. What does memory serialization mean in the persist command? (A sketch covering questions 9-12 follows.)
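A minimal sketch of cache vs. persist and storage levels, using a toy DataFrame and only levels available in PySpark:

    from pyspark import StorageLevel
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.getOrCreate()
    df = spark.range(1_000_000)

    # cache() is shorthand for persist() with the default level
    # (MEMORY_AND_DISK for DataFrames).
    df.cache()
    df.unpersist()  # the level cannot be changed while one is assigned

    # persist() lets you choose the level explicitly:
    # MEMORY_ONLY     - keep partitions in memory, recompute what does not fit
    # MEMORY_AND_DISK - spill partitions that do not fit in memory to disk
    # DISK_ONLY       - keep partitions on disk only
    df.persist(StorageLevel.MEMORY_AND_DISK)

    # On serialization: in Scala/Java, the *_SER levels store partitions as
    # serialized bytes (more compact, but CPU-heavier to read back), while
    # plain levels store deserialized objects. In PySpark, data is always
    # serialized via pickle, so the serialized/deserialized distinction
    # matters less.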
13. Why is coalesce more efficient than repartition? (See the sketch below.)
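A small illustration of the difference:

    from pyspark.sql import SparkSession

    spark = SparkSession.builder.getOrCreate()
    df = spark.range(1_000_000)

    # repartition(n) performs a full shuffle and can scale the partition
    # count up or down; every row may move between executors.
    wide = df.repartition(200)

    # coalesce(n) merges existing partitions without a full shuffle, so it
    # is the cheaper choice when only reducing the partition count.
    narrow = wide.coalesce(10)

    print(wide.rdd.getNumPartitions(), narrow.rdd.getNumPartitions())  # 200 10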
14. Optimization techniques on a Delta table? (See the sketch below.)
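Two standard Delta maintenance commands, sketched against a hypothetical table name:

    from pyspark.sql import SparkSession

    spark = SparkSession.builder.getOrCreate()

    # OPTIMIZE compacts small files; ZORDER BY co-locates rows on a
    # frequently filtered column to reduce the files scanned.
    spark.sql("OPTIMIZE sales_delta ZORDER BY (Department)")

    # VACUUM removes files no longer referenced by the table
    # (default retention: 7 days).
    spark.sql("VACUUM sales_delta")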
15. Suppose you have a table that is updated many times in a week. Can I get Monday's copy of this data, i.e. what my data looked like on Monday? Write a simple command for the above scenario. (A sketch follows.)
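Delta time travel covers this; the table name and timestamp below are hypothetical:

    from pyspark.sql import SparkSession

    spark = SparkSession.builder.getOrCreate()

    # Read the table as it was at a point in time...
    monday_df = spark.sql(
        "SELECT * FROM sales_delta TIMESTAMP AS OF '2024-01-01T00:00:00'"
    )
    # ...or as of a specific version from DESCRIBE HISTORY:
    # spark.sql("SELECT * FROM sales_delta VERSION AS OF 5")
    # Path-based equivalent:
    # spark.read.format("delta").option("timestampAsOf", "2024-01-01").load("/mnt/delta/sales")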
16. When we create a Delta table, a folder named _delta_log is created in ADLS. Inside the delta log there are files; what do those files refer to?
17. Inside the delta log there are two kinds of files, one .crc and one .json. What does each file specify? Which one is the log file, and what is the .crc file?
18. What is Medallion architecture?
19. Let's assume an SAP database resides on your virtual machine. You want to connect it to ADLS; which type of Integration Runtime (IR) is needed?
20. Suppose you are fetching data from the SAP database, you get a memory error, and your pipeline fails. How can you handle this error?
21. Can you give a brief introduction about yourself?
22. What are the challenges you faced in your project?
23. Did you face any metadata issues?
24. Consider a company table having different columns such as company, sales, etc. Write a query to get company-wise total sales.
25. The same query in PySpark? (A sketch of both follows.)
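A minimal sketch of both versions, assuming the table is registered as company (hypothetical name):

    from pyspark.sql import SparkSession, functions as F

    spark = SparkSession.builder.getOrCreate()

    # SQL version (question 24):
    totals_sql = spark.sql(
        "SELECT company, SUM(sales) AS total_sales FROM company GROUP BY company"
    )

    # DataFrame version (question 25):
    totals_df = (spark.table("company")
                 .groupBy("company")
                 .agg(F.sum("sales").alias("total_sales")))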
26. What is the difference between TRUNCATE, DELETE and DROP?
27. How can you handle triggers in SQL?
28. Consider 10 tables of data. How can you move those 10 tables of data from on-premises to the cloud?
29. Integration Runtime
30. How can you handle email notifications for any success or failure of a pipeline?
31. Explain the flow for handling the email notification, and which type of activity did you use to handle it?
32. How can you handle an incremental load of data from the source? (A watermark-style sketch follows.)
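One common pattern is a high-watermark column; the table and column names below are hypothetical:

    from pyspark.sql import SparkSession, functions as F

    spark = SparkSession.builder.getOrCreate()

    # Highest last_modified value already loaded into the target.
    last_wm = (spark.table("target_orders")
               .agg(F.max("last_modified").alias("wm"))
               .first()["wm"])

    incremental = spark.table("source_orders")
    if last_wm is not None:
        # Keep only rows newer than the watermark.
        incremental = incremental.filter(F.col("last_modified") > F.lit(last_wm))

    # Append just the new/changed rows to the target Delta table.
    incremental.write.format("delta").mode("append").saveAsTable("target_orders")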
33. How can you create a mount to ADLS? (A sketch follows.)
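A Databricks notebook sketch using a service principal; every ID, secret scope and name below is a hypothetical placeholder:

    # dbutils is available inside Databricks notebooks.
    configs = {
        "fs.azure.account.auth.type": "OAuth",
        "fs.azure.account.oauth.provider.type":
            "org.apache.hadoop.fs.azurebfs.oauth2.ClientCredsTokenProvider",
        "fs.azure.account.oauth2.client.id": "<application-id>",
        "fs.azure.account.oauth2.client.secret":
            dbutils.secrets.get(scope="my-scope", key="sp-secret"),
        "fs.azure.account.oauth2.client.endpoint":
            "https://login.microsoftonline.com/<tenant-id>/oauth2/token",
    }

    dbutils.fs.mount(
        source="abfss://container@storageaccount.dfs.core.windows.net/",
        mount_point="/mnt/adls",
        extra_configs=configs,
    )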
34. Normal query vs. stored procedure
35. What types of distribution methods / stored procedures have you used in Synapse?
36. What are facts and dimensions?
37. Have you handled fact and dimension tables?
38. Can we join 2 fact tables?
39. Star schema vs. snowflake schema
40. How do you handle data validation?
41. Which data validation techniques did you use in your project?
42. Any questions from your end?

Second round
1. We have 50 tables in an on-premises DB and want to copy them into Azure Blob Storage. How many pipelines do we need for that scenario?
2. Suppose that out of the 50 tables, 10 have finished copying. How can you copy the remaining tables after those 10 complete?
3. Delta table vs. Parquet table: similarities and differences
4. How do you call a child notebook from a parent notebook, and how can you pass parameters to the child notebook? (A sketch follows.)
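A Databricks sketch; the notebook path and parameter names are hypothetical:

    # Parent notebook: dbutils.notebook.run executes the child and blocks
    # until it returns the value passed to dbutils.notebook.exit.
    result = dbutils.notebook.run(
        "/Workspace/child_notebook",  # path to the child notebook
        600,                          # timeout in seconds
        {"load_date": "2024-01-01"},  # parameters passed to the child
    )

    # Child notebook side:
    # dbutils.widgets.text("load_date", "")          # declare the widget
    # load_date = dbutils.widgets.get("load_date")   # read the passed value
    # dbutils.notebook.exit("done " + load_date)     # return a value to the parent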
5. Optimization techniques you used in Databricks
6. Driver node and executor node details; the purpose of each node
7. Does a worker node contain multiple executors?
8. Explain SCD (Slowly Changing Dimensions)
9. How do you handle incremental data?
10. Deep dive into Databricks
