0% found this document useful (0 votes)
170 views

Data Analytics Interview QnAs

This document contains interview questions and answers related to data analysis. It includes questions about SQL CASE statements, database relationships in SQL, using cycle fields in Tableau, the differences between functions and formulas in Excel, SQL injection, creating hyperlinks in Excel, what DAX is in Power BI, the differences between deep and shallow copy in Python, using sets and groups in Tableau, Power Pivot and Power Query in Excel, ways to improve performance in Tableau, what macros are in Excel, primary and unique keys in SQL, stacked column charts in Tableau, the split() and join() functions in Python, where data is stored in Power BI, data cleansing, affinity diagrams, questions to ask before creating a dashboard,

Uploaded by

Ahsan Ahmad Beg
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
170 views

Data Analytics Interview QnAs

This document contains interview questions and answers related to data analysis. It includes questions about SQL CASE statements, database relationships in SQL, using cycle fields in Tableau, the differences between functions and formulas in Excel, SQL injection, creating hyperlinks in Excel, what DAX is in Power BI, the differences between deep and shallow copy in Python, using sets and groups in Tableau, Power Pivot and Power Query in Excel, ways to improve performance in Tableau, what macros are in Excel, primary and unique keys in SQL, stacked column charts in Tableau, the split() and join() functions in Python, where data is stored in Power BI, data cleansing, affinity diagrams, questions to ask before creating a dashboard,

Uploaded by

Ahsan Ahmad Beg
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 21

50+ Important

Data Analyst
Interview
Questions &
Answers
(Save It Now)

www.cloudyml.com
1. What is the case when in SQL Server?

The CASE statement is used to construct logic in which one column’s value is
determined by the values of other columns.

At least one set of WHEN and THEN commands makes up the SQL Server
CASE Statement. The condition to be tested is specified by the WHEN
statement. If the WHEN condition returns TRUE, the THEN sentence explains
what to do.

When none of the WHEN conditions return true, the ELSE statement is
executed. The END keyword brings the CASE statement to a close.

2. What is a relationship in SQL and what are they?

Database Relationship is defined as the connection between the tables in a


database. There are various data base relationships, and they are as follows:.

One to One Relationship.

One to Many Relationship.

Many to One Relationship.

Self-Referencing Relationship.

3. What is the use of cycle fields in tableau?

Cycle fields help in switching and trying different colour combinations or views
in a cyclic order. It will work only if we have a chart that allows more than one
measure such as stacked bar chart and we are unable to finalize the
visualizations then we can use cycle fields. To use cycle field, click on

www.cloudyml.com
analysis menu in the toolbar then select cycle fields to take a quick look at an
alternative visualization.

4. What is the difference between a function and a formula in Excel?

A formula is a user-defined expression that calculates a value. A function is


pre-defined built-in operation that can take the specified number of
arguments. A user can create formulas that can be complex and can have
multiple functions in it. For example, =A1+A2 is a formula and =SUM(A1:A10)
is a function.

5. What is SQL Injection?

SQL injection is a sort of flaw in website and web app code that allows
attackers to take control of back-end processes and access, retrieve, and
delete sensitive data stored in databases. In this approach, malicious SQL
statements are entered into a database entry field, and the database
becomes exposed to an attacker once they are executed. By utilising
data-driven apps, this strategy is widely utilised to get access to sensitive
data and execute administrative tasks on databases.

6. How do you create a hyperlink in Excel?

Hyperlinks are used to navigate between worksheets and files/websites. To


create a hyperlink, the shortcut used is Ctrl+K.

The ‘Insert Hyperlink’ box appears. Enter the address and the text to display.

www.cloudyml.com
7. What is DAX in Power BI?

DAX stands for Data Analysis Expressions. It's a collection of functions,


operators, and constants used in formulas to calculate and return values. In
other words, it helps you create new info from data you already have.

8. What is the difference between deep and shallow copy in


python?

Shallow copy is used when a new instance type gets created and it keeps the
values that are copied in the new instance. Shallow copy is used to copy the
reference pointers just like it copies the values. Deep copy is used to store
the values that are already copied. Deep copy doesn’t copy the reference
pointers to the objects. It makes the reference to an object and the new object
that is pointed by some other object gets stored.

9. What are sets and groups in Tableau?

Sets and groups are used group data based on some specific conditions. The
main difference between these two is that a group can divide the dataset into
multiple groups whereas a set can have only two options which is either in or
out. A user should choose to apply group or sets based on the requirements.

10. What is Power Pivot & Power Query?

Power Pivot is an add-on provided by Microsoft for Excel since 2010. Power
Pivot was designed to extend the analytical capabilities and services of
Microsoft Excel.

www.cloudyml.com
Power Query is a business intelligence tool designed by Microsoft for Excel.
Power Query allows you to import data from various data sources and will
enable you to clean, transform and reshape your data as per the
requirements. Power Query allows you to write your query once and then run
it with a simple refresh.

11. State some ways to improve the performance of Tableau?

Use an Extract to make workbooks run faster

Reduce the scope of data to decrease the volume of data

Reduce the number of marks on the view to avoid information overload

Try to use integers or Booleans in calculations as they are much faster than
strings

Hide unused fields

Use Context filters

Reduce filter usage and use some alternative way to achieve the same result

Use indexing in tables and use the same fields for filtering

Remove unnecessary calculations and sheets.

www.cloudyml.com
SWITCH YOUR CAREER TO
DATA SCIENCE & ANALYTICS .
“Make your career in this fastest growing Data Science
industry without paying your lakh of rupees.”
Features of this course :-
✅Get Hands-on Practical Learning Experience
✅Topic Wise Structured Tutorial Videos
✅Guided Practice Assignments
✅Capstone End-to-End Projects
✅1-1 Doubt Clearance Support Everyday
✅One Month Internship Opportunity
✅Interview QnA PDF Collection
✅Course Completion Certificate
✅Lifetime Course Content Access
✅No Prior Coding Experience Required to Join
✅Resume Review Feature
✅Daily Interview QnA Mail Everyday
✅Job Opening Mail & More.

Visit Our Website & Enroll Today


Chosen & Trusted by 8000+ Happy Learners .

www.cloudyml.com
12. What is macro in excel?

Macro refers to an algorithm or a set of actions that help automate a task in


Excel by recording and playing back the steps taken to complete that task.

Once the steps are stored, you create a Macro, and it can be edited and
played back as many times as the user wants.

Macro is great for repetitive tasks and also eliminates errors. For example,
suppose an account manager has to share reports regarding the company
employees for non-payment of dues. In that case, it can be automated using
a Macro and doing minor changes every month, as needed.

13. What is the difference between primary key and unique key
in SQL?

Both primary and unique keys carry unique values but a primary key cannot
have a null value, while a unique key can. In a table, there cannot be more
than one primary key, but there can be multiple unique keys.

14. What is a Stacked Column Chart in Tableau?

Stacked Column Chart, composed of multiple bars stacked vertically, one on


another. The length of the bar depends on the value in the data point. A
stacked column chart is the best one to know the changes in all variables.
This type of chart should be checked when the number of series is higher
than two.

15. Explain split() and join() functions in Python?

www.cloudyml.com
You can use split() function to split a string based on a delimiter to a list of
strings.You can use join() function to join a list of strings based on a delimiter
to give a single string.

16. Where is the data stored in Power BI?

Primarily, PowerBI uses two repositories to store its data: Azure Blob Storage
and Azure SQL Database. Azure Blob Storage typically stores the data that is
uploaded by the users. Azure SQL Database stores all the metadata and
artifacts for the system itself.

17. What is the difference between primary key and unique key
in SQL?

Both primary and unique keys carry unique values but a primary key cannot
have a null value, while a unique key can. In a table, there cannot be more
than one primary key, but there can be multiple unique keys.

18. What is a Stacked Column Chart in Tableau?

Stacked Column Chart, composed of multiple bars stacked vertically, one on


another. The length of the bar depends on the value in the data point. A
stacked column chart is the best one to know the changes in all variables.
This type of chart should be checked when the number of series is higher
than two.

19. Explain split() and join() functions in Python?

You can use split() function to split a string based on a delimiter to a list of
strings.You can use join() function to join a list of strings based on a delimiter
to give a single string.

www.cloudyml.com
20. Where is the data stored in Power BI?

Primarily, PowerBI uses two repositories to store its data: Azure Blob Storage
and Azure SQL Database. Azure Blob Storage typically stores the data that is
uploaded by the users. Azure SQL Database stores all the metadata and
artifacts for the system itself.

21. Explain data cleansing.

Data cleaning, also known as data cleansing or data scrubbing or wrangling,


is basically a process of identifying and then modifying, replacing, or deleting
the incorrect, incomplete, inaccurate, irrelevant, or missing portions of the
data as the need arises. This fundamental element of data science ensures
data is correct, consistent, and usable.

22. What is an Affinity Diagram?

An Affinity Diagram is an analytical tool used to cluster or organize data into


subgroups based on their relationships. These data or ideas are mostly
generated from discussions or brainstorming sessions and are used in
analyzing complex issues.

23. Which questions should you ask the user/client before you
create a dashboard?

Though this depends on the user’s requirements, still some of the common
questions that I would ask the client before creating a dashboard are :

What is the purpose of the dashboard?Should the dashboard be retrospective


or real-time?How detailed the dashboard should be?How tech and
data-savvy is the end-user?Does the data need to be segmented?Should I
explain the dashboard design to you?

www.cloudyml.com
24. What is an Alias in SQL?

An alias is a feature of SQL that is supported by most, if not all, RDBMSs. It is


a temporary name assigned to the table or table column for the purpose of a
particular SQL query. In addition, aliasing can be employed as an confusion
technique to secure the real names of database fields. A table alias is also
called a correlation name.

An alias is represented explicitly by the AS keyword but in some cases, the


same can be performed without it as well.

25. What do Tableau's sets and groups mean?

Data is grouped using sets and groups according to predefined criteria. The
primary distinction between the two is that although a set can have only two
options—either in or out—a group can divide the dataset into several groups.
A user should decide which group or sets to apply based on the conditions.

26.What in Excel is a macro?

An Excel macro is an algorithm or a group of steps that helps automate an


operation by capturing and replaying the steps needed to finish it. Once the
steps have been saved, you may construct a Macro that the user can alter
and replay as often as they like.

Macro is excellent for routine work because it also gets rid of mistakes.
Consider the scenario when an account manager needs to share reports
about staff members who owe the company money. If so, it can be automated
by utilising a macro and making small adjustments each month as necessary.

27.Gantt chart in Tableau

www.cloudyml.com
A Tableau Gantt chart illustrates the duration of events as well as the
progression of value across the period. Along with the time axis, it has bars.
The Gantt chart is primarily used as a project management tool, with each bar
representing a project job.

28.In Microsoft Excel, how do you create a drop-down list?

Start by selecting the Data tab from the ribbon.

Select Data Validation from the Data Tools group.

Go to Settings > Allow > List next.

Choose the source you want to offer in the form of a list array.

29. What are the common problems that data analysts


encounter during analysis?

The common problems steps involved in any analytics project are:

Handling duplicate data

Collecting the meaningful right data at the right time

Handling data purging and storage problems

Making data secure and dealing with compliance issues

30. Explain the Type I and Type II errors in Statistics?

In Hypothesis testing, a Type I error occurs when the null hypothesis is


rejected even if it is true. It is also known as a false positive.

www.cloudyml.com
A Type II error occurs when the null hypothesis is not rejected, even if it is
false. It is also known as a false negative.

31. How do you make a dropdown list in MS Excel?

First, click on the Data tab that is present in the ribbon.

Under the Data Tools group, select Data Validation.

Then navigate to Settings > Allow > List.

Select the source you want to provide as a list array.

32. How do you subset or filter data in SQL?

To subset or filter data in SQL, we use WHERE and HAVING clauses which
give us an option of including only the data matching certain conditions.

33. What is a Gantt Chart in Tableau?

A Gantt chart in Tableau depicts the progress of value over the period, i.e., it
shows the duration of events. It consists of bars along with the time axis. The
Gantt chart is mostly used as a project management tool where each bar is a
measure of a task in the project

34. What are different types of Collation Sensitivity?

Following are the different types of Collation Sensitivity:

- Case sensitive: A and a, B and b

- Kana sensitive: Japanese Kana characters

- Width sensitive: single byte characters and double-byte characters.

www.cloudyml.com
- Accent Sensitive.

35. What is OLTP ?

OLTP or Online Transaction Processing is a type of data processing that


consists of executing a number of transactions occurring concurrently—online
banking, shopping, order entry, or sending text messages, for example.
These transactions traditionally are referred to as economic or financial
transactions, recorded and secured so that an enterprise can access the
information anytime for accounting or reporting purposes.

36. What is OLAP?

OLAP stands for On-Line Analytical Processing. OLAP is a classification of


software technology which authorizes analysts, managers, and executives to
gain insight into information through fast, consistent, interactive access in a
wide variety of possible views of data that has been transformed from raw
information to reflect the real dimensionality of the enterprise as understood
by the clients.

OLAP implement the multidimensional analysis of business information and


support the capability for complex estimations, trend analysis, and
sophisticated data modeling.

37. How OLAP Works?

Fundamentally, OLAP has a very simple concept. It pre-calculates most of the


queries that are typically very hard to execute over tabular databases, namely
aggregation, joining, and grouping. These queries are calculated during a
process that is usually called 'building' or 'processing' of the OLAP cube. This
process happens overnight, and by the time end users get to work - data will
have been updated.

www.cloudyml.com
38. Is indentation required in python?

Indentation is necessary for Python. It specifies a block of code. All code


within loops, classes, functions, etc is specified within an indented block. It is
usually done using four space characters. If your code is not indented
necessarily, it will not execute accurately and will throw errors as well.

39. What are Entities and Relationships?

Entity: An entity can be a real-world object that can be easily identifiable. For
example, in a college database, students, professors, workers, departments,
and projects can be referred to as entities.

Relationships: Relations or links between entities that have something to do


with each other. For example – The employee’s table in a company’s
database can be associated with the salary table in the same database.

40. What are Aggregate and Scalar functions?

An aggregate function performs operations on a collection of values to return


a single scalar value. Aggregate functions are often used with the GROUP
BY and HAVING clauses of the SELECT statement. A scalar function returns
a single value based on the input value.

41. What are Custom Visuals in Power BI?

Custom Visuals are like any other visualizations, generated using Power BI.
The only difference is that it develops the custom visuals using a custom
SDK. The languages like JQuery and JavaScript are used to create custom
visuals in Power BI

www.cloudyml.com
SWITCH YOUR CAREER TO
DATA SCIENCE & ANALYTICS .
“Make your career in this fastest growing Data Science
industry without paying your lakh of rupees.”
Features of this course :-
✅Get Hands-on Practical Learning Experience
✅Topic Wise Structured Tutorial Videos
✅Guided Practice Assignments
✅Capstone End-to-End Projects
✅1-1 Doubt Clearance Support Everyday
✅One Month Internship Opportunity
✅Interview QnA PDF Collection
✅Course Completion Certificate
✅Lifetime Course Content Access
✅No Prior Coding Experience Required to Join
✅Resume Review Feature
✅Daily Interview QnA Mail Everyday
✅Job Opening Mail & More.

Visit Our Website & Enroll Today


Chosen & Trusted by 8000+ Happy Learners .

www.cloudyml.com
42.What is the use of cycle fields in tableau?

Cycle fields help in switching and trying different colour combinations or views
in a cyclic order. It will work only if we have a chart that allows more than one
measure such as stacked bar chart and we are unable to finalize the
visualizations then we can use cycle fields. To use cycle field, click on
analysis menu in the toolbar then select cycle fields to take a quick look at an
alternative visualization.

43. How to use Power BI in excel?

To use Power BI in Excel, there is an Analyse in Excel option for every report
in the Power BI service. To use it, you will need to enable editing and enable
content for the report for the first time. So, what this option provides us is that
it gives us the underlying data set of our Power BI report. It comes as a data
connection in excel. And we get to play with the data in excel. It is up to us
how we analyze the same data, either through pivot tables, charts, etc. By
default, when the data is extracted in excel for any report- it gives a Pivot
table by default.

44. What are sets and groups in Tableau?

Sets and groups are used group data based on some specific conditions. The
main difference between these two is that a group can divide the dataset into
multiple groups whereas a set can have only two options which is either in or
out. A user should choose to apply group or sets based on the requirements.

www.cloudyml.com
45. What is the difference between a function and a formula in
Excel?

A formula is a user-defined expression that calculates a value. A function is


pre-defined built-in operation that can take the specified number of
arguments. A user can create formulas that can be complex and can have
multiple functions in it. For example, =A1+A2 is a formula and =SUM(A1:A10)
is a function.

46. What are the roles & responsibilities of a Power BI


developer?

The specific responsibilities that a Power BI Developer performs vary widely


based on the industry and the organization for which they work. A Power BI
developer should expect to encounter some or all of the roles and
responsibilities listed below:

Build Analysis Services reporting models.

Using Power BI desktop, build visual reports, dashboards, and KPI


scorecards.

Data analysis skills

In Power BI, use row-level data security and learn about application security
layer models.

On the Power BI desktop, run DAX queries.

Use the data set to do advanced level calculations.

Develop custom visuals for Power BI.

www.cloudyml.com
Integrate Power BI reports into other applications

Should be well-versed with secondary tools like Microsoft Azure, SQL data
warehouse, PolyBase, Visual Studio, and others.

47. What is Power BI Desktop?

Power BI Desktop is an open-source application designed and developed by


Microsoft. Power BI Desktop will allow users to connect to, transform, and
visualize your data with ease. Power BI Desktop lets users build visuals and
collections of visuals that can be shared as reports with your colleagues or
your clients in your organization.

48. What is Power Pivot?

Power Pivot is an add-on provided by Microsoft for Excel since 2010. Power
Pivot was designed to extend the analytical capabilities and services of
Microsoft Excel.

49. What is Power Query?

Power Query is a business intelligence tool designed by Microsoft for Excel.


Power Query allows you to import data from various data sources and will
enable you to clean, transform and reshape your data as per the
requirements. Power Query allows you to write your query once and then run
it with a simple refresh.

50. What are the roles & responsibilities of a Power BI


developer?

The specific responsibilities that a Power BI Developer performs vary widely


based on the industry and the organization for which they work. A Power BI

www.cloudyml.com
developer should expect to encounter some or all of the roles and
responsibilities listed below:

Build Analysis Services reporting models.

Using Power BI desktop, build visual reports, dashboards, and KPI


scorecards.

Data analysis skills

In Power BI, use row-level data security and learn about application security
layer models.

On the Power BI desktop, run DAX queries.

Use the data set to do advanced level calculations.

Develop custom visuals for Power BI.

Integrate Power BI reports into other applications

Should be well-versed with secondary tools like Microsoft Azure, SQL data
warehouse, PolyBase, Visual Studio, and others.

www.cloudyml.com
SWITCH YOUR CAREER TO
DATA SCIENCE & ANALYTICS .
“Make your career in this fastest growing Data Science
industry without paying your lakh of rupees.”
Features of this course :-
✅Get Hands-on Practical Learning Experience
✅Topic Wise Structured Tutorial Videos
✅Guided Practice Assignments
✅Capstone End-to-End Projects
✅1-1 Doubt Clearance Support Everyday
✅One Month Internship Opportunity
✅Interview QnA PDF Collection
✅Course Completion Certificate
✅Lifetime Course Content Access
✅No Prior Coding Experience Required to Join
✅Resume Review Feature
✅Daily Interview QnA Mail Everyday
✅Job Opening Mail & More.

Visit Our Website & Enroll Today


Chosen & Trusted by 8000+ Happy Learners .

www.cloudyml.com
51. What is Power BI Desktop?

Power BI Desktop is an open-source application designed and developed by


Microsoft. Power BI Desktop will allow users to connect to, transform, and
visualize your data with ease. Power BI Desktop lets users build visuals and
collections of visuals that can be shared as reports with your colleagues or
your clients in your organization.

52. What is Power Pivot?

Power Pivot is an add-on provided by Microsoft for Excel since 2010. Power
Pivot was designed to extend the analytical capabilities and services of
Microsoft Excel.

53. What is Power Query?

Power Query is a business intelligence tool designed by Microsoft for Excel.


Power Query allows you to import data from various data sources and will
enable you to clean, transform and reshape your data as per the
requirements. Power Query allows you to write your query once and then run
it with a simple refresh.

Visit Our Website &


Become Job-Ready
www.cloudyml.com

You might also like