Ba Lab Record
Ba Lab Record
DEPARTMENT OF
ARTIFICIAL INTELLIGENCE AND DATA SCIENCE
CCW331
BUSINESS ANALYTICS LABORATORY
NAME :
REGISTER NO :
BRANCH :
SEM/YEAR :
KARPAGA VINAYAGA
COLLEGE ENGINEERING & TECHNOLOGY
Chinnakolambakkam, Madurantakam Tk, 603308
Name ………………………………………………………………………………………………………………………………………………….
Certified that this is a bonafide record of work done by the above student in the ………………………………
………………………………………………………………………………… Laboratory during the year ………………………………
________________________ ______________________________
_________________________ __________________________
4 18-03-25
Perform Z-test, T-test & ANOVA
21 - 29
5 18-03-25
Perform data pre-processing operations
30 - 34
i) Handling Missing data
ii) Normalization
AIM:
To explore the features of MS-EXCEL
PROCEDURE:
Features of MS-EXCEL
Quick Access Toolbar – Collection of buttons that provide one click access to commonly
used
commands such as Save, Undo or Redo. You can also customize this according to your
preference.
Title Bar – A bar the display the name of active workbook
Ribbon – The main set of commands and controls organized task in Tabs and groups, you can
also customize the ribbon according to your preference.
Column Headings – The letters that appear along the top of the worksheet to identify the
different columns in the worksheet.
Worksheet Window – A window that displays an Excel worksheet, basically this is where
you
work all the tasks.
Vertical Scroll Bar – Scroll bar to use when you want to scroll vertically through the
Worksheet window.
Horizontal Scroll Bar – Scroll bar to use when you want to scroll horizontally through the
worksheet window.
Zoom Controls – Used for magnifying and shrinking of the active worksheet.
View Shortcuts – Buttons used to change how the worksheet content is displayed. Normal,
Page Layout or Page Break Preview.
Sheet Tabs – Tabs the display the name of the worksheet in the workbook, by default its
name
3
sheet 1, sheet 2, etc. You can rename this to any name the best represent to your sheet.
Sheet Tab Scrolling Buttons – Buttons to scroll the sheet tabs in the workbook
Row Headings – The number that appears on the left of the worksheet window to identify the
Different rows.
Select All Button – A button that selects all the cells in the active worksheet
Active Cell – The cell selected in the active worksheet
Name Box – A box that displays the cell reference of the active cell
Formula Bar – A bar that displays the value or formula entered in the active cell
Office Button/File Tab – It provides access to workbook level features and program settings.
You will notice that in Excel 2007 there is a circle
Features of Ms-Excel
Microsoft excel is an integrated electronic spread sheet program developed by Microsoft
corporation.It includes the following features.
Autocalc: This feature is very useful to sum a group of numbers is selected them. Their sum
will automatically appear in the status area.
Auto complete: Excel now intelligence to anticipate what you are going to type! Based upon
entries you’ve already made, AutoComplete will try to figure out what you intended to
type,
once you’ve entered few letters.
Autocorrect: Excel can support automatically correct mistakes.
Better Drag-and-Drop: Do you want to move a group of cells? Excel’s drag and drop
feature lets
you reposition selected portion of your spreadsheet by simply dragging them with your
mouse.
Cell tips and Scroll Tips: To help you get around better with mouse, Excel now includes
scroll
tips. When you click and drag a scroll bar, a small window tells you what row or column you are
heading for.
Number Formatting: It’s easy to format numbers with excel’s new number formatting
feature.
Select your numbers and choose cells command from format menu.
Templates and Template wizard: Excel’s template facility has been greatly enhanced. You
can
choose from a variety of elegantly designed templates for your home or business. You can
even
have a template wizard link your worksheets to a database.
Shared Lists: you can now have worksheets that are shared simultaneously over a network.
Conditional Formatting:Conditional formatting helps users to quickly focus on important
aspects of a spreadsheet or to highlight errors and to identify important patterns in data.
4
Sorting and Filtering: Excel spreadsheets help us make sense of large amounts of data. To
make it easier to find what you need, you can reorder the data or pick out just the data you
need,
based on parameters you set within Excel. Sorting and filtering your data will save you time
and make your spreadsheet more effective.
Excel Charts: Excel charts help you communicate insights & information with ease. By
choosing your charts wisely and formatting them cleanly, you can convey a lot.
ENTERING AND EDITING DATA IN WORKSHEET
You have several options when you want to enter data manually in Excel. You can enter data
in
one cell, in several cells at the same time, or on more than one worksheet at the same time.
The
data that you enter can be numbers, text, dates, or times. You can format the data in a variety
of
ways. And, there are several settings that you can adjust to make data entry easier for you.
Enter text or a number in a cell
1. On the worksheet, click a cell.
2. Type the numbers or text that you want to enter, and then press Enter or Tab.
To enter data on a new line within a cell, enter a line break by pressing
Alt+Enter Editing text or a number in a cell
1. Double click the cell containing the data you want to edit.
2. Make any changes to the cell contents.
3. Press enter key. The change will accept. To cancel your changes, press Ese key.
Change the width of a column
a. Click the cell for which you want to change the column width.
b. On the Home tab, in the Cells group, click Format
For Example, in the above worksheet, the formula = B5+C5+D+ adds the contents
10+20+30and produce the results. One can enter and edit formula in two ways.
1. Directly into cell by double clicking where the formula wants.
2. At formula bar after selection of required cell.
To edit an existing formula
Click on the cell which contains the formula or results
Click in formula bar make necessary changes.
Press enter key or click on check mark.
6
RESULT:
The features of MS-EXCEL explored successfully and displayed desired output
in neat format.
8
AIM:
To implement numerical operations using MS-EXCEL.
PROCEDURE:
Step 3: Calculate the Maximum of the given marks using max function.
Step 4: Calculate the Minimum of the given marks using MIN function.
Step 5: Calculate the average of the given marks using average function.
Step 6: Calculate the sum of the given marks using sum function.
Step 7: Calculate the square root of the given mark using SQRT function
.
Step 8: Calculate the Round of the given mark using Roundup function.
Step 9: Display the desired output of all numerical operation in neat format.
Step 10: Save the excel file and Close the Ms Excel application
9
OUTPUT:
RESULT:
The numerical operations were implemented using MS-EXCEL successfully and
the desired output was displayed.
12
AIM:
To perform data import/export operations for different file formats using MS-EXCEL.
PROCEDURE:
Step 4: Export the file into CSV file using file menu and export option.
Step 5: Next, import CSV file using data menu and get data option.
Step 7: Save the excel file and Close the Ms Excel application
13
When you have data that needs to be transferred to another system, export it from Excel in a
format that can be interpreted by other programs, such as a text or CSV file.
1. Click the File tab.
A dialog box appears stating that some of the workbook features may be lost.
8. Click Yes.
14
Import Data
Excel can import data from external data sources including other files, databases, or web
pages.
1. Click the Data tab on the Ribbon..
2. Click the Get Data button.
Some data sources may require special security access, and the connection process can
often be very complex. Enlist the help of your organization’s technical support staff for
assistance.
3. Select From File.
4. Select From Text/CSV.
15
If you have data to import from Access, the web, or another source, select one of those
options in the Get External Data group instead.
5. Select the file you want to import.
6. Click Import.
If, while importing external data, a security notice appears saying that it is connecting
to an external source that may not be safe, click OK.
Because we've specified the data is separated by commas, the delimiter is already set. If
you need to change it, it can be done from this menu.
2. Click Load.
16
RESULT:
The data import/export operations for different file formats were preformed successfully
using MS-EXCEL.
17
AIM:
To Perform statistical operations using MS-EXCEL.
PROCEDURE:
Step 3: If you haven't already installed the Analysis ToolPak , Click the Microsoft
Office button, then click on the Excel Options , and then select Add-Ins ,
Click Go, check the Analysis ToolPak box, and click Ok
Step 4: Select Data tab, then click on the Data Analysis option, then selects
Descriptive Statistics from the list and Click Ok. [Data tab >> Data
Analysis >> Descriptive Statistics]
Step 5: In the Input Range we select the data, and then select Output Range where
you want the output to be stored. If you don’t specify the output range it
will throw output in the new worksheet.
Step 6: Check Summary Statistics and Confidence Level for Mean options. By
default the confidence level is 95%. You can change the level as per the
hypothesis standard of study.
Step 7: When you click Ok, you will see the result in the selected output range.
Step 8: Save the excel file and Close the Ms Excel application.
18
OUTPUT:
RESULT:
The statistical operations were performed successfully using MS-EXCEL
and the desired output was displayed in neat format.
21
AIM:
To Perform Z-test operations using MS-EXCEL.
PROCEDURE:
Step 3: If you haven't already installed the Analysis ToolPak , Click the Microsoft
Office button, then click on the Excel Options , and then select Add-Ins ,
Click Go, check the Analysis ToolPak box, and click Ok
Step 4: Select Data tab, then click on the Data Analysis option, then selects
Descriptive Statistics from the list and Click Ok. [Data tab >> Data
Analysis >>z-test two sample means]
Step 5: In the Input Range we select range of the data for variable 1 and variable 2
and Give variable 1 and variable 2 value as 0.5. then select Output Range
where you want the output to be stored. If you don’t specify the output range
it will throw output in the new worksheet.
Step 6: Then select Output Range where you want the output to be stored. If you
don’t specify the output range it will throw output in the new worksheet.
Step 7: When you click Ok, you will see the result in the selected output range.
Step 8: Save the excel file and Close the Ms Excel application.
22
23
OUTPUT:
RESULT:
The Z-test operation was performed successfully using MS-EXCEL and the
desired output was displayed in neat format.
24
AIM:
To Perform T-test operations using MS-EXCEL.
PROCEDURE:
Step 3: If you haven't already installed the Analysis ToolPak , Click the Microsoft
Office button, then click on the Excel Options , and then select Add-Ins ,
Click Go, check the Analysis ToolPak box, and click Ok
Step 4: Select Data tab, then click on the Data Analysis option, then selects
Descriptive Statistics from the list and Click Ok. [Data tab >> Data
Analysis >> T-test Paired two sample for means]
Step 5: In the Input Range we select range of the data for variable 1 and variable
2 and Give alpha value as 0.05. then select Output Range where you want
the output to be stored. If you don’t specify the output range it will throw
output in the new worksheet.
Step 6: Then select Output Range where you want the output to be stored. If you
don’t specify the output range it will throw output in the new worksheet.
Step 7: When you click Ok, you will see the result in the selected output range.
Step 8: Save the excel file and Close the Ms Excel application.
25
26
OUTPUT:
RESULT:
The T-test operation was performed successfully using MS-EXCEL and the
desired output was displayed in neat format.
27
AIM:
To Perform ANOVA operations using MS-EXCEL.
PROCEDURE:
Step 3: If you haven't already installed the Analysis ToolPak , Click the Microsoft
Office button, then click on the Excel Options , and then select Add-Ins ,
Click Go, check the Analysis ToolPak box, and click Ok
Step 4: Select Data tab, then click on the Data Analysis option, then selects
Descriptive Statistics from the list and Click Ok. [Data tab >> Data
Analysis >> Anova : Single factor ]
Step 5: In the Input Range we select range of the data and Give alpha value as 0.05.
then select Output Range where you want the output to be stored. If you
don’t specify the output range it will throw output in the new worksheet.
Step 6: Then select Output Range where you want the output to be stored. If you
don’t specify the output range it will throw output in the new worksheet.
Step 7: When you click Ok, you will see the result in the selected output range.
Step 8: Save the excel file and Close the Ms Excel application.
28
29
OUTPUT:
RESULT:
The ANOVA operations was performed successfully using MS-EXCEL and the
desired output was displayed in neat format.
30
AIM:
To handle the missing data in data pre-processing operations on the dataset using MS-EXCEL.
PROCEDURE:
Step 3: If you haven't already installed the PrimaXL Addin, install it. Click the
PrimaXL tab , choose missing
Step 4: In the Input Range we select marks of all subjects with missing values and
select the Choice as “filling of the missing data by taking average” or ”
filling of the missing data by random pick”.
Step 5: Then select Output Range where you want the output to be stored. If you don’t
specify the output range it will throw output in the new worksheet.
Step 6: Then select Output Range where you want the output to be stored. If you
don’t specify the output range it will throw output in the new worksheet.
Step 7: When you click Ok, you will see the result in the selected output range.
Step 8: Save the excel file and Close the Ms Excel application.
31
32
OUTPUT:
RESULT:
The missing data on dataset was handled successfully using MS-EXCEL
and the desired output was displayed in neat format.
33
AIM:
To normalize in the given dataset using MS-EXCEL.
It is the process of scaling data in such a way that all data points lie in a range of 0 to 1.
Thus, this technique, makes it possible to bring all data points to a common scale. The
mathematical formula for normalization is given as:
where X is the data point, Xmax and Xmin are the maximum and minimum value in the group of
records respectively. The process of normalization is generally used when the distribution of
data does not follow the Gaussian distribution.
PROCEDURE:
Step 5: Apply the normalization formula using maximum value, minimum value and
difference value.
Step 8: Save the excel file and Close the Ms Excel application.
34
OUTPUT:
RESULT:
The given dataset was normalized using MS-EXCEL and the desired
output was displayed in neat format.
35
EX NO: 6 Perform dimensionality reduction operation using PCA, KPCA & SVD
DATE: 18-03-25
AIM:
To Perform dimensionality reduction operation using PCA, KPCA & SVD
PROCEDURE:
Step 5: In the Outputs tab, activate the option to display significant correlations in
bold characters (Test significancy).
Step 6: In the Charts tab, in order to display the labels on all charts, and to display all
the observations (observations charts and biplots), uncheck the filtering
option.
Step 7: If there is a lot of data, displaying the labels might slow down the global
display of the results. Displaying all the observations might make the results
unreadable. In these cases, filtering the observations to display is
recommended
Step 7: Save the excel file and Close the Ms Excel application.
36
OUTPUT:
RESULT:
The given dataset was performed dimensionality reduction operation using
PCA, KPCA & SVD and the desired output was displayed in neat format.
37
AIM:
To Perform bivariate analysis on the dataset using MS-EXCEL.
PROCEDURE:
Step 3: If you haven't already installed the Analysis ToolPak , Click the Microsoft
Office button, then click on the Excel Options , and then select Add-Ins ,
Click Go, check the Analysis ToolPak box, and click Ok
Step 4: Select Data tab, then click on the Data Analysis option, then selects
Descriptive Statistics from the list and Click Ok. [Data tab >> Data
Analysis >>]
Step 5: In the Input Range we select quantity as x range and discount as y range
then select Output Range where you want the output to be stored. If you don’t
specify the output range it will throw output in the new worksheet.
Step 6: Then select Output Range where you want the output to be stored. If you
don’t specify the output range it will throw output in the new worksheet.
Step 7: When you click Ok, you will see the result in the selected output range.
Step 8: Save the excel file and Close the Ms Excel application.
38
39
OUTPUT:
RESULT:
The bivariate analysis on dataset was performed successfully using MS-EXCEL and the desired output
was displayed in neat format.
40
AIM:
To Perform multivariate analysis on the dataset using MS-EXCEL.
PROCEDURE:
Step 3: If you haven't already installed the Analysis ToolPak , Click the Microsoft
Office button, then click on the Excel Options , and then select Add-Ins ,
Click Go, check the Analysis ToolPak box, and click Ok
Step 4: Select Data tab, then click on the Data Analysis option, then selects
Descriptive Statistics from the list and Click Ok. [Data tab >> Data
Analysis >>]
Step 5: In the Input Range we Select quantity as x range and discount and profit as y
range , then select Output Range where you want the output to be stored. If you
don’t specify the output range it will throw output in the new worksheet.
Step 6: Then select Output Range where you want the output to be stored. If you
don’t specify the output range it will throw output in the new worksheet.
Step 7: When you click Ok, you will see the result in the selected output range.
Step 8: Save the excel file and Close the Ms Excel application.
41
42
OUTPUT:
RESULT:
The multivariate analysis on the dataset was performed successfully using MS-
EXCEL and the desired output was displayed in neat format.
43
AIM:
To apply and explore various plotting functions on the data set using MS-EXCEL.
PROCEDURE:
Step 5: Choose any chart from the list of charts Excel recommends for your data on the
Recommended Charts tab, and click it to preview how it will look with your data.
Step 6: lick on All Charts if you are unable to locate a chart you like.
Step 7: Click on the chart that you prefer and then click OK.
Step 8: Add chart elements such as axis titles or data labels, customize the appearance of
the chart, or change the data displayed in the chart by clicking on Chart Elements,
Chart Styles, and Chart Filters in the upper-right corner of the chart.
Step 9: Click on the chart TOOLS tab on the ribbon to add additional design and formatting
capabilities and then click the options you desire under the DESIGN and FORMAT
tabs.
Step 10: When you click Ok, you will see the result in the selected output range.
Step 11: Save the excel file and Close the Ms Excel application.
44
OUTPUT:
RESULT:
The plotting functions was applied and explored on the data set
successfully using MS-EXCEL and the desired output was displayed in neat
format.
45
AIM:
To explore the features of Power BI Desktop
PROCEDURE:
Step 4: Create visuals, such as charts or graphs that provide visual representations of
the data.
Step 5: Create reports that are collections of visuals on one or more report pages.
Important
Power BI Desktop is updated and released on a monthly basis, incorporating customer
feedback and new features. Only the most recent version of Power BI Desktop is supported;
customers who contact support for Power BI Desktop will be asked to upgrade to the
mostrecent version. You can get the most recent version of Power BI Desktop from the
Windows Store, or as a single executable containing all supported languages that you
download and install on your computer.
There are three views available in Power BI Desktop, which you select on the left side of the
canvas. The views, shown in the order they appear, are as follows:
• Report: You create reports and visuals, where most of your creation time is spent.
• Data: You see the tables,measures, and other data used in the data model associated with
your report, and transform the data for best use in the report's model.
• Model: You see and manage the relationships among tables in your data model.
47
The following image shows the three views, as displayed along the left side of the canvas:
Connect to data
To get started with Power BI Desktop, the first step is to connect to data. There are many
different data sources you can connect to from Power BI Desktop.
To connect to data:
2. When you select a data type, you're prompted for information, such as the URL and
credentials, necessary for Power BI Desktop to connect to the data source on your behalf.
48
1. After you connect to one or more data sources, you may want to transform the data
so it's useful for you.
In Power BI Desktop, you can clean and transform data using the built-in Power Query
Editor.
With Power Query Editor, you make changes to your data, such as changing a data type,
removing columns, or combining data from multiple sources. It's like sculpting: you start
with a
large block of clay (or data), then shave off pieces or add others as needed, until the shape of
the
data is how you want it.
Each step you take in transforming data (such as renaming a table, transforming a data type,
or deleting a column) is recorded by Power Query Editor. Every time this query connects to
the data source, those steps are carried out so that the data is always shaped the way you
specify.
The following image shows the Power Query Editor window for a query that was shaped,
and turned into a model.
Once your data is how you want it, you can create visuals.
Create visuals
After you have a data model, you can drag fields onto the report canvas to create visuals. A
visual is a graphic representation of the data in your model. There are many different types of
visuals to choose from in Power BI Desktop. The following visual shows a simple column
chart.
50
If you already have a visual selected on the report canvas, the selected visual changes to the
type you selected.
If no visual is selected on the canvas, a new visual is created based on your selection.
Create reports
More often, you'll want to create a collection of visuals that show various aspects of the data
you've used to create your model in Power BI Desktop. A collection of visuals, in one Power
BI Desktop file, is called a report. A report can have one or more pages, just like an Excel
file can have one or more worksheets.
With Power BI Desktop you can create complex and visually rich reports, using data from
multiple sources, all in one report that you can share with others in your organization.
In the following image, you see the first page of a Power BI Desktop report, named
Overview, as seen on the tab near the bottom of the image.
51
Share reports
After a report is ready to share with others, you can publish the report to the Power BI
service, and make it available to anyone in your organization who has a Power BI license.
Power BI Desktop connects you to the Power BI service with your Power BI
account.
2. You're prompted to select where in the Power BI service you'd like to share the
report. For example, your workspace, a team workspace, or some other location
in the Power BI service.
Following are some of the features of Power BI –
1. Power BI Desktop
Power BI Desktop is free software that you can download and install, and it allows
you to build reports by accessing data easily. For using Power BI desktop, you do
not need advanced report designing, or query skills to build a report.
2. Stream Analytics
Power BI's primary advantage is its support stream analytics. From factory sensors
to social media sources, Power BI assists in real-time analytics to make timely
decisions.
3. Multiple Data Sources
Support for various data sources is one of the vital features of Power BI. You can
access various sources of data such as Excel, CSV, SQL Server, Web files, etc. to
create interactive visualizations.
4. Custom Visualization
While dealing with complex data, Power BI's default standard might not be enough
in some cases. In that case, you can access the custom library of visualization that
meets your needs.
RESULT:
The features of Power BI explored successfully and displayed desired output in
neat format.
52
AIM:
To Prepare & Load data in Power BI Desktop
PROCEDURE:
Step 2: Goto to home menu , select get data and then choose excel worksheet
Step 3: Select your file to load into Power Bi application and press load button
Step 4: Now the file is loaded and now click data view in the left panel
OUTPUT:
RESULT:
The Excel data sheet was prepared and loaded successfully into Power BI Desktop and
displayed desired output in neat format.
55
AIM:
To Develop the data model using Power BI Desktop
PROCEDURE:
Step 2: Goto to home menu, select get data and then choose excel worksheet
Step 3: Select your file to load into Power Bi application and press load button
Step 4: Now the file is loaded and now click model view in the left panel
Step 5: Now data modeling of the given tables are displayed in output screen
OUTPUT:
RESULT:
The data model was developed and loaded successfully using Power BI Desktop and
displayed desired output in neat format.
57
AIM:
To perform DAX calculations using Power BI Desktop
PROCEDURE:
Step 2: Goto to home menu, select get data and then choose excel worksheet
Step 3: Select your file to load into Power Bi application and press load button
Step 4: Now the file is loaded and now click report view in the left panel
Step 5: Goto data panel and right click on data file and choose new measure
Step 6: Type the formula for new measure and click the new measure “ was added
OUTPUT:
RESULT:
The perform DAX calculations were performed successfully using Power BI Desktop and
displayed desired output in neat format.
61
AIM:
To using Power BI Desktop
PROCEDURE:
Step 2: Goto to home menu, select get data and then choose excel worksheet
Step 3: Select your file to load into Power Bi application and press load button
Step 4: Now the file is loaded and now click report view in the left panel
Step 5: Goto data panel and right click on data file and choose new measure
Step 6: Type the formula for new measure and click the new measure “was added
Step 7: Select clustered column chat and copy the visualization in workspace.
OUTPUT:
RESULT:
The report was created successfully using Power BI Desktop and displayed desired
output in neat format.
64
AIM:
To create of a dashboard and perform data analysis using Power BI Desktop
PROCEDURE:
Step 2: Goto to home menu, select get data and then choose excel worksheet
Step 3: Select your file to load into Power Bi application and press load button
Step 4: Now the file is loaded and now click report view in the left panel
Step 5: Goto data panel and right click on data file and choose team filed and slicer
visualization
Step 6: Now teamwise slicer was displayed. Next copy the visualization and change
to card visualization
Step 8: Next copy the visualization and change to donut visualization and drag
gender field and change value filed as count of gender
OUTPUT:
RESULT:
The dashboard was created and data analysis was performed successfully using
Power BI Desktop and displayed desired output in neat format.
70
AIM:
To Presentation of a case study for Campus Recruitment Analysis using
Power BI Desktop
The Challenge
Campus Placements/ Campus Recruitment drives are conducted in various educational
institutes for providing job opportunities to the students who are pursuing their particular
academic courses.
As much as it is important to the students, it is also important to the institute as it gives
a chance to contemplate about the process. This data includes students from various
colleges.
The Solution
What every management team wants to know:
● How many companies appeared?
● How many students appeared?
● Which companies have hired the most of the students and from which college?
● How many students accepted the offer?
● Were the companies beneficial in accordance to their vacancies?
● How many students were selected by the companies?
● What was the ratio between students applied and the vacancies available?
● Which college were the most preferred by the students?
● Which technologies were the most preferred by the students?
● Which colleges were benefitted by this recruitment in terms of the selection?
● Which colleges did not perform up to the mark?
● Which colleges were involved the most in terms of the students registered?
● Which companies were beneficial in accordance to their vacancies?
OUTPUT:
RESULT:
The Presentation of a case study for Campus Recruitment Analysis was performed
successfully using Power BI Desktop and displayed desired output in neat format.