0% found this document useful (0 votes)
12 views11 pages

ISDA2 - Stage 2 Group 47

Uploaded by

missmokoena20
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
12 views11 pages

ISDA2 - Stage 2 Group 47

Uploaded by

missmokoena20
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 11

SCHOOL OF BUSINESS SCIENCES

INFORMATION SYSTEMS DATA ANALYTICS IIA (INFO2006A)

Group Number: ______74__________ Group Name: _____ Brogrammers _____

Stage 2:
Excel +
Part 1 - Excel
VBA

No. Group Member Name Student Number


1 Progress Chendume 2449879
2 Orathile Molai 2447673
3 Batseba Molepo 2324039
4 Rethabile Nhlapho 2445327
5 Bhekisisa Bayanda Vilakazi 2480767
6 Siphosihle Zwane 2457046

1
Question 1
Screenshots
Transactions data

Cashbook data

2
Salesperson data

Explanation
For each data set that has “Salesperson ID”
- Under “Salesperson ID” column, click CTRL+SHIFT+Down Arrow
- On the home tab, click on “Find and Select” on the top left
- Click on “Replace”
- Find: WWSP
- Replace with: SP
- Click “Replace all”

Question 2
Screenshot
Transactions data

3
Explanation
To correct the invoice numbers
 Click on cell A2
 CTRL+Shift+Down Arrow
 Click on the yellow diamond shape icon with an exclamation mark in it, which will be located on the top right of
the column
 From the menu that appears, select ‘Convert to number’

Screenshot
Cashbook data

Explanation
All column containing monetary values should include the rand (R) format
 Select the amount (incl. VAT) column from C2
 Go to General under the Home tab
 Select Accounting Format

Screenshot
Product data

4
Explanation
All column containing monetary values should include the rand (R) format
 Select the average price column from C2
 Go to General under the Home tab
 Select Accounting Format

Screenshot
Stores Data

Explanation
The number format in the home tab was used to format the cell phones by using the custom option which allows custom
settings for the cells in a column. This function was used because it formats all the cells in a row to a ‘customised’ cell
phone number format which is consistent with the rest of the data and new inputs into the row will automatically be
formatted to the cell phone number format.

Question 3
Screenshot

Explanation
Data which was not in the correct columns in the stores data was in the ‘street column’ so the inputs were copied and
pasted into the correct columns for completeness in the data. For the inputs in the zip code column which started with 0,
a customized number format from the number drop down in the home tab was used for those cells to appear in the zip
code column as they have appeared in the street column.

5
Question 4
Screenshots
Transaction Data

Stores Data

Customer Data

6
Salesperson Data

Explanation
Transaction Data
Step 1: Use the textjoin formula to combine the data in the different cells in the transaction data while subsequently
creating a new column "helper 1". This is done to include the invoice column when doing the power query.

Step 2: Go to the data tab and select "from table" to start a power query.

Step 3: Select keep rows the select keep duplicates to identify the duplicate entries in the transaction data then create a
new sheet to clearly see the duplicate entries.

Step 4: Exit power query and select original transaction data then select remove duplicates which should be under data
tab.

15 duplicates should be removed.

Customer Data, Salesperson Data and Stores Data


The duplicates entries for the customer data (the card no column), salesperson data (the salesperson number and
email) and stores data (store ID) were identified using the using step 2 and 3.

- The duplicates in the customer data and salesperson data cannot be removed unless the correct information is
acquired and updated as this will result in either the deletion of an entire row or missing / blank spaces.
- The duplicates in the stores data can be removed by deleting either column A or B in the data set.

7
Question 5
Screenshots
Highest sales

Salesperson

Explanation
- The max function returns the highest value in a given set of numeric values. This function was used to find the
highest sales amount of R10 829.06 in the given set of sales.
- The index function returns a value from within a table or range. The index function was used to find the
salesperson ID of the salesperson with the highest number of sales.
- The match function searches for a specified item in a range of cells and then returns the relative position of that
item in the range. This function was used to find the row number of the salesperson ID for the salesperson with
the highest number of sales.

Question 6
Screenshots
VAT Amount

8
VAT Percentage

Explanation
- The VAT amount was calculated by subtracting the amount exclusive of VAT from the amount inclusive of VAT.
- The VAT percentage was calculated by dividing the VAT amount by the amount exclusive of VAT.

Question 7

Screenshot

Formula
=IF(H2=1,"Food and Beverages",IF(H2=2,"Clothing",IF(H2=3,"Baby",IF(H2=4,"Home and Lifestyle",IF(H2=5,"Alcoholic
Beverages",IF(H2=6,"Health and Beauty"))))))

Explanation
The IF function returns one value if a condition is true and another value if the condition is false. In this case, several IF
functions were utilized therefore if the first condition was not met, the next condition was tested. This went on until a
condition was met.

Question 8
Screenshots

9
Explanation

- The COUNTIF function is used to count the number of cells that meet a certain criteria. In this case, it was used
to count the total number of transactions for each store. The criteria was the Store ID.
- The MAX function was then used to calculate the maximum number of transactions in the range of total number
of transactions for each store.

Question 9
To detect fraud in excel data we use:

Benford's law which is an observation that in many real life sets of numerical data, the leading digit is likely to be small.
1. Obtain transaction-by-employee data
 Isolate the first digit of each transaction using the LEFT function

2. Chart Benford's distribution


 The probability for Benford's distribution is uniform for all data sets but is subject to deviation

3. Determine the desired confidence level


 Using the 95% confidence level determine
 Upper limit: P. + 1.96*VP. * (1 - P.)/n + 1/(2n)
 Lower limit: Pe-1.96 * VPe* (1 - P.)/n - 1/(2n)
 Plot the transaction data on the distribution chart
10
4. Analyse the transactions data
 Using the COUNTIF function count the different first digit totals
 Determine the digit frequency
 Insert a combo chart that represents the Benford's distribution, upper limit, lower limit and the digit frequency
 When the digit frequency lies above the upper limit of the data, we have reason to believe there has been
fraudulent activity

11

You might also like