Manual
Manual
1. Merging columns
1. Go to Data>Get & transform data>Get Data>from Excel workbook.
2. Select the workbook you wish to load. You can select multiple
workbooks.
3. Select transform data.
4. Range the target columns in the order you want them merged.
5. Go to Transform/Add column*>Merge Columns. Select the separator
and rename the merged column.
6. Click OK.
*“Add column” ribbon → select columns by pressing control according to the sequence → Merge
columns → select separator and name the column. → See a new column created.
“Transform” ribbon → select columns by pressing control according to the sequence → Merge columns
→ select separator and name the column. → Only see ONE column, does not add columns
Tips: You may also do customisation on the new column you want to add. Go to Add Column>Custom
Column to add a new column and merge all the target columns into this new column. But this method
cannot be used to merge columns with “number” format. So, we must first change the format to “text”
before customising.
Tips:
1. To load your edited data into Excel, click Home>Close & Load.
2. To edit the table in Power Query, click Query>Edit.
3. The data in Excel is linked to the original data through “Power Query”, so even if the data is deleted,
when we click refresh, the data will be loaded back from the original data. If need the loaded data to do
report, need to unlink. Go to “Table design” → convert to range / unlink.
Tips: Columns must be the same. Append tables means to put the data line by line – layman term is to
combine all data under the same column headers.
1. Go to Data>Get & transform data>Get Data>from Excel workbook.
2. Select the workbook you wish to load. You can select multiple
workbooks.
3. Select transform data.
4. Go to Power Query, select Home>Combine>Merge Queries as New.
5. Select the target tables and target columns (use control button) to
merge.
6. After merging, load the final query into Excel
Tips:
1. If the column headers are not the same, the data will be loaded in a separate column. Even if there is a
difference of spacing, Excel will deem as different.
2. If there are new tables added into the folder and you wish to update, click Home>Refresh to have the
new tables combined.
4. Pivoting columns
5. Unpivoting columns
6. Transpose data
Tips:
1. You may need to TRIM your data before you start splitting.
2. You may need to split more than one time in a single table, and may need to use more than one type of
splitting method.
Try it!
8. Indexing columns
Revision!
Combining data from folder – use “Sales Data” folder!
Can you do it?
You can also do this for columns of other data type, e.g. number.
Try to find the profit!
11. Remove duplicates
*Tips: You can always “undo” by cancelling the steps you have done. Just click the cross button beside
the steps listed under “Applied Steps” at right hand side of Power Query window.
Data Analytics
You need plug in to run this session.
https://support.microsoft.com/en-us/office/load-the-analysis-toolpak-in-excel-6a63e598-cd6d-42e3-9317-
6b40ba1a66b4
Click the File tab, click Options, and then click the Add-Ins category.
In the Add-Ins box, check the Analysis ToolPak check box, and then click OK.
- If Analysis ToolPak is not listed in the Add-Ins available box, click Browse to locate it.
- If you are prompted that the Analysis ToolPak is not currently installed on your
computer, click Yes to install it.
12. Descriptive Analysis
Tips:
Mean – Average
Standard Error – A measure of how accurate the mean of a sample is likely to be compared to the true
population mean. A small standard error indicates that the sample mean is a reliable estimate of the
population mean, while a large standard error means that the sample mean may vary a lot from the
population mean. The standard error decreases as the sample size increases
Median – Center value
Mode – The value that occurs most frequently in a given set of data.
Standard Deviation - A measure of the amount of variation or dispersion of a set of values.
The smallest possible value for the standard deviation is 0, and that happens only in contrived situations
where every single number in the data set is exactly the same (no deviation).
Sample Variance – A measure of how far a set of numbers is spread out from their average value
Kurtosis – The sharpness of the peak of a frequency-distribution curve. A positive value for the kurtosis
indicates a distribution more peaked than normal, while a negative kurtosis indicates a shape flatter than
normal.
Skewness – A measure of the symmetry of a distribution. It can be used to determine whether a dataset is
symmetric or skewed. A negative value for skewness indicates that the tail is on the left side of the
distribution, which extends towards more negative values. A positive value for skewness indicates that the
tail is on the right side of the distribution, which extends towards more positive values. If skewness = 0,
the data are perfectly symmetrical.
Range = Max - Min
Minimum – Smallest value
Maximum – Largest value
Sum – Addition of all values
Count – The number of values
Tips: You can also do descriptive analysis using pivot table. How?
13. Correlation