0% found this document useful (0 votes)
3 views

week_2_topic_2

The document covers essential Excel data cleaning functions necessary for business and data analysts, including methods to remove duplicates and various text manipulation functions like MID, RIGHT, LEFT, TRIM, UPPER, LOWER, PROPER, SUBSTITUTE, CONCATENATE, and TEXT. Each function is explained with its syntax and arguments, emphasizing their importance in ensuring data accuracy and quality. Mastery of these functions is crucial for effective data analysis.

Uploaded by

alexisogbolu
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

week_2_topic_2

The document covers essential Excel data cleaning functions necessary for business and data analysts, including methods to remove duplicates and various text manipulation functions like MID, RIGHT, LEFT, TRIM, UPPER, LOWER, PROPER, SUBSTITUTE, CONCATENATE, and TEXT. Each function is explained with its syntax and arguments, emphasizing their importance in ensuring data accuracy and quality. Mastery of these functions is crucial for effective data analysis.

Uploaded by

alexisogbolu
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 13

CWW Tech Africa

Data Analysis
Week 2 , Topic 2
Cleaning Functions
Excel Data Cleaning is a significant skill that all
Business and Data Analysts must possess. In the
current era of data analytics, everyone expects the
accuracy and quality of data to be of the highest
standards. A major part of Excel Data Cleaning involves
the elimination of blank spaces, incorrect, and outdated
information.
REMOVE DUPLICATES
There is a considerable probability that it might duplicate
unintentionally the data without the user's knowledge. In such
scenarios, you can eliminate the duplicate values.
Here, you will consider a simple student dataset that has duplicate
values. You will use Excel's built-in function to remove duplicates,
as shown below.
To eliminate the duplicate data, you need to select the data option
in the toolbar, and in the Data Tools ribbon, select the "Remove
Duplicates" option. This will provide you with the new dialogue box,
as shown below.
The Excel MID function extracts a given number of
characters from the middle of a supplied text string.
Syntax: =MID (text, start_num, num_chars)
Arguments
text - The text to extract from.
start_num - The location of the first character to extract.
num_chars - The number of characters to extract
The Excel RIGHT function extracts a given number of
characters from the right side of a supplied text string
Syntax: =RIGHT (text, [num_chars])
Arguments
text - The text from which to extract characters on the
right.
num_chars - [optional] The number of characters to
extract, starting on the right. Optional, default = 1.
The Excel LEFT function extracts a given number of
characters from the left side of a supplied text string
Syntax: =LEFT (text, [num_chars])
Arguments
text - The text from which to extract characters.
num_chars - [optional] The number of characters to
extract, starting on the left side of text. Default = 1.
The Excel TRIM function strips extra spaces from text,
leaving only a single space between words and no
space characters at the start or end of the text.
Syntax =TRIM (text)
Arguments
text - The text from which to remove extra space.
The Excel UPPER function converts a text string to all uppercase letters. Numbers, punctuation,
and spaces are not affected.
Purpose
Convert text to upper case
Return value
Uppercase text.
Syntax =UPPER (text)
Arguments
text - The text that to convert to upper case.
The LOWER function converts a text string to all lowercase letters. The LOWER function
takes just one argument, text, which can be a text value or cell reference. LOWER
changes any uppercase characters in the text value to lowercase. Numbers,
punctuation, and spaces are not affected.
Syntax =LOWER (text)
Arguments
text - The text that should be converted to lower case.
The Excel PROPER function capitalizes each word in a given text string.
Numbers, punctuation, and spaces are not affected. Syntax
=PROPER (text)
Arguments
text - The text that should be converted to proper case.
The Excel SUBSTITUTE function replaces text in a given string by matching.
Return value
The processed text
Syntax
=SUBSTITUTE (text, old_text, new_text, [instance])
Arguments
text - The text to change.
old_text - The text to replace.
new_text - The text to replace with.
instance - [optional] The instance to replace. If not supplied, all instances
are replaced.
Use CONCATENATE, one of the text functions, to join two
or more text strings into one string.
Syntax: CONCATENATE(text1, [text2], ...)

rgument name Description


text1 (required) The first item to join. The item
can be a text value, number, or
cell reference.
Text2, ... (optional) Additional text items to join. You
can have up to 255 items, up to
a total of 8,192 characters.
The TEXT function lets you change the way a number appears by
applying formatting to it with format codes. It's useful in
situations where you want to display numbers in a more readable
format, or you want to combine numbers with text or symbols. In
its simplest form, the TEXT function says:
=TEXT(Value you want to format, "Format code you want
to apply") Formula
=TEXT(1234.567,"$#,##0.00")
Description
Currency with a thousands separator and 2
decimals, like $1,234.57. Note that Excel rounds the
value to 2 decimal places.

=TEXT(TODAY(),"MM/DD/YY") Today’s date in MM/DD/YY format, like 03/14/12

=TEXT(TODAY(),"DDDD") Today’s day of the week, like Monday

=TEXT(NOW(),"H:MM AM/PM") Current time, like 1:29 PM


=TEXT(0.285,"0.0%") Percentage, like 28.5%
=TEXT(4.34 ,"# ?/?") Fraction, like 4 1/3
=TRIM(TEXT(0.34,"# ?/?")) Fraction, like 1/3. Note this uses the TRIM function to
remove the leading space with a decimal value.

=TEXT(12200000,"0.00E+00") Scientific notation, like 1.22E+07


=TEXT(1234567898,"[<=9999999]###-####; Special (Phone number), like (123) 456-7898
(###) ###-####")
=TEXT(1234,"0000000") Add leading zeros (0), like 0001234

=TEXT(123456,"##0° 00' 00''") Custom - Latitude/Longitude

You might also like