week_2_topic_2
week_2_topic_2
Data Analysis
Week 2 , Topic 2
Cleaning Functions
Excel Data Cleaning is a significant skill that all
Business and Data Analysts must possess. In the
current era of data analytics, everyone expects the
accuracy and quality of data to be of the highest
standards. A major part of Excel Data Cleaning involves
the elimination of blank spaces, incorrect, and outdated
information.
REMOVE DUPLICATES
There is a considerable probability that it might duplicate
unintentionally the data without the user's knowledge. In such
scenarios, you can eliminate the duplicate values.
Here, you will consider a simple student dataset that has duplicate
values. You will use Excel's built-in function to remove duplicates,
as shown below.
To eliminate the duplicate data, you need to select the data option
in the toolbar, and in the Data Tools ribbon, select the "Remove
Duplicates" option. This will provide you with the new dialogue box,
as shown below.
The Excel MID function extracts a given number of
characters from the middle of a supplied text string.
Syntax: =MID (text, start_num, num_chars)
Arguments
text - The text to extract from.
start_num - The location of the first character to extract.
num_chars - The number of characters to extract
The Excel RIGHT function extracts a given number of
characters from the right side of a supplied text string
Syntax: =RIGHT (text, [num_chars])
Arguments
text - The text from which to extract characters on the
right.
num_chars - [optional] The number of characters to
extract, starting on the right. Optional, default = 1.
The Excel LEFT function extracts a given number of
characters from the left side of a supplied text string
Syntax: =LEFT (text, [num_chars])
Arguments
text - The text from which to extract characters.
num_chars - [optional] The number of characters to
extract, starting on the left side of text. Default = 1.
The Excel TRIM function strips extra spaces from text,
leaving only a single space between words and no
space characters at the start or end of the text.
Syntax =TRIM (text)
Arguments
text - The text from which to remove extra space.
The Excel UPPER function converts a text string to all uppercase letters. Numbers, punctuation,
and spaces are not affected.
Purpose
Convert text to upper case
Return value
Uppercase text.
Syntax =UPPER (text)
Arguments
text - The text that to convert to upper case.
The LOWER function converts a text string to all lowercase letters. The LOWER function
takes just one argument, text, which can be a text value or cell reference. LOWER
changes any uppercase characters in the text value to lowercase. Numbers,
punctuation, and spaces are not affected.
Syntax =LOWER (text)
Arguments
text - The text that should be converted to lower case.
The Excel PROPER function capitalizes each word in a given text string.
Numbers, punctuation, and spaces are not affected. Syntax
=PROPER (text)
Arguments
text - The text that should be converted to proper case.
The Excel SUBSTITUTE function replaces text in a given string by matching.
Return value
The processed text
Syntax
=SUBSTITUTE (text, old_text, new_text, [instance])
Arguments
text - The text to change.
old_text - The text to replace.
new_text - The text to replace with.
instance - [optional] The instance to replace. If not supplied, all instances
are replaced.
Use CONCATENATE, one of the text functions, to join two
or more text strings into one string.
Syntax: CONCATENATE(text1, [text2], ...)