Module 4
Module 4
● Location Analytics
○ Sources of Location Data
○ Categories of Location Analytics
○ Location Analytics and Privacy Concerns
○ Location Analytics Tools
● Search Engine Analytics
○ Types of Search Engines
○ Search Engine Analytics
○ Search Engine Analytics Tools
Location Analytics
● Location analytics, also known as spatial analysis or geo-analytics is
concerned with mapping, visualizing, and mining the location of
people, data, and other resources.
● All sectors, including business, government, nonprofit, and academia,
can benefit from location analytics.
● scientists used six million geo-located Twitter messages to observe the
“heartbeat” of New York City. Using the dataset, the scientist were able to
study and map the waking, sleeping, commuting, work, and leisure
dynamics of the people living in the city during the weekday and
weekends.
● Such geo-analytics can be instrumental in better understanding our
cities
Module 4
● Location Analytics
○ Sources of Location Data
○ Categories of Location Analytics
○ Location Analytics and Privacy Concerns
○ Location Analytics Tools
● Search Engine Analytics
○ Types of Search Engines
○ Search Engine Analytics
○ Search Engine Analytics Tools
SOURCES OF LOCATION DATA
POSTAL ADDRESS
● Most business analytics applications rely on address information of their
customers, including city names, locality names, and postal or zip
codes.
LATITUDE AND LONGITUDE
● In geography, latitude (shown as a horizontal line on a globe) and
longitude (shown as a vertical line on a globe) are used to find exact
location on Earth.
GPS-BASED
● GPS is a satellite-based navigation system that can be used to find
exact location of people and resources. Mobile analytics mostly rely
on
SOURCES OF LOCATION DATA
IP-BASED
● Public IP (Internet protocol) can be used to determine the location
of Internet users.
● A public IP address is an exclusive numerical address (like a home address)
assigned to a device connected to the Internet. Different regions in the world
are assigned a specific block of public IP addresses; hence, it can be used to
mine approximate geo-location of Internet users.
Categories of Location Analytics
● Based on its scope, location analytics can be broadly classified into two
categories:
● Real-World Context
A map links data to the real world which clearly demonstrates how location relates to other data
aspects. This strengthens the insights obtained as analytics teams and end users delve down into
visualizations to discover new opportunities and enhance operations.
● Actionable Insights
Users may create insightful geographical analysis instantly without the learning curve of
other GIS tools with location analytics. It also has the depth of analytical capabilities required
for predictive analysis and other location based optimizations.
Location Analytics and Privacy Concerns
● While location-based services bring ease, convenience, and safety to customers and value
to business, they also raise serious privacy issues related to collection, retention, use,
and disclosure of location information.
● Tracking, mining, and storing location information can endanger some fundamental
human rights, such as freedom of movement and freedom from being observed.
● Minch raised several issues arising from location-based services, including the
following.
● Should users of location-enabled devices be informed when location tracking is in
use?
● Should users of location-enabled devices be permitted to control the storage
of location information?
● Should location information as stored be personally identifiable, or should the
user have the option to preserve degrees of anonymity?
Location Analytics and Privacy Concerns
● What legal protection should a person’s historical location information
have against unreasonable search and seizure?
● To what extent should users of location-based services be allowed to
choose their own level of identifiability/anonymity?
● What level of disclosure control should be dictated by government
regulation? By the affected individual customers, users, etc.? By other
parties?
● What governmental legislation and regulation is appropriate to assure
citizens’ rights of privacy in an era of location-aware mobile devices?
Location Analytics Tools
● Agos: Agos is a geo-tagging and reporting platform that helps communities deal with
climate change adaptation and disaster risk reduction
● Tweepsmap: Tweepsmap (https://tweepsmap.com/) maps your Twitter followers by
country, state, or city.
● Trendsmap: Trendsmap (http://trendsmap.com/) is real-time tool that maps the latest
trends from Twitter, anywhere in the world.
● Followerwonk: This tool helps you perform basic Twitter analytics, such as, who are
your followers? Where are they located? When do they tweet? The tool can be
accessed via http://followerwonk.com/ Esri:
● Esri’s GIS (geographic information systems) is software to map, visualize,
question, analyze, and interpret data to understand relationships, patterns, and
trends (http://www.esri.com/).
Review Questions
analytics?
1) Crawler-based
2) Directories
3) Metasearch engines.
TYPES OF SEARCH ENGINES : Crawler-based
● crawler-based search engines create their databases or lists automatically, without any
human intervention.
● Example: Google.com and Bing.com.
● Crawler-based search engines are widely used to find and access content over the
Internet.
● They operate in three steps:
1) web crawling
2) indexing
3) searching.
TYPES OF SEARCH ENGINES : Crawler-based
Web crawling
● Search engines start by collecting and storing information about web
pages. This mechanism is termed web crawling.
● A web crawler (also known as web spider or bot) is a computer
program or software specifically designed to collect and store data
about websites for indexing.
TYPES OF SEARCH ENGINES : Crawler-based
Indexing
● Indexing helps classify a website correctly for searching purposes.
● The data crawled or extracted is then indexed and stored in a database
for quick access.
● Every search engine may follow different techniques for indexing web page
data.
● Common indexing techniques include storing meta tags (which are used
in the header of a web page and provide descriptions of the website) and
keywords related to a website.
TYPES OF SEARCH ENGINES : Crawler-based
Searching
● Searching is the final step in search engine operations.
● When a user requests specific information by entering keywords in a search
engine, the search engine queries the index and provides a list of the most
relevant web pages by matching it with the indexed keywords.
● However, it may not be that simple; search engines use a variety of factors
to rank and provide a list of matching websites.
● The engine ranks content on a number of factors, such as the
authoritativeness of a page, back links to the page and keywords a page
contains
TYPES OF SEARCH ENGINES : Crawler-based
● In order to achieve good search results an organization must place keywords in section
titles, images, and in the general content of its website.
● A keyword density of 5–8 percent (i.e., five to eight keywords per one hundred words)
is an optimal number.
● Having important keywords embedded in a website enables a search engine robot
to evaluate the website as being the most suitable site for the searched word.
● However, if one repeatedly uses the same keywords or definitions in page content, it may
be perceived by a robot to be spamming.
● Research has shown that the position of keywords in a website, as well as their
duplication, layout, and combination, impact web page visibility in a search
engine,which can be improved by increasing the frequency of keywords in the title,
the full text, and in both the title and full-text.
TYPES OF SEARCH ENGINES : Crawler-based
● In conjunction with key words, the overall design of a website is an
important factor that must be taken into consideration when
discussing search engine optimization.
● For a corporation to better understand its Internet presence, its website
statistics should also be checked on a regular basis so as to
understand both how users access and utilize the site and also what
impact site changes may have on these behaviors.
TYPES OF SEARCH ENGINES : DIRECTORIES
● The listings in directories are manually compiled and created by human editors.
● People who want to be listed in a directory submit an address, title, and brief
description of their website, which is then reviewed by the editor and included in it.
● Some good examples of human-created directories are Yahoo Directory, Open
Directory, and LookSmart.
TYPES OF SEARCH ENGINES : METASEARCH ENGINES
● Metasearch engines compiles and display results from other search engines.
● When a user enters a query, the metasearch engine submits the query to several
individual search engines, and results returned from all the search engines are
integrated, ranked, and displayed to the user.
● Examples :Metacrawler, Mamma, and Dogpile.
● By integrating results from several search engines, metasearch engines are capable
of handling large amounts of data and can help us save time by focusing on one
search engine.
● Based on their scope, search engines can be divided into two types:
1) Local
2) Global
TYPES OF SEARCH ENGINES : METASEARCH ENGINES
LOCAL SEARCH ENGINES
● A search engine is local in the sense that it is embedded within a website and
only indexes and searches the content of that website.
○ Amazon’s CloudSearch or any other search engine embedded within a website
is an example of local search engine.
From a business perspective, Google Trends can help also answer a variety of questions,
including the following.
● How people search for your brand?
● When does interest spike in your products or services?
● Which keywords drive more traffic?
● Which regions are interested in your brand?
● What are trending topics over the Internet?
● How are your competitors performing?
SEARCH ENGINE ANALYTICS TOOLS
● Google Trends: Google Trends (http://trends.google.com/) is a search engine analytics
tool. This chapter includes a detailed tutorial on Google Trends.
● Canopy: Canopy is multimedia analytics tool designed to support deep investigation of
large multimedia collections, such as images, videos, and documents. More information
on Canopy is available here: http://www.vacommunity.org/article32
● Google Alerts: Google Alerts (https://www.google.com/alerts) is a content detection and
notification service that automatically notifies users when new content over the
Internet (e.g., social media, web, blogs, video and/or discussion groups) matches a set of
search terms based on user queries. Users are alerted through e-mail. Find out about how
to use Google Alerts.
SEARCH ENGINE ANALYTICS TOOLS
● Icerocket: Icerocket (http://www.icerocket.com/) specializes in blog searches and also
captures activity on Facebook, Twitter, and Flickr.
● Social Mention: Social Mention (http://socialmention.com/) is similar to Google Alerts,
but it only focuses on social media sites, and you can choose to focus on particular
areas, such as blogs. With Social Mention you can monitor for the appearance of
particular keywords and it will give you information on related users, hashtags, and
more.
● TweetBeep: TweetBeep (http://tweetbeep.com/) is like Google Alerts for Twitter. Choose
some keywords and receive daily search results via e-mail.
Review Questions
1. What is the function of a search engine?
2. Explain different types of search engines.
3. Differentiate between local and global search engines.
4. What is search engine analytics?
5. Explain the two main categories of search engine analytics.
6. What is the purpose of search engine optimization?
7. What is the purpose of search engine trend analysis?