0% found this document useful (0 votes)
7 views

1st Unit DA

The document provides an overview of the evolution of data analytics from early manual methods to modern techniques using artificial intelligence and machine learning. It then describes key components of text analytics including preprocessing, sentiment analysis, entity recognition, and applications in various domains. Finally, it outlines aspects of web analytics like data collection, common metrics, and how it helps optimize websites and understand user behavior.

Uploaded by

Ashwin
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views

1st Unit DA

The document provides an overview of the evolution of data analytics from early manual methods to modern techniques using artificial intelligence and machine learning. It then describes key components of text analytics including preprocessing, sentiment analysis, entity recognition, and applications in various domains. Finally, it outlines aspects of web analytics like data collection, common metrics, and how it helps optimize websites and understand user behavior.

Uploaded by

Ashwin
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 7

Page 1 of 7

1. Introduction to data analytics:

Evolution of Data Analytics:


The evolution of data analytics has been a dynamic and ongoing process, shaped by
advances in technology, changes in business practices, and the increasing availability of data. Here's
an overview of its evolution:

1. Early Data Processing (Pre-20th Century): Before the 20th century, data analysis was
mostly manual and limited in scope. People used basic statistical methods and charts to
analyze data.
2. Statistical Analysis and Sampling (20th Century): With the advent of modern statistics,
data analysis became more systematic. Sampling techniques, hypothesis testing, and basic
statistical methods were developed and widely used.
3. Mainframe Computing (1950s-1960s): The introduction of mainframe computers allowed
for more complex data analysis. Statistical software packages like SPSS and SAS emerged
during this period.
4. Spreadsheet Software (1980s): The introduction of spreadsheet software, notably Microsoft
Excel, made it easier for business professionals to conduct data analysis. It democratized data
analysis to some extent.
5. Data Warehousing (1990s): The 1990s saw the rise of data warehousing, where
organizations began to collect and store large volumes of data for analysis. Data
warehousing made it possible to analyze historical data.
6. Data Mining and Business Intelligence (Late 1990s): Data mining and business
intelligence tools emerged, allowing organizations to extract valuable insights from large
datasets. OLAP (Online Analytical Processing) technology made multidimensional analysis
feasible.
7. The Internet and Big Data (2000s): With the growth of the internet and the proliferation of
digital data, the volume of data available for analysis exploded. This period also saw the rise
of open-source tools like Hadoop and NoSQL databases designed for big data analytics.
8. Predictive Analytics and Machine Learning (2010s): The 2010s brought a surge in
predictive analytics and machine learning. Organizations started using advanced algorithms
to make predictions and automate decision-making processes. Tools like R, Python, and
machine learning frameworks like TensorFlow and PyTorch gained popularity.
9. Real-time Analytics and IoT (2010s): The proliferation of IoT (Internet of Things) devices
and the need for real-time analysis led to the development of real-time analytics platforms.
These platforms can process and analyze data in real-time to make immediate decisions.
10. Data Visualization and BI Platforms (2010s): Tools for data visualization, like Tableau and
Power BI, became essential for conveying insights to non-technical stakeholders. Data
analytics platforms also evolved to provide end-to-end solutions for data collection, analysis,
and reporting.
Page 2 of 7

11. Artificial Intelligence and Deep Learning (2010s-Present): AI and deep learning have
transformed data analytics. They allow for more complex pattern recognition, natural
language processing, and image analysis. AI-driven analytics can uncover insights that were
previously impossible to obtain.
12. Cloud Computing and Data Analytics as a Service (DaaS) (2010s-Present): The cloud has
made it easier to store, process, and analyze large datasets without the need for significant
on-premises infrastructure. Many organizations have turned to Data Analytics as a Service
(DaaS) providers to access powerful analytics tools without large upfront costs.
13. Ethics and Privacy Concerns (Ongoing): As data analytics has grown, so have concerns
about data privacy and ethical use. Regulations like GDPR and CCPA have been implemented
to protect individuals' data rights.
14. Future Trends: The evolution of data analytics is likely to continue, with trends including
automated machine learning (AutoML), quantum computing, increased use of natural
language processing, and more integration with AI.

The evolution of data analytics has been driven by technological advancements and the increasing
recognition of data as a valuable resource for making informed decisions in various fields, including
business, healthcare, and scientific research. It will continue to adapt and evolve in response to
changing technologies and societal needs.

Text Analytics:
Text analytics, also known as text mining or natural language processing (NLP), is the process of
extracting valuable insights, patterns, and information from unstructured text data. Unstructured text
data includes a wide range of sources, such as documents, social media posts, emails, customer
reviews, and more. Text analytics leverages linguistic, statistical, and machine learning techniques to
transform this unstructured text into structured data that can be analyzed for various purposes. Here
are the key components and applications of text analytics:

1. Text Preprocessing: Before analysis, text data often requires preprocessing, which includes tasks
like tokenization (splitting text into words or phrases), stemming (reducing words to their root form),
and removing stop words (common words that don't carry much meaning). This helps clean and
structure the text for analysis.

2. Sentiment Analysis: Sentiment analysis, also known as opinion mining, is a common application
of text analytics. It involves determining the emotional tone or sentiment expressed in a piece of text,
whether it's positive, negative, or neutral. Businesses use sentiment analysis to gauge public opinion,
customer feedback, and social media sentiment about their products or services.

3. Named Entity Recognition (NER): NER is a process that identifies and classifies entities
mentioned in text, such as names of people, organizations, locations, dates, and more. It's useful in
applications like information retrieval, document categorization, and chatbots.
Page 3 of 7

4. Text Categorization and Classification: Text analytics is used to categorize and classify
documents or text snippets into predefined categories or topics. This is valuable for organizing
content, routing emails, and automating content tagging.

5. Topic Modeling: Topic modeling techniques, such as Latent Dirichlet Allocation (LDA), are used to
identify underlying topics within a corpus of text. This can help uncover hidden themes and patterns
in large collections of documents.

6. Text Summarization: Text analytics can be used to automatically generate summaries of long
documents or articles. This is helpful for quickly extracting key information from lengthy texts.

7. Text Clustering: Text clustering groups similar documents or text fragments together, which is
useful for organizing content or uncovering patterns in a large dataset.

8. Information Extraction: Information extraction techniques aim to identify specific pieces of


structured information from unstructured text. For example, extracting names and dates from news
articles or product specifications from product descriptions.

9. Text Search and Retrieval: Text analytics is fundamental to search engines and information
retrieval systems. It helps match user queries with relevant documents by analyzing the content of
documents and user queries.

10. Text-Based Chatbots: NLP and text analytics are essential components of chatbots and virtual
assistants, allowing them to understand and respond to user queries in natural language.

11. Content Analysis and Market Research: Businesses use text analytics to analyze customer
feedback, reviews, and social media content to gain insights into market trends, customer
preferences, and competitive intelligence.

12. Legal and Compliance: In the legal field, text analytics can be used for e-discovery, contract
analysis, and compliance monitoring.

Text analytics is a versatile and powerful tool with applications in various domains, including
marketing, customer service, healthcare, finance, and more. It enables organizations to make data-
driven decisions and extract valuable information from the vast amount of unstructured text data
available today.
Page 4 of 7

Web Analytics:
Web analytics is the collection, measurement, analysis, and reporting of data related to web traffic
and user behavior on websites and web applications. It provides valuable insights into how users
interact with a website, helping businesses and website owners understand their audience, optimize
their online presence, and make data-driven decisions. Here are some key aspects and components
of web analytics:

1. Data Collection: Web analytics tools collect data from various sources, including website
logs, tracking scripts, cookies, and other technologies. This data includes information about
visitor behavior, such as page views, clicks, session duration, and more.
2. Key Metrics: Web analytics tools track a range of key metrics, including but not limited to:
 Pageviews: The total number of pages viewed on a website.
 Visits or Sessions: The number of individual visits to the site.
 Unique Visitors: The number of distinct individuals who visited the site.
 Bounce Rate: The percentage of single-page visits where a user leaves the site
without further interaction.
 Conversion Rate: The percentage of visitors who complete a desired action (e.g.,
making a purchase or filling out a form).
 Traffic Sources: Information about where website visitors come from, such as search
engines, referral websites, or direct traffic.
 User Location: Geographic information about visitors.
 User Demographics: Data about the age, gender, and interests of the audience (if
available).
3. User Behavior Tracking: Web analytics tools can track user interactions on a site, including
which pages are viewed, the paths users take through the site, and the actions they perform,
such as clicking links or filling out forms.
4. E-commerce Tracking: For online retailers, web analytics can track e-commerce-specific
metrics, like product views, cart additions, and purchase transactions. This information is
crucial for understanding sales performance.
5. Event Tracking: Event tracking allows webmasters to monitor user interactions that aren't
necessarily pageviews, such as video plays, file downloads, or interactions with embedded
widgets.
6. Custom Goals and Conversions: Web analytics tools enable website owners to define and
track custom goals and conversions. These might include sign-ups, newsletter subscriptions,
or other user interactions that align with business objectives.
7. A/B Testing and Multivariate Testing: Web analytics can be integrated with A/B testing
tools to help businesses compare the performance of different web page variations and
determine which ones are more effective at achieving specific goals.
8. Mobile Analytics: With the increasing use of mobile devices, web analytics tools often
provide insights into mobile traffic, user behavior, and device-specific metrics.
9. Real-Time Analytics: Some web analytics tools offer real-time data, providing immediate
insights into user behavior on a website.
Page 5 of 7

10. Data Visualization and Reporting: Web analytics platforms often include features for
creating reports and visualizing data through charts, graphs, and dashboards.
11. User Segmentation: Web analytics allows you to segment your audience based on various
criteria, enabling more targeted analysis and marketing efforts.
12. Search Engine Optimization (SEO) Analytics: Web analytics tools provide data related to
search engine traffic, keywords driving traffic, and the performance of specific landing pages
in search results.

Popular web analytics tools include Google Analytics, Adobe Analytics, and various other commercial
and open-source solutions. These tools help website owners and businesses make informed
decisions about content, design, marketing strategies, and user experience improvements, ultimately
leading to a better online presence and user engagement.

Prescriptive Analytics:
Prescriptive analytics is the most advanced form of data analytics, which focuses on providing
specific recommendations or actions to optimize decision-making. It goes beyond descriptive
analytics (what happened) and predictive analytics (what is likely to happen) by not only predicting
outcomes but also suggesting the best course of action to achieve desired results. Here are key
aspects and characteristics of prescriptive analytics:

1. Optimization: Prescriptive analytics aims to optimize decisions, processes, and outcomes. It


takes into account various constraints, objectives, and potential actions to determine the best
possible course of action.
2. Data Integration: It relies on a wide range of data sources, including historical data, real-
time data, external data, and contextual data, to make informed recommendations.
3. Machine Learning and Advanced Algorithms: Prescriptive analytics often involves complex
machine learning algorithms and mathematical modeling. These models analyze historical
data and consider multiple variables to make recommendations.
4. Scenarios and What-If Analysis: It allows decision-makers to explore different scenarios
and conduct "what-if" analysis. This enables users to understand the potential consequences
of different decisions and actions.
5. Decision Support Systems: Prescriptive analytics is frequently integrated into decision
support systems, which provide actionable insights and recommendations to users. These
recommendations can be tailored to specific business goals or objectives.
6. Real-Time and Continuous Processing: In some cases, prescriptive analytics operates in
real-time or near-real-time to provide timely recommendations that can adapt to changing
conditions.
7. Industry Applications: It is used in various industries, such as healthcare, finance, supply
chain management, manufacturing, marketing, and more. For example, in healthcare, it can
recommend treatment plans for patients based on their medical history and symptoms.
Page 6 of 7

8. Risk Mitigation: Prescriptive analytics can help organizations identify and mitigate risks. It
provides recommendations for minimizing potential losses or disruptions in various
scenarios.
9. Resource Allocation: It helps organizations allocate resources efficiently, whether it's about
optimizing supply chain logistics, scheduling production, or managing advertising budgets.
10. Personalization: In marketing and e-commerce, prescriptive analytics can offer personalized
product recommendations and pricing strategies based on user behavior and preferences.
11. Ethical Considerations: Prescriptive analytics raises ethical concerns, especially when
decisions involve individuals. Ensuring fairness, transparency, and compliance with privacy
regulations is essential.
12. Human-Machine Collaboration: While prescriptive analytics can make sophisticated
recommendations, it often requires human judgment and expertise to incorporate contextual
knowledge and make the final decision.

Examples of prescriptive analytics in action include:

 Recommending the most cost-effective supply chain routes for a manufacturing company.
 Suggesting personalized investment strategies for a financial advisor's clients.
 Optimizing scheduling and resource allocation in healthcare to improve patient care and
reduce costs.
 Recommending pricing and discount strategies for an e-commerce website.

Overall, prescriptive analytics empowers organizations to make more informed, data-driven decisions
and take proactive steps to achieve their business objectives and strategic goals.

Diagnostic Analytics:
Diagnostic analytics is a category of data analytics that focuses on examining data to understand
why certain events or outcomes occurred. It goes beyond descriptive analytics (which summarizes
historical data) and predictive analytics (which forecasts future outcomes) to dig deeper into the
causes behind specific occurrences. Diagnostic analytics aims to uncover patterns, relationships, and
anomalies in the data to provide insights into the root causes of particular events or problems. Here
are some key characteristics and use cases of diagnostic analytics:

1. Root Cause Analysis: Diagnostic analytics aims to identify the underlying factors or root
causes of specific events or issues. It helps answer questions like "Why did this happen?" or
"What led to this outcome?"
2. Historical Data: It relies on historical data to analyze past events or problems. This data
often includes transaction records, sensor data, logs, or other historical information.
3. Data Exploration: Diagnostic analytics involves a thorough exploration of data through
various techniques, including data visualization, statistical analysis, and data mining, to
uncover insights.
Page 7 of 7

4. Correlation and Causation: The analysis often seeks to identify correlations between
variables and determine whether one variable causes changes in another. This can help
distinguish causation from mere correlation.
5. Hypothesis Testing: Hypotheses are formulated and tested to determine the validity of
potential causes. Statistical tests and experiments may be conducted to validate or reject
these hypotheses.
6. Anomaly Detection: It can identify anomalies or irregularities in data that might be
indicators of problems or unusual events. Anomalies can serve as a starting point for further
investigation.
7. Industry Applications: Diagnostic analytics is used in various industries, including
healthcare, manufacturing, finance, and IT. In healthcare, it might be used to analyze patient
data to understand the causes of adverse events. In manufacturing, it can be used to identify
the reasons behind production defects.
8. Problem Solving: It aids in problem-solving by providing insights into how to prevent or
mitigate future issues. Organizations can use diagnostic analytics to make data-informed
decisions on process improvements and risk management.
9. Human Expertise: While diagnostic analytics tools can reveal patterns and correlations,
human expertise is often essential to interpret results and apply domain knowledge to draw
meaningful conclusions.

Examples of diagnostic analytics include:

 Investigating the reasons for a sudden increase in customer complaints about a specific
product.
 Analyzing machine sensor data to understand the factors leading to equipment failures in a
manufacturing facility.
 Identifying the root causes of budget overruns in a construction project by analyzing project
data.
 Examining patient records and medical histories to determine the factors contributing to a
particular disease outbreak in a healthcare facility.

Diagnostic analytics provides organizations with valuable insights into the factors contributing to
various events, enabling them to address issues, make informed decisions, and prevent similar
problems from occurring in the future.

You might also like