C-Cdlilt-B - CDL Ilt Deck - Module 2 (v1.1)
Lessons
What is data?
Data
Economies of scale
Automation
Rapid elasticity
Data access
Proprietary + Confidential
Start Finish
Data point
Dataset
into ‘buckets’
Dataset Dataset
Unstructured Structured
AI
ML
Take 2 mins to play with the intersections between these datasets. Consider two or more datasets and ask yourself:
1. What insight could I gain if these datasets were combined?
2. How can I explore this data further and turn it into actionable insights?
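To illustrate what combining datasets can look like, here is a minimal sketch that joins two small datasets on a shared date. The datasets, field names, and values are hypothetical and exist only for this illustration.

```python
# Hypothetical datasets: daily umbrella sales and daily weather, keyed by date.
sales = [
    {"date": "2024-06-01", "umbrellas_sold": 3},
    {"date": "2024-06-02", "umbrellas_sold": 41},
    {"date": "2024-06-03", "umbrellas_sold": 5},
]
weather = {
    "2024-06-01": "sunny",
    "2024-06-02": "rain",
    "2024-06-03": "sunny",
}


def combine(sales_rows, weather_by_date):
    """Attach the weather for each date to the matching sales row."""
    return [
        {**row, "weather": weather_by_date[row["date"]]}
        for row in sales_rows
        if row["date"] in weather_by_date
    ]


def average_by_weather(rows):
    """Average umbrellas sold for each weather condition."""
    grouped = {}
    for row in rows:
        grouped.setdefault(row["weather"], []).append(row["umbrellas_sold"])
    return {condition: sum(v) / len(v) for condition, v in grouped.items()}


combined = combine(sales, weather)
insight = average_by_weather(combined)
print(insight)
```

Neither dataset answers "do we sell more umbrellas when it rains?" on its own; the combined rows do.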
Lessons
databases
Database
Cloud SQL (Structured Query Language)
Databases are built and optimized to ingest large amounts of data from many different sources efficiently.
Data warehouses are built to rapidly analyse and report massive and multi-dimensional datasets on an ongoing basis, in real-time.
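The difference between the two workloads can be sketched in a few lines. This is an illustration only: Python's built-in `sqlite3` stands in for both systems, and the `bookings` table, its columns, and the sample rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE bookings (id INTEGER PRIMARY KEY, site TEXT, desk TEXT, price REAL)"
)


# Database-style workload: many small writes, one record at a time.
def record_booking(site, desk, price):
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT INTO bookings (site, desk, price) VALUES (?, ?, ?)",
            (site, desk, price),
        )


for site, desk, price in [
    ("north", "A1", 20.0),
    ("north", "A2", 20.0),
    ("south", "B1", 35.0),
]:
    record_booking(site, desk, price)


# Warehouse-style workload: one query that scans and summarizes many records.
def revenue_by_site():
    cursor = conn.execute(
        "SELECT site, COUNT(*), SUM(price) FROM bookings GROUP BY site ORDER BY site"
    )
    return cursor.fetchall()


report = revenue_by_site()
print(report)
```

In practice the two workloads usually run on separate systems, each tuned for its own access pattern.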
BigQuery
BigQuery is serverless
Pub/Sub Dataflow
data lakes
Data lake
Looker
Exercise
10 min Class Page 9
Example 1
A coworking office rental business uses an online tool to record
daily desk, room, and meeting bookings. If a client books a desk
for the day, that data is captured and desk availability is updated
in real time on all customer channels. The rental business now
wants to do even more with its data. It wants to use multiple
types and sources of data to gain insights about facility quality
and, ultimately, to improve its service to customers.
Example 2
A bank is launching a mobile banking app, and wants to track
money transfers from one account to another. They want to
make sure the transferred figure is updated in the bank’s records
in real time and the user is able to see the most up-to-date
account balance.
Answer: Database
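A minimal sketch of why a database fits this example: a transfer must change both balances together, or neither. Python's built-in `sqlite3` stands in for the bank's database here, and the `accounts` table, account names, and amounts are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE accounts (id TEXT PRIMARY KEY, balance INTEGER NOT NULL)")
conn.executemany("INSERT INTO accounts VALUES (?, ?)", [("alice", 100), ("bob", 50)])
conn.commit()


def transfer(sender, receiver, amount):
    """Move `amount` between two existing accounts; both updates apply or neither does."""
    try:
        with conn:  # one transaction: commit on success, roll back on error
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE id = ?",
                (amount, sender),
            )
            (remaining,) = conn.execute(
                "SELECT balance FROM accounts WHERE id = ?", (sender,)
            ).fetchone()
            if remaining < 0:
                raise ValueError("insufficient funds")
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE id = ?",
                (amount, receiver),
            )
        return True
    except ValueError:
        return False


def balance_of(account):
    """Return the current balance, as the app would show it to the user."""
    row = conn.execute("SELECT balance FROM accounts WHERE id = ?", (account,)).fetchone()
    return row[0]


ok = transfer("alice", "bob", 30)
rejected = transfer("alice", "bob", 500)
print(ok, rejected, balance_of("alice"), balance_of("bob"))
```

The rejected transfer leaves both balances untouched, so the user always sees a consistent, up-to-date figure.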
Example 3
An online music streaming company stores raw music data that
is accessed by users worldwide and constantly analyzed by their
systems. They want to geographically disperse backup copies of
their raw data in very large volumes. This data comes in a
variety of formats, must retain full fidelity, and must be accessible
for processing and analysis at any time, at short notice.
Example 4
A lifestyle company is launching a casual dating mobile app. By
signing onto the app through social media, users provide details
such as gender, location, and interests, as well as headshot
images. The lifestyle company wants to display this information
to other app users through an algorithm, which depends on
compatibility, and needs a cost-effective data management
solution that can hold large volumes of data. They also can’t
afford downtime that would drive users away.
Answer: Database
Afternoon break
Please return in 15 minutes
Lessons
01 It has coverage
03 It’s complete
No Yes No Yes
Vertex AI
Pre-trained APIs
AutoML
Custom model tooling
APIs
AI Hub
A hosted repository of
plug-and-play AI components.
Vision API
Natural Language
AutoML Natural Language
with Vertex AI
Build models
Feature engineering
Gather data
Vertex AI: tools for data labeling, training, Custom model tooling
Document AI
Contact Center AI
MLOps
IT infrastructure, applications, and data management
Question 1
Images and videos are examples of what type of
data?
A. Unstructured
B. Structured
C. Semi-structured
D. Organized
Answer: A) Unstructured
Why? Images and videos are not arranged
according to a pre-set data model or schema.
Since they have no predefined organization and tend to be
qualitative, they are treated as unstructured data.
Question 2
What is a data lake?
A. A database for storing structured and
unstructured data
B. A large pool of data accessible to
database administrators only
C. A repository of data from various sources
stored in its native format for processing
D. A refined data repository accessible by
employees and select customers
Answer: C) A repository of data from
various sources stored in its native
format for processing
Why? The data in a data lake can come from
different sources, but is stored in its raw, native
format and is not transformed when ingested.
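The idea of storing data in its native format can be sketched as follows. This is an illustration under stated assumptions: a temporary local directory stands in for the data lake, and the `ingest` and `read_records` helpers, file names, and sample contents are hypothetical.

```python
import csv
import io
import json
import tempfile
from pathlib import Path


def ingest(lake_dir, name, raw_bytes):
    """Store the bytes exactly as received; nothing is parsed or converted."""
    path = Path(lake_dir) / name
    path.write_bytes(raw_bytes)
    return path


def read_records(path):
    """Interpret a raw file only at read time, based on its extension."""
    path = Path(path)
    text = path.read_text(encoding="utf-8")
    if path.suffix == ".json":
        return json.loads(text)
    if path.suffix == ".csv":
        return list(csv.DictReader(io.StringIO(text)))
    raise ValueError(f"no reader for {path.suffix}")


with tempfile.TemporaryDirectory() as lake:
    raw_csv = b"track,plays\nIntro,10\n"
    raw_json = b'[{"track": "Outro", "plays": 7}]'
    csv_path = ingest(lake, "plays.csv", raw_csv)
    json_path = ingest(lake, "plays.json", raw_json)

    # The stored copy is byte-for-byte identical to what arrived.
    unchanged = csv_path.read_bytes() == raw_csv
    records = read_records(csv_path) + read_records(json_path)

print(unchanged, records)
```

Files from different sources sit side by side in their original formats, and structure is applied only when the data is processed.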
Question 3
What is a common business problem that machine
learning solves?
Answer: A) Creating personalized
customer experiences