SlideShare a Scribd company logo
Nikolay Novozhilov
Wego.com
Using BigQuery as a main Big
Data solution
About Wego
Wego.com is Asia Pacific and the Middle East’s
leading flight/hotel metasearch engine used by
millions of travelers.
Wego was founded in 2005 in Singapore
Introducing BigQuery
Service for interactive analysis of massive datasets
(TBs)
Query billions of rows: seconds to write, seconds to
return
Uses a SQL-style query syntax
It's a service, accessed by a RESTful API
Pay only for what you use
Based on internal Google tool - Dremel
Column oriented, append only…
Data architecture in Wego
...
Why did we do it?
MySQL
“Zoo”
BigQuery
Why Hadoop is more popular?
My collection of concerns
Your data goes to cloud
Not open-source, Google can stop the service
“Strange” pricing model
Hadoop is trending, has bigger community
Append only database
???
Costs: storage + cost per query
Same fallacy again:
 “I want to launch a mom@pop – let’s buy a
building”
 “I want to build a site – let’s by servers”
 “I want big data – let’s build a data-warehouse”
Usual concerns:
 No realistic estimate upfront
 “Fear of running a query”
StackOverflow support
53
minutes!
Append only…
Slowly changing dimensions:
 daily re-load from MySQL
 daily upload from MySQL, keeping history
Absolutely necessary updates:
 do you really need it?
 BigQuery allows to save query to initial table:
Your
table Query
Actually useful - “Discovery mode”
Actually useful
Huge joins
REGEXT_MATCH(), …
Rich SQL - window functions
Nested data
My answer
What is Big Data revolution?
There is no difference
between big data and small
data anymore
Contacts
Blog: www.novozhilov.co
Email: nik@wego.com
“Yes, Sir, I tired to build an ROI case for
our BI project - but I couldn’t access any
reliable data!”
TimoElliott.com

More Related Content

What's hot (20)

PDF
Quick Intro to Google Cloud Technologies
Chris Schalk
 
PDF
About Pragmatic Works
MILL5
 
PDF
Google BigQuery
Matthias Feys
 
PDF
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Cathrine Wilhelmsen
 
PDF
Visualisation Meets Virtualisation
Denodo
 
PPTX
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Imam Raza
 
PDF
The Evolving Landscape of Data Engineering
Andrei Savu
 
PDF
TDC2016SP - Trilha BigData
tdc-globalcode
 
PPTX
Event Sponsor NetApp - CSO- Jon Kissane
Hostway|HOSTING
 
PPTX
Beyond Batch: Is ETL still relevant in the API economy?
SnapLogic
 
PDF
Big Query Basics
Ido Green
 
PDF
2017 09-27 democratize data products with SQL
Yu Ishikawa
 
PPTX
Pick a Winner: How to Choose a Data Warehouse
Matillion
 
PPTX
Data Structure and Types
Anjani Phuyal
 
ODP
Big Data Analytics with Google BigQuery. By Javier Ramirez. All your base Co...
javier ramirez
 
PDF
Google Bigtable
GirdhareeSaran
 
PDF
Cloud Developer Days - BigQuery
Wlodek Bielski
 
PPTX
Tor Hovland: Taking a swim in the big data lake
AnalyticsConf
 
PPTX
Cloud and Big Data trends
Sebastien Goasguen
 
PDF
Simplify Data Analytics Over the Cloud
Tyler Wishnoff
 
Quick Intro to Google Cloud Technologies
Chris Schalk
 
About Pragmatic Works
MILL5
 
Google BigQuery
Matthias Feys
 
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Cathrine Wilhelmsen
 
Visualisation Meets Virtualisation
Denodo
 
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Imam Raza
 
The Evolving Landscape of Data Engineering
Andrei Savu
 
TDC2016SP - Trilha BigData
tdc-globalcode
 
Event Sponsor NetApp - CSO- Jon Kissane
Hostway|HOSTING
 
Beyond Batch: Is ETL still relevant in the API economy?
SnapLogic
 
Big Query Basics
Ido Green
 
2017 09-27 democratize data products with SQL
Yu Ishikawa
 
Pick a Winner: How to Choose a Data Warehouse
Matillion
 
Data Structure and Types
Anjani Phuyal
 
Big Data Analytics with Google BigQuery. By Javier Ramirez. All your base Co...
javier ramirez
 
Google Bigtable
GirdhareeSaran
 
Cloud Developer Days - BigQuery
Wlodek Bielski
 
Tor Hovland: Taking a swim in the big data lake
AnalyticsConf
 
Cloud and Big Data trends
Sebastien Goasguen
 
Simplify Data Analytics Over the Cloud
Tyler Wishnoff
 

Similar to Using BigQuery as a main Big Data solution (20)

PDF
Big Query - Women Techmarkers (Ukraine - March 2014)
Ido Green
 
PDF
Exploring BigData with Google BigQuery
Dharmesh Vaya
 
PPTX
Google Developer Group - Cloud Singapore BigQuery Webinar
Rasel Rana
 
PDF
Google BigQuery is the future of Analytics! (Google Developer Conference)
Rasel Rana
 
PDF
An indepth look at Google BigQuery Architecture by Felipe Hoffa of Google
Data Con LA
 
PPTX
Google BigQuery 101 & What’s New
DoiT International
 
PDF
Big Data Analytics with Google BigQuery, by Javier Ramirez, datawaki, at Span...
javier ramirez
 
PDF
Supercharge your data analytics with BigQuery
Márton Kodok
 
PDF
Big query
Tanvi Parikh
 
PPTX
Taras Kloba "Аналіз 100 мільярдів записів за 30 секунд за допомогою Google Bi...
Lviv Startup Club
 
PDF
Big datalab
David Chen
 
PDF
VoxxedDays Bucharest 2017 - Powering interactive data analysis with Google Bi...
Márton Kodok
 
PPTX
bigquery.pptx
Harissh16
 
PDF
Google BigQuery for Everyday Developer
Márton Kodok
 
PDF
Executive Intro to BigQuery
William M. Cohee
 
PDF
Mongo DB: Operational Big Data Database
Xpand IT
 
PDF
[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases
Tatvic Analytics
 
PDF
Big query the first step - (MOSG)
Soshi Nemoto
 
PDF
API Analytics with Redis and Bigquery. NoSQLmatters Cologne '14 edition. Javi...
javier ramirez
 
Big Query - Women Techmarkers (Ukraine - March 2014)
Ido Green
 
Exploring BigData with Google BigQuery
Dharmesh Vaya
 
Google Developer Group - Cloud Singapore BigQuery Webinar
Rasel Rana
 
Google BigQuery is the future of Analytics! (Google Developer Conference)
Rasel Rana
 
An indepth look at Google BigQuery Architecture by Felipe Hoffa of Google
Data Con LA
 
Google BigQuery 101 & What’s New
DoiT International
 
Big Data Analytics with Google BigQuery, by Javier Ramirez, datawaki, at Span...
javier ramirez
 
Supercharge your data analytics with BigQuery
Márton Kodok
 
Big query
Tanvi Parikh
 
Taras Kloba "Аналіз 100 мільярдів записів за 30 секунд за допомогою Google Bi...
Lviv Startup Club
 
Big datalab
David Chen
 
VoxxedDays Bucharest 2017 - Powering interactive data analysis with Google Bi...
Márton Kodok
 
bigquery.pptx
Harissh16
 
Google BigQuery for Everyday Developer
Márton Kodok
 
Executive Intro to BigQuery
William M. Cohee
 
Mongo DB: Operational Big Data Database
Xpand IT
 
[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases
Tatvic Analytics
 
Big query the first step - (MOSG)
Soshi Nemoto
 
API Analytics with Redis and Bigquery. NoSQLmatters Cologne '14 edition. Javi...
javier ramirez
 
Ad

Recently uploaded (20)

PDF
apidays Munich 2025 - The Physics of Requirement Sciences Through Application...
apidays
 
PDF
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
PPT
introdution to python with a very little difficulty
HUZAIFABINABDULLAH
 
PDF
apidays Munich 2025 - Integrate Your APIs into the New AI Marketplace, Senthi...
apidays
 
PDF
apidays Munich 2025 - Making Sense of AI-Ready APIs in a Buzzword World, Andr...
apidays
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PDF
blockchain123456789012345678901234567890
tanvikhunt1003
 
PPTX
Insurance-Analytics-Branch-Dashboard (1).pptx
trivenisapate02
 
PPTX
Introduction to Data Analytics and Data Science
KavithaCIT
 
PDF
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
PPTX
Customer Segmentation: Seeing the Trees and the Forest Simultaneously
Sione Palu
 
PPTX
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
PPT
Real Life Application of Set theory, Relations and Functions
manavparmar205
 
PPT
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
PDF
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
PPTX
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
PPTX
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
PPTX
The whitetiger novel review for collegeassignment.pptx
DhruvPatel754154
 
PDF
apidays Munich 2025 - The Double Life of the API Product Manager, Emmanuel Pa...
apidays
 
PPTX
Introduction to computer chapter one 2017.pptx
mensunmarley
 
apidays Munich 2025 - The Physics of Requirement Sciences Through Application...
apidays
 
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
introdution to python with a very little difficulty
HUZAIFABINABDULLAH
 
apidays Munich 2025 - Integrate Your APIs into the New AI Marketplace, Senthi...
apidays
 
apidays Munich 2025 - Making Sense of AI-Ready APIs in a Buzzword World, Andr...
apidays
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
blockchain123456789012345678901234567890
tanvikhunt1003
 
Insurance-Analytics-Branch-Dashboard (1).pptx
trivenisapate02
 
Introduction to Data Analytics and Data Science
KavithaCIT
 
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
Customer Segmentation: Seeing the Trees and the Forest Simultaneously
Sione Palu
 
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
Real Life Application of Set theory, Relations and Functions
manavparmar205
 
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
The whitetiger novel review for collegeassignment.pptx
DhruvPatel754154
 
apidays Munich 2025 - The Double Life of the API Product Manager, Emmanuel Pa...
apidays
 
Introduction to computer chapter one 2017.pptx
mensunmarley
 
Ad

Using BigQuery as a main Big Data solution

Editor's Notes

  • #7: https://www.google.com.sg/trends/explore#q=BigQuery%2C%20Amazon%20Redshift%2C%20Apache%20Spark%2C%20ElasticSearch%2C%20Hadoop&cmpt=q&tz=Etc%2FGMT-8
  • #14: https://www.google.com.sg/trends/explore#q=BigQuery%2C%20Amazon%20Redshift%2C%20Apache%20Spark%2C%20ElasticSearch%2C%20Hadoop&cmpt=q&tz=Etc%2FGMT-8