SlideShare a Scribd company logo
NoSQL way in PostgreSQL 
Vibhor Kumar (Principal System Engineer)
Agenda 
• Intro to JSON, HSTORE and PL/V8 
• JSON History in Postgres 
• JSON Data Types, Operators and Functions 
• JSON, JSONB– when to use which one? 
• JSONB and Node.JS – easy as pie 
• NoSQL Performance in Postgres – fast as greased lightning 
• Say ‘Yes’ to ‘Not only SQL’ 
• Useful resources 
© 2014 EnterpriseDB Corporation. All rights reserved. 2
Let’s Ask Ourselves, Why NoSQL? 
• Where did NoSQL come from? 
− Where all cool tech stuff comes from – Internet companies 
• Why did they make NoSQL? 
− To support huge data volumes and evolving demands for ways 
to work with new data types 
• What does NoSQL accomplish? 
− Enables you to work with new data types: email, mobile 
interactions, machine data, social connections 
− Enables you to work in new ways: incremental development 
and continuous release 
• Why did they have to build something new? 
− There were limitations to most relational databases 
© 2014 EnterpriseDB Corporation. All rights reserved. 3
NoSQL: Real-world Applications 
• Emergency Management System 
− High variability among data sources required high schema 
flexibility 
• Massively Open Online Course 
− Massive read scalability, content integration, low latency 
• Patient Data and Prescription Records 
− Efficient write scalability 
• Social Marketing Analytics 
− Map reduce analytical approaches 
Source: Gartner, A Tour of NoSQL in 8 Use Cases, 
by Nick Heudecker and Merv Adrian, February 28, 2014 
© 2014 EnterpriseDB Corporation. All rights reserved. 4
Postgres’ Response 
• HSTORE 
− Key-value pair 
− Simple, fast and easy 
− Postgres v 8.2 – pre-dates many NoSQL-only solutions 
− Ideal for flat data structures that are sparsely populated 
• JSON 
− Hierarchical document model 
− Introduced in Postgres 9.2, perfected in 9.3 
• JSONB 
− Binary version of JSON 
− Faster, more operators and even more robust 
− Postgres 9.4 
© 2014 EnterpriseDB Corporation. All rights reserved. 5
Postgres: Key-value Store 
• Supported since 2006, the HStore 
contrib module enables storing 
key/value pairs within a single 
column 
• Allows you to create a schema-less, 
ACID compliant data store within 
Postgres 
• Create single HStore column and 
include, for each row, only those keys 
which pertain to the record 
• Add attributes to a table and query 
without advance planning 
•Combines flexibility with ACID compliance 
© 2014 EnterpriseDB Corporation. All rights reserved. 6
HSTORE Examples 
• Create a table with HSTORE field 
CREATE TABLE hstore_data (data HSTORE); 
• Insert a record into hstore_data 
INSERT INTO hstore_data (data) VALUES (’ 
"cost"=>"500", 
"product"=>"iphone", 
"provider"=>"apple"'); 
• Select data from hstore_data 
SELECT data FROM hstore_data ; 
------------------------------------------ 
"cost"=>"500”,"product"=>"iphone”,"provider"=>"Apple" 
(1 row) 
© 2014 EnterpriseDB Corporation. All rights reserved. 7
Postgres: Document Store 
• JSON is the most popular 
data-interchange format on the web 
• Derived from the ECMAScript 
Programming Language Standard 
(European Computer Manufacturers 
Association). 
• Supported by virtually every 
programming language 
• New supporting technologies 
continue to expand JSON’s utility 
− PL/V8 JavaScript extension 
− Node.js 
• Postgres has a native JSON data type (v9.2) and a JSON parser and a 
variety of JSON functions (v9.3) 
• Postgres will have a JSONB data type with binary storage and indexing 
(coming – v9.4) 
© 2014 EnterpriseDB Corporation. All rights reserved. 8
JSON Examples 
• Creating a table with a JSONB field 
CREATE TABLE json_data (data JSONB); 
• Simple JSON data element: 
{"name": "Apple Phone", "type": "phone", "brand": 
"ACME", "price": 200, "available": true, 
"warranty_years": 1} 
• Inserting this data element into the table json_data 
INSERT INTO json_data (data) VALUES 
(’ { "name": "Apple Phone", 
"type": "phone", 
"brand": "ACME", 
"price": 200, 
"available": true, 
"warranty_years": 1 
} ') 
© 2014 EnterpriseDB Corporation. All rights reserved. 9
JSON Examples 
• JSON data element with nesting: 
{“full name”: “John Joseph Carl Salinger”, 
“names”: 
[ 
{"type": "firstname", “value”: ”John”}, 
{“type”: “middlename”, “value”: “Joseph”}, 
{“type”: “middlename”, “value”: “Carl”}, 
{“type”: “lastname”, “value”: “Salinger”} 
] 
} 
© 2014 EnterpriseDB Corporation. All rights reserved. 10
A simple query for JSON data 
SELECT DISTINCT 
data->>'name' as products 
FROM json_data; 
products 
------------------------------ 
Cable TV Basic Service 
Package 
AC3 Case Black 
Phone Service Basic Plan 
AC3 Phone 
AC3 Case Green 
Phone Service Family Plan 
AC3 Case Red 
AC7 Phone 
© 2014 EnterpriseDB Corporation. All rights reserved. 11 
This query does not 
return JSON data – it 
returns text values 
associated with the 
key ‘name’
A query that returns JSON data 
SELECT data FROM json_data; 
data 
------------------------------------------ 
{"name": "Apple Phone", "type": "phone", 
"brand": "ACME", "price": 200, 
"available": true, "warranty_years": 1} 
This query returns the JSON data in its 
original format 
© 2014 EnterpriseDB Corporation. All rights reserved. 12
JSON and ANSI SQL - PB&J for the DBA 
• JSON is naturally 
integrated with ANSI SQL 
in Postgres 
• JSON and SQL queries 
use the same language, the 
same planner, and the same ACID compliant 
transaction framework 
• JSON and HSTORE are elegant and easy to use 
extensions of the underlying object-relational model 
© 2014 EnterpriseDB Corporation. All rights reserved. 13
JSON and ANSI SQL Example 
SELECT DISTINCT 
product_type, 
data->>'brand' as Brand, 
data->>'available' as Availability 
FROM json_data 
JOIN products 
ON (products.product_type=json_data.data->>'name') 
WHERE json_data.data->>'available'=true; 
product_type | brand | availability 
---------------------------+-----------+-------------- 
AC3 Phone | ACME | true 
ANSI SQL 
© 2014 EnterpriseDB Corporation. All rights reserved. 14 
JSON 
No need for programmatic logic to combine SQL and 
NoSQL in the application – Postgres does it all
Bridging between SQL and JSON 
Simple ANSI SQL Table Definition 
CREATE TABLE products (id integer, product_name text ); 
Select query returning standard data set 
SELECT * FROM products; 
id | product_name 
----+-------------- 
1 | iPhone 
2 | Samsung 
3 | Nokia 
Select query returning the same result as a JSON data set 
SELECT ROW_TO_JSON(products) FROM products; 
{"id":1,"product_name":"iPhone"} 
{"id":2,"product_name":"Samsung"} 
{"id":3,"product_name":"Nokia”} 
© 2014 EnterpriseDB Corporation. All rights reserved. 15
JSON Data Types 
• 1. Number: 
− Signed decimal number that may contain a fractional part and may use exponential 
notation. 
− No distinction between integer and floating-point 
• 2. String 
− A sequence of zero or more Unicode characters. 
− Strings are delimited with double-quotation mark 
− Supports a backslash escaping syntax. 
• 3. Boolean 
− Either of the values true or false. 
• 4. Array 
− An ordered list of zero or more values, 
− Each values may be of any type. 
− Arrays use square bracket notation with elements being comma-separated. 
• 5. Object 
− An unordered associative array (name/value pairs). 
− Objects are delimited with curly brackets 
− Commas to separate each pair 
− Each pair the colon ':' character separates the key or name from its value. 
− All keys must be strings and should be distinct from each other within that object. 
• 6. null 
− An empty value, using the word null 
© 2014 EnterpriseDB Corporation. All rights reserved. 16 
JSON is defined per RFC – 7159 
For more detail please refer 
http://tools.ietf.org/html/rfc7159
JSON Data Type Example 
{ 
"firstName": "John", -- String Type 
"lastName": "Smith", -- String Type 
"isAlive": true, -- Boolean Type 
"age": 25, -- Number Type 
"height_cm": 167.6, -- Number Type 
"address": { -- Object Type 
"streetAddress": "21 2nd Street”, 
"city": "New York”, 
"state": "NY”, 
"postalCode": "10021-3100” 
}, 
"phoneNumbers": [ // Object Array 
{ // Object 
"type": "home”, 
"number": "212 555-1234” 
}, 
{ 
"type": "office”, 
"number": "646 555-4567” 
} 
], 
"children": [], 
"spouse": null // Null 
} 
© 2014 EnterpriseDB Corporation. All rights reserved. 17
JSON 9.4 – New Operators and Functions 
• JSON 
− New JSON creation functions (json_build_object, json_build_array) 
− json_typeof – returns text data type (‘number’, ‘boolean’, …) 
• JSONB data type 
− Canonical representation 
− Whitespace and punctuation dissolved away 
− Only one value per object key is kept 
− Last insert wins 
− Key order determined by length, then bytewise comparison 
− Equality, containment and key/element presence tests 
− New JSONB creation functions 
− Smaller, faster GIN indexes 
− jsonb subdocument indexes 
− Use “get” operators to construct expression indexes on subdocument: 
− CREATE INDEX author_index ON books USING GIN ((jsondata -> 
'authors')); 
− SELECT * FROM books WHERE jsondata -> 'authors' ? 'Carl 
Bernstein' 
© 2014 EnterpriseDB Corporation. All rights reserved. 18
JSON and BSON 
• BSON – stands for 
‘Binary JSON’ 
• BSON != JSONB 
− BSON cannot represent an integer or 
floating-point number with more than 
64 bits of precision. 
− JSONB can represent arbitrary JSON values. 
• Caveat Emptor! 
− This limitation will not be obvious during early 
stages of a project! 
© 2014 EnterpriseDB Corporation. All rights reserved. 19
JSON, JSONB or HSTORE? 
• JSON/JSONB is more versatile than HSTORE 
• HSTORE provides more structure 
• JSON or JSONB? 
− if you need any of the following, use JSON 
− Storage of validated json, without processing or indexing it 
− Preservation of white space in json text 
− Preservation of object key order Preservation of duplicate object 
keys 
− Maximum input/output speed 
• For any other case, use JSONB 
© 2014 EnterpriseDB Corporation. All rights reserved. 20
JSONB and Node.js - Easy as π 
• Simple Demo of Node.js to Postgres cnnection 
© 2014 EnterpriseDB Corporation. All rights reserved. 21
JSON Performance Evaluation 
• Goal 
− Help our customers understand when to chose 
Postgres and when to chose a specialty 
solution 
− Help us understand where the NoSQL limits of 
Postgres are 
• Setup 
− Compare Postgres 9.4 to Mongo 2.6 
− Single instance setup on AWS M3.2XLARGE 
(32GB) 
• Test Focus 
− Data ingestion (bulk and individual) 
− Data retrieval 
© 2014 EnterpriseDB Corporation. All rights reserved. 22
Performance Evaluation 
Generate 50 Million 
JSON Documents 
Load into MongoDB 2.6 
© 2014 EnterpriseDB Corporation. All rights reserved. 23 
(IMPORT) 
Load into 
Postgres 9.4 
(COPY) 
50 Million individual 
INSERT commands 
50 Million individual 
INSERT commands 
Multiple SELECT 
statements 
Multiple SELECT 
statements 
T1 
T2 
T3
NoSQL Performance Evaluation 
© 2014 EnterpriseDB Corporation. All rights reserved. 24 
Correction to earlier versions: 
MongoDB console does not allow for 
INSERT of documents > 4K. This 
lead to truncation of the MongoDB 
size by approx. 25% of all records in 
the benchmark.
Performance Evaluations – Next Steps 
• Initial tests confirm that Postgres’ can handle many 
NoSQL workloads 
• EDB is making the test scripts publically available 
• EDB encourages community participation to 
better define where Postgres should be used 
and where specialty solutions are appropriate 
• Download the source at 
https://github.com/EnterpriseDB/pg_nosql_benchmark 
• Join us to discuss the findings at 
http://bit.ly/EDB-NoSQL-Postgres-Benchmark 
© 2014 EnterpriseDB Corporation. All rights reserved. 25
Structured or Unstructured? 
“No SQL Only” or “Not Only SQL”? 
• Structures and standards emerge! 
• Data has references (products link to catalogues; 
products have bills of material; components appear in 
multiple products; storage locations link to ISO country 
tables) 
• When the database has duplicate data entries, then the 
application has to manage updates in multiple places – 
what happens when there is no ACID transactional 
model? 
© 2014 EnterpriseDB Corporation. All rights reserved. 26
Ultimate Flexibility with Postgres 
© 2014 EnterpriseDB Corporation. All rights reserved. 27
Say yes to ‘Not only SQL’ 
• Postgres overcomes many of the standard objections 
“It can’t be done with a conventional database system” 
• Postgres 
− Combines structured data and unstructured data (ANSI SQL 
and JSON/HSTORE) 
− Is faster (for many workloads) than than the leading NoSQL-only 
solution 
− Integrates easily with Web 2.0 application development 
environments 
− Can be deployed on-premise or in the cloud 
Do more with Postgres – the Enterprise NoSQL Solution 
© 2014 EnterpriseDB Corporation. All rights reserved. 28
Useful Resources 
• Postgres NoSQL Training Events 
− Bruce Momjian & Vibhor Kumar @ pgEurope 
− Madrid (Oct 21): Maximizing Results with JSONB and PostgreSQL 
• Whitepapers @ http://www.enterprisedb.com/nosql-for-enterprise 
− PostgreSQL Advances to Meet NoSQL Challenges (business 
oriented) 
− Using the NoSQL Capabilities in Postgres (full of code examples) 
• Run the NoSQL benchmark 
− https://github.com/EnterpriseDB/pg_nosql_benchmark 
© 2014 EnterpriseDB Corporation. All rights reserved. 29
© 2014 EnterpriseDB Corporation. All rights reserved. 30

More Related Content

What's hot (20)

PPT
Database design
Dhani Ahmad
 
PPT
Algebra relacional
Luis Polanco-Balcazar
 
PPTX
Schema migrations in no sql
Dr-Dipali Meher
 
PPTX
NoSQL databases - An introduction
Pooyan Mehrparvar
 
PPTX
Sql vs NoSQL
RTigger
 
ODP
Dimensional Modelling
Prithwis Mukerjee
 
PDF
Overview of Databases and Data Modelling-1.pdf
Christalin Nelson
 
PPT
DbMs
amanrock2012
 
PDF
Relational_Algebra_Calculus Operations.pdf
Christalin Nelson
 
PPTX
5. stored procedure and functions
Amrit Kaur
 
PDF
Overview of Databases and Data Modelling-2.pdf
Christalin Nelson
 
PPT
SAP ABAP - Needed Notes
Akash Bhavsar
 
PDF
Overview of Database and Database Management
Mayuree Srikulwong
 
PPTX
PL/SQL Interview Questions
Srinimf-Slides
 
PPS
Database Design Slide 1
ahfiki
 
PPTX
Oracle sql analytic functions
mamamowebby
 
PDF
Converting Relational to Graph Databases
Antonio Maccioni
 
PDF
NoSQL databases
Marin Dimitrov
 
PPTX
Intro To Mongo Db
chriskite
 
Database design
Dhani Ahmad
 
Algebra relacional
Luis Polanco-Balcazar
 
Schema migrations in no sql
Dr-Dipali Meher
 
NoSQL databases - An introduction
Pooyan Mehrparvar
 
Sql vs NoSQL
RTigger
 
Dimensional Modelling
Prithwis Mukerjee
 
Overview of Databases and Data Modelling-1.pdf
Christalin Nelson
 
Relational_Algebra_Calculus Operations.pdf
Christalin Nelson
 
5. stored procedure and functions
Amrit Kaur
 
Overview of Databases and Data Modelling-2.pdf
Christalin Nelson
 
SAP ABAP - Needed Notes
Akash Bhavsar
 
Overview of Database and Database Management
Mayuree Srikulwong
 
PL/SQL Interview Questions
Srinimf-Slides
 
Database Design Slide 1
ahfiki
 
Oracle sql analytic functions
mamamowebby
 
Converting Relational to Graph Databases
Antonio Maccioni
 
NoSQL databases
Marin Dimitrov
 
Intro To Mongo Db
chriskite
 

Viewers also liked (20)

PDF
Postgres NoSQL - Delivering Apps Faster
EDB
 
PDF
Ten Reasons Why You Should Prefer PostgreSQL to MySQL
anandology
 
PPTX
NoSQL on ACID: Meet Unstructured Postgres
EDB
 
PPT
5 Data Modeling for NoSQL 1/2
Fabio Fumarola
 
ODP
The PostgreSQL JSON Feature Tour
Stefanie Janine Stölting
 
PDF
Postgres como base de datos NoSQL. Codemotion 2015
Ruben Gómez García
 
PDF
EnterpriseDB Positioned in Leaders Quadrant in Gartner Magic Quadrant
EDB
 
PPT
Mongodb
Manav Prasad
 
PDF
Intro to Postgres 9 Tutorial
Robert Treat
 
PDF
Native JSON Support in SQL2016
Ivo Andreev
 
PDF
Sql or no sql, that is the question
hugo lu
 
PDF
Foreign Data Wrappers and You with Postgres
EDB
 
PDF
Java Micro Edition (ME) 8 Deep Dive
terrencebarr
 
PDF
Writing A Foreign Data Wrapper
psoo1978
 
PDF
On Beyond (PostgreSQL) Data Types
Jonathan Katz
 
ODP
Introduction to PostgreSQL
Jim Mlodgenski
 
PPTX
Migrating Data Pipeline from MongoDB to Cassandra
Demi Ben-Ari
 
PDF
NoSQL + SQL = PostgreSQL (TDC2014 - Porto Alegre/RS)
Fabrízio Mello
 
PPT
Best Practices: Data Admin & Data Management
Empowered Holdings, LLC
 
PPTX
Repeating History...On Purpose...with Elixir
Barry Jones
 
Postgres NoSQL - Delivering Apps Faster
EDB
 
Ten Reasons Why You Should Prefer PostgreSQL to MySQL
anandology
 
NoSQL on ACID: Meet Unstructured Postgres
EDB
 
5 Data Modeling for NoSQL 1/2
Fabio Fumarola
 
The PostgreSQL JSON Feature Tour
Stefanie Janine Stölting
 
Postgres como base de datos NoSQL. Codemotion 2015
Ruben Gómez García
 
EnterpriseDB Positioned in Leaders Quadrant in Gartner Magic Quadrant
EDB
 
Mongodb
Manav Prasad
 
Intro to Postgres 9 Tutorial
Robert Treat
 
Native JSON Support in SQL2016
Ivo Andreev
 
Sql or no sql, that is the question
hugo lu
 
Foreign Data Wrappers and You with Postgres
EDB
 
Java Micro Edition (ME) 8 Deep Dive
terrencebarr
 
Writing A Foreign Data Wrapper
psoo1978
 
On Beyond (PostgreSQL) Data Types
Jonathan Katz
 
Introduction to PostgreSQL
Jim Mlodgenski
 
Migrating Data Pipeline from MongoDB to Cassandra
Demi Ben-Ari
 
NoSQL + SQL = PostgreSQL (TDC2014 - Porto Alegre/RS)
Fabrízio Mello
 
Best Practices: Data Admin & Data Management
Empowered Holdings, LLC
 
Repeating History...On Purpose...with Elixir
Barry Jones
 
Ad

Similar to The NoSQL Way in Postgres (20)

PDF
No sql way_in_pg
Vibhor Kumar
 
PPT
Do More with Postgres- NoSQL Applications for the Enterprise
EDB
 
PDF
NoSQL on ACID - Meet Unstructured Postgres
EDB
 
PDF
NoSQL Now: Postgres - The NoSQL Cake You Can Eat
DATAVERSITY
 
PDF
EDB NoSQL German Webinar 2015
EDB
 
PDF
Postgres: The NoSQL Cake You Can Eat
EDB
 
PDF
Webscale PostgreSQL - JSONB and Horizontal Scaling Strategies
Jonathan Katz
 
PDF
Postgrtesql as a NoSQL Document Store - The JSON/JSONB data type
Jumping Bean
 
PDF
There is Javascript in my SQL
PGConf APAC
 
PDF
Json in Postgres - the Roadmap
EDB
 
PDF
Postgres vs Mongo / Олег Бартунов (Postgres Professional)
Ontico
 
PDF
NoSQL and Spatial Database Capabilities using PostgreSQL
EDB
 
PPTX
Power JSON with PostgreSQL
EDB
 
PDF
JSON Support in DB2 for z/OS
Jane Man
 
PDF
PG Day'14 Russia, Работа со слабо-структурированными данными в PostgreSQL, Ол...
pgdayrussia
 
PDF
Oh, that ubiquitous JSON !
Alexander Korotkov
 
PDF
PostgreSQL 9.3 and JSON - talk at PgOpen 2013
Andrew Dunstan
 
PDF
Andrew Dunstan 9.3 JSON Presentation @ Postgres Open 2013
PostgresOpen
 
PDF
Mathias test
Mathias Stjernström
 
PDF
PostgreSQL, your NoSQL database
Reuven Lerner
 
No sql way_in_pg
Vibhor Kumar
 
Do More with Postgres- NoSQL Applications for the Enterprise
EDB
 
NoSQL on ACID - Meet Unstructured Postgres
EDB
 
NoSQL Now: Postgres - The NoSQL Cake You Can Eat
DATAVERSITY
 
EDB NoSQL German Webinar 2015
EDB
 
Postgres: The NoSQL Cake You Can Eat
EDB
 
Webscale PostgreSQL - JSONB and Horizontal Scaling Strategies
Jonathan Katz
 
Postgrtesql as a NoSQL Document Store - The JSON/JSONB data type
Jumping Bean
 
There is Javascript in my SQL
PGConf APAC
 
Json in Postgres - the Roadmap
EDB
 
Postgres vs Mongo / Олег Бартунов (Postgres Professional)
Ontico
 
NoSQL and Spatial Database Capabilities using PostgreSQL
EDB
 
Power JSON with PostgreSQL
EDB
 
JSON Support in DB2 for z/OS
Jane Man
 
PG Day'14 Russia, Работа со слабо-структурированными данными в PostgreSQL, Ол...
pgdayrussia
 
Oh, that ubiquitous JSON !
Alexander Korotkov
 
PostgreSQL 9.3 and JSON - talk at PgOpen 2013
Andrew Dunstan
 
Andrew Dunstan 9.3 JSON Presentation @ Postgres Open 2013
PostgresOpen
 
Mathias test
Mathias Stjernström
 
PostgreSQL, your NoSQL database
Reuven Lerner
 
Ad

More from EDB (20)

PDF
Cloud Migration Paths: Kubernetes, IaaS, or DBaaS
EDB
 
PDF
Die 10 besten PostgreSQL-Replikationsstrategien für Ihr Unternehmen
EDB
 
PDF
Migre sus bases de datos Oracle a la nube
EDB
 
PDF
EFM Office Hours - APJ - July 29, 2021
EDB
 
PDF
Benchmarking Cloud Native PostgreSQL
EDB
 
PDF
Las Variaciones de la Replicación de PostgreSQL
EDB
 
PDF
Is There Anything PgBouncer Can’t Do?
EDB
 
PDF
Data Analysis with TensorFlow in PostgreSQL
EDB
 
PDF
Practical Partitioning in Production with Postgres
EDB
 
PDF
A Deeper Dive into EXPLAIN
EDB
 
PDF
IOT with PostgreSQL
EDB
 
PDF
A Journey from Oracle to PostgreSQL
EDB
 
PDF
Psql is awesome!
EDB
 
PDF
EDB 13 - New Enhancements for Security and Usability - APJ
EDB
 
PPTX
Comment sauvegarder correctement vos données
EDB
 
PDF
Cloud Native PostgreSQL - Italiano
EDB
 
PDF
New enhancements for security and usability in EDB 13
EDB
 
PPTX
Best Practices in Security with PostgreSQL
EDB
 
PDF
Cloud Native PostgreSQL - APJ
EDB
 
PDF
Best Practices in Security with PostgreSQL
EDB
 
Cloud Migration Paths: Kubernetes, IaaS, or DBaaS
EDB
 
Die 10 besten PostgreSQL-Replikationsstrategien für Ihr Unternehmen
EDB
 
Migre sus bases de datos Oracle a la nube
EDB
 
EFM Office Hours - APJ - July 29, 2021
EDB
 
Benchmarking Cloud Native PostgreSQL
EDB
 
Las Variaciones de la Replicación de PostgreSQL
EDB
 
Is There Anything PgBouncer Can’t Do?
EDB
 
Data Analysis with TensorFlow in PostgreSQL
EDB
 
Practical Partitioning in Production with Postgres
EDB
 
A Deeper Dive into EXPLAIN
EDB
 
IOT with PostgreSQL
EDB
 
A Journey from Oracle to PostgreSQL
EDB
 
Psql is awesome!
EDB
 
EDB 13 - New Enhancements for Security and Usability - APJ
EDB
 
Comment sauvegarder correctement vos données
EDB
 
Cloud Native PostgreSQL - Italiano
EDB
 
New enhancements for security and usability in EDB 13
EDB
 
Best Practices in Security with PostgreSQL
EDB
 
Cloud Native PostgreSQL - APJ
EDB
 
Best Practices in Security with PostgreSQL
EDB
 

Recently uploaded (20)

PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
Safe Software
 
PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
PDF
Kit-Works Team Study_20250627_한달만에만든사내서비스키링(양다윗).pdf
Wonjun Hwang
 
PDF
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
Edge AI and Vision Alliance
 
PDF
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
PPTX
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
PDF
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
PDF
Future-Proof or Fall Behind? 10 Tech Trends You Can’t Afford to Ignore in 2025
DIGITALCONFEX
 
PPT
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
PPTX
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PDF
UiPath DevConnect 2025: Agentic Automation Community User Group Meeting
DianaGray10
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PPTX
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PPTX
Digital Circuits, important subject in CS
contactparinay1
 
PPTX
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
Safe Software
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
Kit-Works Team Study_20250627_한달만에만든사내서비스키링(양다윗).pdf
Wonjun Hwang
 
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
Edge AI and Vision Alliance
 
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
Future-Proof or Fall Behind? 10 Tech Trends You Can’t Afford to Ignore in 2025
DIGITALCONFEX
 
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
UiPath DevConnect 2025: Agentic Automation Community User Group Meeting
DianaGray10
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
Digital Circuits, important subject in CS
contactparinay1
 
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 

The NoSQL Way in Postgres

  • 1. NoSQL way in PostgreSQL Vibhor Kumar (Principal System Engineer)
  • 2. Agenda • Intro to JSON, HSTORE and PL/V8 • JSON History in Postgres • JSON Data Types, Operators and Functions • JSON, JSONB– when to use which one? • JSONB and Node.JS – easy as pie • NoSQL Performance in Postgres – fast as greased lightning • Say ‘Yes’ to ‘Not only SQL’ • Useful resources © 2014 EnterpriseDB Corporation. All rights reserved. 2
  • 3. Let’s Ask Ourselves, Why NoSQL? • Where did NoSQL come from? − Where all cool tech stuff comes from – Internet companies • Why did they make NoSQL? − To support huge data volumes and evolving demands for ways to work with new data types • What does NoSQL accomplish? − Enables you to work with new data types: email, mobile interactions, machine data, social connections − Enables you to work in new ways: incremental development and continuous release • Why did they have to build something new? − There were limitations to most relational databases © 2014 EnterpriseDB Corporation. All rights reserved. 3
  • 4. NoSQL: Real-world Applications • Emergency Management System − High variability among data sources required high schema flexibility • Massively Open Online Course − Massive read scalability, content integration, low latency • Patient Data and Prescription Records − Efficient write scalability • Social Marketing Analytics − Map reduce analytical approaches Source: Gartner, A Tour of NoSQL in 8 Use Cases, by Nick Heudecker and Merv Adrian, February 28, 2014 © 2014 EnterpriseDB Corporation. All rights reserved. 4
  • 5. Postgres’ Response • HSTORE − Key-value pair − Simple, fast and easy − Postgres v 8.2 – pre-dates many NoSQL-only solutions − Ideal for flat data structures that are sparsely populated • JSON − Hierarchical document model − Introduced in Postgres 9.2, perfected in 9.3 • JSONB − Binary version of JSON − Faster, more operators and even more robust − Postgres 9.4 © 2014 EnterpriseDB Corporation. All rights reserved. 5
  • 6. Postgres: Key-value Store • Supported since 2006, the HStore contrib module enables storing key/value pairs within a single column • Allows you to create a schema-less, ACID compliant data store within Postgres • Create single HStore column and include, for each row, only those keys which pertain to the record • Add attributes to a table and query without advance planning •Combines flexibility with ACID compliance © 2014 EnterpriseDB Corporation. All rights reserved. 6
  • 7. HSTORE Examples • Create a table with HSTORE field CREATE TABLE hstore_data (data HSTORE); • Insert a record into hstore_data INSERT INTO hstore_data (data) VALUES (’ "cost"=>"500", "product"=>"iphone", "provider"=>"apple"'); • Select data from hstore_data SELECT data FROM hstore_data ; ------------------------------------------ "cost"=>"500”,"product"=>"iphone”,"provider"=>"Apple" (1 row) © 2014 EnterpriseDB Corporation. All rights reserved. 7
  • 8. Postgres: Document Store • JSON is the most popular data-interchange format on the web • Derived from the ECMAScript Programming Language Standard (European Computer Manufacturers Association). • Supported by virtually every programming language • New supporting technologies continue to expand JSON’s utility − PL/V8 JavaScript extension − Node.js • Postgres has a native JSON data type (v9.2) and a JSON parser and a variety of JSON functions (v9.3) • Postgres will have a JSONB data type with binary storage and indexing (coming – v9.4) © 2014 EnterpriseDB Corporation. All rights reserved. 8
  • 9. JSON Examples • Creating a table with a JSONB field CREATE TABLE json_data (data JSONB); • Simple JSON data element: {"name": "Apple Phone", "type": "phone", "brand": "ACME", "price": 200, "available": true, "warranty_years": 1} • Inserting this data element into the table json_data INSERT INTO json_data (data) VALUES (’ { "name": "Apple Phone", "type": "phone", "brand": "ACME", "price": 200, "available": true, "warranty_years": 1 } ') © 2014 EnterpriseDB Corporation. All rights reserved. 9
  • 10. JSON Examples • JSON data element with nesting: {“full name”: “John Joseph Carl Salinger”, “names”: [ {"type": "firstname", “value”: ”John”}, {“type”: “middlename”, “value”: “Joseph”}, {“type”: “middlename”, “value”: “Carl”}, {“type”: “lastname”, “value”: “Salinger”} ] } © 2014 EnterpriseDB Corporation. All rights reserved. 10
  • 11. A simple query for JSON data SELECT DISTINCT data->>'name' as products FROM json_data; products ------------------------------ Cable TV Basic Service Package AC3 Case Black Phone Service Basic Plan AC3 Phone AC3 Case Green Phone Service Family Plan AC3 Case Red AC7 Phone © 2014 EnterpriseDB Corporation. All rights reserved. 11 This query does not return JSON data – it returns text values associated with the key ‘name’
  • 12. A query that returns JSON data SELECT data FROM json_data; data ------------------------------------------ {"name": "Apple Phone", "type": "phone", "brand": "ACME", "price": 200, "available": true, "warranty_years": 1} This query returns the JSON data in its original format © 2014 EnterpriseDB Corporation. All rights reserved. 12
  • 13. JSON and ANSI SQL - PB&J for the DBA • JSON is naturally integrated with ANSI SQL in Postgres • JSON and SQL queries use the same language, the same planner, and the same ACID compliant transaction framework • JSON and HSTORE are elegant and easy to use extensions of the underlying object-relational model © 2014 EnterpriseDB Corporation. All rights reserved. 13
  • 14. JSON and ANSI SQL Example SELECT DISTINCT product_type, data->>'brand' as Brand, data->>'available' as Availability FROM json_data JOIN products ON (products.product_type=json_data.data->>'name') WHERE json_data.data->>'available'=true; product_type | brand | availability ---------------------------+-----------+-------------- AC3 Phone | ACME | true ANSI SQL © 2014 EnterpriseDB Corporation. All rights reserved. 14 JSON No need for programmatic logic to combine SQL and NoSQL in the application – Postgres does it all
  • 15. Bridging between SQL and JSON Simple ANSI SQL Table Definition CREATE TABLE products (id integer, product_name text ); Select query returning standard data set SELECT * FROM products; id | product_name ----+-------------- 1 | iPhone 2 | Samsung 3 | Nokia Select query returning the same result as a JSON data set SELECT ROW_TO_JSON(products) FROM products; {"id":1,"product_name":"iPhone"} {"id":2,"product_name":"Samsung"} {"id":3,"product_name":"Nokia”} © 2014 EnterpriseDB Corporation. All rights reserved. 15
  • 16. JSON Data Types • 1. Number: − Signed decimal number that may contain a fractional part and may use exponential notation. − No distinction between integer and floating-point • 2. String − A sequence of zero or more Unicode characters. − Strings are delimited with double-quotation mark − Supports a backslash escaping syntax. • 3. Boolean − Either of the values true or false. • 4. Array − An ordered list of zero or more values, − Each values may be of any type. − Arrays use square bracket notation with elements being comma-separated. • 5. Object − An unordered associative array (name/value pairs). − Objects are delimited with curly brackets − Commas to separate each pair − Each pair the colon ':' character separates the key or name from its value. − All keys must be strings and should be distinct from each other within that object. • 6. null − An empty value, using the word null © 2014 EnterpriseDB Corporation. All rights reserved. 16 JSON is defined per RFC – 7159 For more detail please refer http://tools.ietf.org/html/rfc7159
  • 17. JSON Data Type Example { "firstName": "John", -- String Type "lastName": "Smith", -- String Type "isAlive": true, -- Boolean Type "age": 25, -- Number Type "height_cm": 167.6, -- Number Type "address": { -- Object Type "streetAddress": "21 2nd Street”, "city": "New York”, "state": "NY”, "postalCode": "10021-3100” }, "phoneNumbers": [ // Object Array { // Object "type": "home”, "number": "212 555-1234” }, { "type": "office”, "number": "646 555-4567” } ], "children": [], "spouse": null // Null } © 2014 EnterpriseDB Corporation. All rights reserved. 17
  • 18. JSON 9.4 – New Operators and Functions • JSON − New JSON creation functions (json_build_object, json_build_array) − json_typeof – returns text data type (‘number’, ‘boolean’, …) • JSONB data type − Canonical representation − Whitespace and punctuation dissolved away − Only one value per object key is kept − Last insert wins − Key order determined by length, then bytewise comparison − Equality, containment and key/element presence tests − New JSONB creation functions − Smaller, faster GIN indexes − jsonb subdocument indexes − Use “get” operators to construct expression indexes on subdocument: − CREATE INDEX author_index ON books USING GIN ((jsondata -> 'authors')); − SELECT * FROM books WHERE jsondata -> 'authors' ? 'Carl Bernstein' © 2014 EnterpriseDB Corporation. All rights reserved. 18
  • 19. JSON and BSON • BSON – stands for ‘Binary JSON’ • BSON != JSONB − BSON cannot represent an integer or floating-point number with more than 64 bits of precision. − JSONB can represent arbitrary JSON values. • Caveat Emptor! − This limitation will not be obvious during early stages of a project! © 2014 EnterpriseDB Corporation. All rights reserved. 19
  • 20. JSON, JSONB or HSTORE? • JSON/JSONB is more versatile than HSTORE • HSTORE provides more structure • JSON or JSONB? − if you need any of the following, use JSON − Storage of validated json, without processing or indexing it − Preservation of white space in json text − Preservation of object key order Preservation of duplicate object keys − Maximum input/output speed • For any other case, use JSONB © 2014 EnterpriseDB Corporation. All rights reserved. 20
  • 21. JSONB and Node.js - Easy as π • Simple Demo of Node.js to Postgres cnnection © 2014 EnterpriseDB Corporation. All rights reserved. 21
  • 22. JSON Performance Evaluation • Goal − Help our customers understand when to chose Postgres and when to chose a specialty solution − Help us understand where the NoSQL limits of Postgres are • Setup − Compare Postgres 9.4 to Mongo 2.6 − Single instance setup on AWS M3.2XLARGE (32GB) • Test Focus − Data ingestion (bulk and individual) − Data retrieval © 2014 EnterpriseDB Corporation. All rights reserved. 22
  • 23. Performance Evaluation Generate 50 Million JSON Documents Load into MongoDB 2.6 © 2014 EnterpriseDB Corporation. All rights reserved. 23 (IMPORT) Load into Postgres 9.4 (COPY) 50 Million individual INSERT commands 50 Million individual INSERT commands Multiple SELECT statements Multiple SELECT statements T1 T2 T3
  • 24. NoSQL Performance Evaluation © 2014 EnterpriseDB Corporation. All rights reserved. 24 Correction to earlier versions: MongoDB console does not allow for INSERT of documents > 4K. This lead to truncation of the MongoDB size by approx. 25% of all records in the benchmark.
  • 25. Performance Evaluations – Next Steps • Initial tests confirm that Postgres’ can handle many NoSQL workloads • EDB is making the test scripts publically available • EDB encourages community participation to better define where Postgres should be used and where specialty solutions are appropriate • Download the source at https://github.com/EnterpriseDB/pg_nosql_benchmark • Join us to discuss the findings at http://bit.ly/EDB-NoSQL-Postgres-Benchmark © 2014 EnterpriseDB Corporation. All rights reserved. 25
  • 26. Structured or Unstructured? “No SQL Only” or “Not Only SQL”? • Structures and standards emerge! • Data has references (products link to catalogues; products have bills of material; components appear in multiple products; storage locations link to ISO country tables) • When the database has duplicate data entries, then the application has to manage updates in multiple places – what happens when there is no ACID transactional model? © 2014 EnterpriseDB Corporation. All rights reserved. 26
  • 27. Ultimate Flexibility with Postgres © 2014 EnterpriseDB Corporation. All rights reserved. 27
  • 28. Say yes to ‘Not only SQL’ • Postgres overcomes many of the standard objections “It can’t be done with a conventional database system” • Postgres − Combines structured data and unstructured data (ANSI SQL and JSON/HSTORE) − Is faster (for many workloads) than than the leading NoSQL-only solution − Integrates easily with Web 2.0 application development environments − Can be deployed on-premise or in the cloud Do more with Postgres – the Enterprise NoSQL Solution © 2014 EnterpriseDB Corporation. All rights reserved. 28
  • 29. Useful Resources • Postgres NoSQL Training Events − Bruce Momjian & Vibhor Kumar @ pgEurope − Madrid (Oct 21): Maximizing Results with JSONB and PostgreSQL • Whitepapers @ http://www.enterprisedb.com/nosql-for-enterprise − PostgreSQL Advances to Meet NoSQL Challenges (business oriented) − Using the NoSQL Capabilities in Postgres (full of code examples) • Run the NoSQL benchmark − https://github.com/EnterpriseDB/pg_nosql_benchmark © 2014 EnterpriseDB Corporation. All rights reserved. 29
  • 30. © 2014 EnterpriseDB Corporation. All rights reserved. 30

Editor's Notes

  • #2: Title slide Add customer logo if applicable
  • #3: This is the roadmap for content of the meeting Adjust as necessary based on previous discussions on objectives and areas of interest Goal is to set expectations for content to be discussed Agree on specific meeting objectives/outcomes up front Inquire about any specific areas of interest or current needs that should be emphasized
  • #4: Bruce
  • #5: Marc Linster Emergency Management System: Rapid integration of heterogeneous data sources for new analysis platform - High variability among data sources required high schema flexibility MOOC: - Massive read sclability - Content integration from a variety of sources - Low latency at peak volumes Patient data and prescription records - Distributed key-value store reduced complexity of integration and management overhead Online Gambling - Key value store simplified development (up to 75% acceleration) - Efficient write sclability Online Marketing Firm - massive time series storage facilitated by table style DBMS - data center replication Social Marketing Analytics - Massive amounts of data from social media systems - Map-reduce analytical approaches Product Manufacturing Master Data Management - Network dependencies between product lines Drug Discovery - Drug Analysis
  • #7: Bruce
  • #9: Bruce
  • #17: Note: 1. JSON generally ignores any whitespace around or between syntactic elements (values and punctuation, but not within a string value). 2. JSON only recognizes four specific whitespace characters: i. the space ii. horizontal tab, iii. line feed, and carriage return. 3. JSON does not provide or allow any sort of comment syntax.