SlideShare a Scribd company logo
Enabling Data-Driven
Decisions with Automated
Insights
Charlotte Emms
Data Insight Analyst @ Seenit
charlotte@seenit.io
Charlotte Emms
Data Insight Analyst @ Seenit
- 4 years of experience in the Data Analytics field
- Joined Seenit’s Product team in Nov ‘17
Introducing Seenit
A video collaboration platform that enables
organisations to co-create authentic and
engaging content with their communities
of customers, superfans, or employees.
What is Seenit?
A Brief History of Seenit
Founded in
January
2014
Over 200k
uploads hit
4k+ monthly
active users
MVP used Couchbase
deployed on GCP
Still the same today!
Seenit Capture v2.0
released in April 2018
Our Couchbase Journey
• Seenit’s MVP in 2014 was powered by Couchbase 2.2
back when querying the data required MapReduce
techniques
• The introduction of N1QL transformed Seenit, allowing
us to develop new features rapidly and analyse event-
based audit data for product usage research
• N1QL allowed Seenit to easily understand this data by
hiring a Data Analyst proficient in SQL
Non-first (N1) Normal Form Query Language (QL)
• It is based on ANSI 92 SQL
• Its query engine is optimized for modern, highly parallel multi-core
execution
SQL-like Query Language
• Expressive, familiar, and feature-rich language for querying,
transforming, and manipulating JSON data
N1QL extends SQL to handle data that is:
• Nested: Contains nested objects, arrays
• Heterogeneous: Schema-optional, non-uniform
• Distributed: Partitioned across a cluster
What is N1QL?
...this is not a scalable approach for repeatable work!
The Seenit Data Analysis Toolkit
Display the results using
Plotly - a personal
favourite in terms of data
display versatility and
clarity
Extract data for analysis
using N1QL queries
through the Couchbase
Python SDK
Manipulate the data in
Jupyter Notebooks with
Python, using analytics
libraries like Pandas and
Numpy
How can I share
this data with the
wider team?
“Let’s build a dashboard”
- every data analyst joining a
startup ever (probably)
Tell a different
story with the data
How to Communicate Platform
Usage and Make it Interesting
There was a need for something more tangible. Something that
can engage someone from any role in the business
For example:
How many
uploads in a
week?
How far was
our global
reach this
week?
Which were the
active projects
this week & who
was running them
Have mobile
users been
interacting with
the in-app-feed?
Seenit’s Regular, Fully Automated Platform Update
Important
overused pun
Three core sections
○ This week in numbers
○ This week in lists (what’s new, what’s active)
○ This week in fun facts, which includes the following:
■ most exotic upload location (furthest from the office)
■ most committed contributor (uploads to projects)
■ biggest crowd pleaser (likes from others)
The “Engaging” Part
The Technical Journey
Productionised
● Introduced more
thorough HTML
templating using Jinja2
● Scheduled in Jenkins to
run once a week
● Runs in Kubernetes by
default - also set up to
run in Docker
Initial Idea / PoC
● Off the back of a
conversation at stand-up
● Designed a process in
Python through a Jupyter
notebook
● Query data in Couchbase,
manipulating and returning
in tabular form for friendly
email display
● Sent using my own gmail
account
MVP
● Found and wrestled with a
nicer looking email template
● Introduced fun facts section
● Moved code to a formal
Python project (separate
scripts, classes, functions)
using an executable file to
trigger the email generation
● Sent using Sendgrid as we do
for generic email
notifications from the
platform
Next steps
Another idea
DataBot
A Slack Bot user to
deliver bespoke
analytics on request
• Coded in Python and uses the
RTM (Real Time Messaging)
Slack API
• Mention the name of the bot
user to get a response
• The bot uses the first word
after the direct mention to
decide the correct response -
no AI/ML training or NLP
methods needed
Image
Image
Things I learnt
- You don’t need a relational database
structure to build automated data
programs
- You also don’t need a BI tool to
communicate data insights (at first*)
- Try new things even if they don’t go to
plan
- Learn new skills from teammates
- Welcome feedback and keep iterating!
Questions
Thank you
charlotte@seenit.io

More Related Content

What's hot (7)

PDF
NETWORK CENTRALITY IN SUB-NATIONAL AREAS OF INTEREST USING GDELT DATA
DataTactics
 
PDF
What can the cloud do for you?
Mind the Byte
 
PDF
Cloud Developer Days - BigQuery
Wlodek Bielski
 
PDF
Pivotal corporate story by CS Park
VMware Tanzu Korea
 
PDF
Integrating Web and Business Data
Safe Software
 
PDF
Making the Case for NoSQL
DATAVERSITY
 
PDF
Knative serving
Fagner Moura
 
NETWORK CENTRALITY IN SUB-NATIONAL AREAS OF INTEREST USING GDELT DATA
DataTactics
 
What can the cloud do for you?
Mind the Byte
 
Cloud Developer Days - BigQuery
Wlodek Bielski
 
Pivotal corporate story by CS Park
VMware Tanzu Korea
 
Integrating Web and Business Data
Safe Software
 
Making the Case for NoSQL
DATAVERSITY
 
Knative serving
Fagner Moura
 

Similar to Big Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTS (20)

PDF
Keynote at Converge 2019
Travis Oliphant
 
PDF
Gdsc IIIT Surat Orientation 2022.pdf
SparshJhariya2
 
PDF
Maruti gollapudi cv
Maruti Gollapudi
 
PDF
Using_python_webdevolopment_datascience.pdf
Sudipta Bhattacharya
 
PDF
How Celtra Optimizes its Advertising Platform with Databricks
Grega Kespret
 
DOC
Jitesh Agrawal plone
Jitesh Agrawal
 
DOC
Jitesh agrawal Resume
Jitesh Agrawal
 
PPTX
Group 3 slide presentation
Michael Young
 
PPTX
Cracking web development
Eyal Kenig
 
PDF
London atlassian meetup 31 jan 2016 jira metrics-extract slides
Rudiger Wolf
 
PDF
Architecting for analytics
Rob Winters
 
PPTX
Automate Hadoop Cluster Deployment in a Banking Ecosystem
Hellmar Becker
 
DOCX
Nitin_updated_Profile
Nitin Saxena
 
PDF
Practical automation for beginners
Seoweon Yoo
 
PDF
Jw13 developer-jive talks-presentation
Patrick Li
 
PDF
ML Platform Q1 Meetup: Airbnb's End-to-End Machine Learning Infrastructure
Fei Chen
 
DOC
Viswanathan CV
ViswanathanSubramani15
 
DOCX
Resume_Achin
Achin Singhal
 
PDF
Viswanth_chadalawada_ft_resume
Viswanth Chadalawada
 
PDF
Building the BI system and analytics capabilities at the company based on Rea...
GameCamp
 
Keynote at Converge 2019
Travis Oliphant
 
Gdsc IIIT Surat Orientation 2022.pdf
SparshJhariya2
 
Maruti gollapudi cv
Maruti Gollapudi
 
Using_python_webdevolopment_datascience.pdf
Sudipta Bhattacharya
 
How Celtra Optimizes its Advertising Platform with Databricks
Grega Kespret
 
Jitesh Agrawal plone
Jitesh Agrawal
 
Jitesh agrawal Resume
Jitesh Agrawal
 
Group 3 slide presentation
Michael Young
 
Cracking web development
Eyal Kenig
 
London atlassian meetup 31 jan 2016 jira metrics-extract slides
Rudiger Wolf
 
Architecting for analytics
Rob Winters
 
Automate Hadoop Cluster Deployment in a Banking Ecosystem
Hellmar Becker
 
Nitin_updated_Profile
Nitin Saxena
 
Practical automation for beginners
Seoweon Yoo
 
Jw13 developer-jive talks-presentation
Patrick Li
 
ML Platform Q1 Meetup: Airbnb's End-to-End Machine Learning Infrastructure
Fei Chen
 
Viswanathan CV
ViswanathanSubramani15
 
Resume_Achin
Achin Singhal
 
Viswanth_chadalawada_ft_resume
Viswanth Chadalawada
 
Building the BI system and analytics capabilities at the company based on Rea...
GameCamp
 
Ad

More from Matt Stubbs (20)

PDF
Blueprint Series: Banking In The Cloud – Ultra-high Reliability Architectures
Matt Stubbs
 
PDF
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
Matt Stubbs
 
PDF
Blueprint Series: Expedia Partner Solutions, Data Platform
Matt Stubbs
 
PDF
Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...
Matt Stubbs
 
PDF
Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.
Matt Stubbs
 
PDF
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
Matt Stubbs
 
PDF
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQL
Matt Stubbs
 
PDF
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Matt Stubbs
 
PDF
Big Data LDN 2018: AI VS. GDPR
Matt Stubbs
 
PDF
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Matt Stubbs
 
PDF
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
Matt Stubbs
 
PDF
Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...
Matt Stubbs
 
PDF
Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...
Matt Stubbs
 
PDF
Big Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICS
Matt Stubbs
 
PDF
Big Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSE
Matt Stubbs
 
PDF
Big Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNING
Matt Stubbs
 
PDF
Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...
Matt Stubbs
 
PDF
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...
Matt Stubbs
 
PDF
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATE
Matt Stubbs
 
PDF
Big Data LDN 2018: A TALE OF TWO BI STANDARDS: DATA WAREHOUSES AND DATA LAKES
Matt Stubbs
 
Blueprint Series: Banking In The Cloud – Ultra-high Reliability Architectures
Matt Stubbs
 
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
Matt Stubbs
 
Blueprint Series: Expedia Partner Solutions, Data Platform
Matt Stubbs
 
Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...
Matt Stubbs
 
Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.
Matt Stubbs
 
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
Matt Stubbs
 
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQL
Matt Stubbs
 
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Matt Stubbs
 
Big Data LDN 2018: AI VS. GDPR
Matt Stubbs
 
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Matt Stubbs
 
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
Matt Stubbs
 
Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...
Matt Stubbs
 
Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...
Matt Stubbs
 
Big Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICS
Matt Stubbs
 
Big Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSE
Matt Stubbs
 
Big Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNING
Matt Stubbs
 
Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...
Matt Stubbs
 
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...
Matt Stubbs
 
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATE
Matt Stubbs
 
Big Data LDN 2018: A TALE OF TWO BI STANDARDS: DATA WAREHOUSES AND DATA LAKES
Matt Stubbs
 
Ad

Recently uploaded (20)

PDF
Development and validation of the Japanese version of the Organizational Matt...
Yoga Tokuyoshi
 
PPTX
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
PDF
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
PPTX
Numbers of a nation: how we estimate population statistics | Accessible slides
Office for National Statistics
 
PPTX
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
PDF
Product Management in HealthTech (Case Studies from SnappDoctor)
Hamed Shams
 
PDF
apidays Helsinki & North 2025 - APIs in the healthcare sector: hospitals inte...
apidays
 
PPTX
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
PDF
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
PPTX
Listify-Intelligent-Voice-to-Catalog-Agent.pptx
nareshkottees
 
PPTX
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
PDF
Copia de Strategic Roadmap Infographics by Slidesgo.pptx (1).pdf
ssuserd4c6911
 
PPT
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
PPTX
Advanced_NLP_with_Transformers_PPT_final 50.pptx
Shiwani Gupta
 
PDF
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
PDF
Choosing the Right Database for Indexing.pdf
Tamanna
 
PPTX
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
PPTX
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
PPTX
AI Presentation Tool Pitch Deck Presentation.pptx
ShyamPanthavoor1
 
PDF
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
Development and validation of the Japanese version of the Organizational Matt...
Yoga Tokuyoshi
 
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
Numbers of a nation: how we estimate population statistics | Accessible slides
Office for National Statistics
 
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
Product Management in HealthTech (Case Studies from SnappDoctor)
Hamed Shams
 
apidays Helsinki & North 2025 - APIs in the healthcare sector: hospitals inte...
apidays
 
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
Listify-Intelligent-Voice-to-Catalog-Agent.pptx
nareshkottees
 
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
Copia de Strategic Roadmap Infographics by Slidesgo.pptx (1).pdf
ssuserd4c6911
 
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
Advanced_NLP_with_Transformers_PPT_final 50.pptx
Shiwani Gupta
 
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
Choosing the Right Database for Indexing.pdf
Tamanna
 
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
AI Presentation Tool Pitch Deck Presentation.pptx
ShyamPanthavoor1
 
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 

Big Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTS

  • 1. Enabling Data-Driven Decisions with Automated Insights Charlotte Emms Data Insight Analyst @ Seenit [email protected]
  • 2. Charlotte Emms Data Insight Analyst @ Seenit - 4 years of experience in the Data Analytics field - Joined Seenit’s Product team in Nov ‘17
  • 3. Introducing Seenit A video collaboration platform that enables organisations to co-create authentic and engaging content with their communities of customers, superfans, or employees.
  • 5. A Brief History of Seenit Founded in January 2014 Over 200k uploads hit 4k+ monthly active users MVP used Couchbase deployed on GCP Still the same today! Seenit Capture v2.0 released in April 2018
  • 6. Our Couchbase Journey • Seenit’s MVP in 2014 was powered by Couchbase 2.2 back when querying the data required MapReduce techniques • The introduction of N1QL transformed Seenit, allowing us to develop new features rapidly and analyse event- based audit data for product usage research • N1QL allowed Seenit to easily understand this data by hiring a Data Analyst proficient in SQL
  • 7. Non-first (N1) Normal Form Query Language (QL) • It is based on ANSI 92 SQL • Its query engine is optimized for modern, highly parallel multi-core execution SQL-like Query Language • Expressive, familiar, and feature-rich language for querying, transforming, and manipulating JSON data N1QL extends SQL to handle data that is: • Nested: Contains nested objects, arrays • Heterogeneous: Schema-optional, non-uniform • Distributed: Partitioned across a cluster What is N1QL?
  • 8. ...this is not a scalable approach for repeatable work! The Seenit Data Analysis Toolkit Display the results using Plotly - a personal favourite in terms of data display versatility and clarity Extract data for analysis using N1QL queries through the Couchbase Python SDK Manipulate the data in Jupyter Notebooks with Python, using analytics libraries like Pandas and Numpy
  • 9. How can I share this data with the wider team?
  • 10. “Let’s build a dashboard” - every data analyst joining a startup ever (probably)
  • 11. Tell a different story with the data
  • 12. How to Communicate Platform Usage and Make it Interesting There was a need for something more tangible. Something that can engage someone from any role in the business For example: How many uploads in a week? How far was our global reach this week? Which were the active projects this week & who was running them Have mobile users been interacting with the in-app-feed?
  • 13. Seenit’s Regular, Fully Automated Platform Update Important overused pun
  • 14. Three core sections ○ This week in numbers ○ This week in lists (what’s new, what’s active) ○ This week in fun facts, which includes the following: ■ most exotic upload location (furthest from the office) ■ most committed contributor (uploads to projects) ■ biggest crowd pleaser (likes from others) The “Engaging” Part
  • 15. The Technical Journey Productionised ● Introduced more thorough HTML templating using Jinja2 ● Scheduled in Jenkins to run once a week ● Runs in Kubernetes by default - also set up to run in Docker Initial Idea / PoC ● Off the back of a conversation at stand-up ● Designed a process in Python through a Jupyter notebook ● Query data in Couchbase, manipulating and returning in tabular form for friendly email display ● Sent using my own gmail account MVP ● Found and wrestled with a nicer looking email template ● Introduced fun facts section ● Moved code to a formal Python project (separate scripts, classes, functions) using an executable file to trigger the email generation ● Sent using Sendgrid as we do for generic email notifications from the platform
  • 18. DataBot A Slack Bot user to deliver bespoke analytics on request • Coded in Python and uses the RTM (Real Time Messaging) Slack API • Mention the name of the bot user to get a response • The bot uses the first word after the direct mention to decide the correct response - no AI/ML training or NLP methods needed Image
  • 19. Image
  • 20. Things I learnt - You don’t need a relational database structure to build automated data programs - You also don’t need a BI tool to communicate data insights (at first*) - Try new things even if they don’t go to plan - Learn new skills from teammates - Welcome feedback and keep iterating!