Smart Data Hub
Overview of service catalogue offer
Ali Belcaid © AB Advisory & Consulting 2014
Data Subscription 
What: 
Regularly scheduled, custom data provision for a monthly fee plus a setup charge. This might include pricing data, lead generation data, research data, monitoring services for social media and websites, and market data collection.
How : 
Scraping/parsing data, vertical search, and workflow setup for data flow
Solutions : 
Nutch, Scrapy, Solr, Elasticsearch… + Hadoop/HBase/MongoDB/MySQL… for storage
Data Hub 
What: 
A collaborative platform for sharing, managing and analyzing your data. This might be a standalone product where you carry out data provision and analysis yourselves, or a joint endeavor where we source and analyze data with you.
How : 
Single repository to manage data (external & internal) 
Solution : 
CKAN open source data repository.
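As a rough illustration of how a client might look up datasets in such a repository, the sketch below builds a request URL for CKAN's Action API `package_search` endpoint and reads dataset titles from a response. The host `hub.example.org`, the helper names and the sample response are assumptions made for illustration; no network call is made, and a real deployment may need authentication and paging.

```python
import json
from urllib.parse import urlencode


def build_package_search_url(base_url, query, rows=10):
    """Build a CKAN Action API URL that searches datasets by free text."""
    params = urlencode({"q": query, "rows": rows})
    return f"{base_url.rstrip('/')}/api/3/action/package_search?{params}"


def dataset_titles(response_text):
    """Extract dataset titles from a package_search JSON response."""
    payload = json.loads(response_text)
    if not payload.get("success"):
        return []
    # Fall back to the dataset name when no title is set.
    return [d.get("title", d.get("name", "")) for d in payload["result"]["results"]]


# Hypothetical hub address, used only to show the URL shape.
url = build_package_search_url("https://hub.example.org/", "pricing data", rows=5)
print(url)

# Hand-written sample in the shape CKAN returns (success flag plus result.results).
sample = json.dumps({
    "success": True,
    "result": {
        "count": 1,
        "results": [{"name": "retail-prices", "title": "Retail prices"}],
    },
})
print(dataset_titles(sample))
```

Keeping URL construction and response parsing separate from the HTTP call makes the same helpers usable for both internal and external data sources.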
Data Collection 
What: 
Data collection at scale through scraping external or internal resources with transformation to formats for reuse. 
How : 
Scraping/parsing data and storage in the targeted format
Solution : 
Nutch, Scrapy, Solr, Elasticsearch… + Hadoop/HBase/MongoDB/MySQL… for storage
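To make the "parse, then transform to a reusable format" step concrete, here is a minimal sketch using only the Python standard library rather than the tools listed above. It parses a small HTML table and emits the rows as JSON and CSV. The sample HTML, the class `TableRowParser` and the helper names are hypothetical; a production crawler would more likely rely on Scrapy or Nutch for fetching and scheduling.

```python
import csv
import io
import json
from html.parser import HTMLParser


class TableRowParser(HTMLParser):
    """Collect the text of each <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row being read, or None outside a row
        self._cell = None  # text fragments of the cell being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def html_table_to_records(html):
    """Turn the first row into field names and the rest into dicts."""
    parser = TableRowParser()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]


def records_to_csv(records):
    """Serialize a list of dicts to CSV text with a header line."""
    if not records:
        return ""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(records[0].keys()), lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


# Made-up page fragment standing in for a scraped resource.
SAMPLE_HTML = """
<table>
  <tr><th>product</th><th>price</th></tr>
  <tr><td>Widget</td><td>9.99</td></tr>
  <tr><td>Gadget</td><td>24.50</td></tr>
</table>
"""

records = html_table_to_records(SAMPLE_HTML)
print(json.dumps(records))
print(records_to_csv(records))
```

The intermediate list of dicts is the pivot format here: once records are in that shape, writing them to JSON, CSV or a database is a separate, interchangeable step.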
Data Analysis 
What: 
Analysis, visualization and interpretation of data, either acquired through scraping or ingested via more conventional means. 
How : 
Machine learning, NLP, custom models, statistics, algorithms, visualization, storytelling…
Solution : 
KNIME, R, Python, SQL, Hive, Pig, Spark, BI software (BO, Tableau, QlikView), visual JS libraries (D3.js…)
Data Consulting 
What: 
Sessions to help businesses understand methods in data collection, analysis and management.
How : 
Consulting services at the customer site or online
Solution : 
Methodologies and best practices on how to gather, transform, analyze, visualize data and interpret results and findings.
