SlideShare a Scribd company logo
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved1
Intelligently	Collecting	Data	at	the	Edge	
with	Apache	NiFi	and	MiNiFi
Andy	LoPresto		|		@yolopey	
Sr.	Member	of	Technical	Staff	at	Hortonworks,	Apache	NiFi	PMC	&	Committer	
19	April	2018	Dataworks	Summit	Berlin
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved2
Gauging	Audience	Familiarity	With	NiFi
“What’s	a	NeeFee?”	
No	experience	with	dataflow	
No	experience	with	NiFi
“I	can	pick	this	up	pretty	quickly”	
Some	experience	with	dataflow	
Some	experience	with	NiFi
“I	refactored	the	Ambari	
integration	endpoint	to	allow	
for	mutual	authentication	
TLS	during	my	coffee	break”	
Forgotten	more	about	NiFi	
than	most	of	us	will	ever	
know
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved3
Agenda
• What	is	dataflow	and	what	are	the	challenges?	
• Apache	NiFi	
• IoT	Challenges	
• Apache	MiNiFi	
• Exploration	
• Community	
• All	slides	provided	online,	so	no	need	to	transcribe
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved4
What	is	dataflow?
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved5
What	is	dataflow?
• Moving	some	content	from	A	to	B	
• Content	could	be	any	bytes	
• Logs	
• HTTP	
• XML	
• CSV	
• Images	
• Video	
• Telemetry
Producers	A.K.A	
Things	
Anything	
AND		
Everything
Internet!
Consumers	
• User	
• Storage	
• System	
• …More	Things
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved6
• Standards	
• Formats	
• Protocols	
• Veracity	
• Validity	
• Schemas	
• Partitioning/
Bundling
Data
Dataflow	Challenges	In	3	Categories
Infrastructure
• “Exactly	Once”	
Delivery	
• Ensuring	
Security	
• Overcoming	
Security	
• Credential	
Management	
• Network
People
• Compliance	
• “That	[person|
team|group]”	
• Consumers	
Change	
• Requirements	
Change		
• “Exactly	Once”	
Delivery
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved7
Raise	your	hand	if	you	want	to	maintain	Python	scripts	for	the	rest	of	your	life
Let’s	Connect	Lots	of	As	to	Bs	to	As	to	Cs	to	Bs	to	Δs	to	Cs	to	ϕs
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved8
What	is	Apache	NiFi?
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved9
• Guaranteed	delivery	
• Data	buffering		
• Backpressure	
• Pressure	release	
• Prioritized	queuing	
• Flow	specific	QoS	
• Latency	vs.	throughput	
• Loss	tolerance
Key	Features
Apache	NiFi
• Data	provenance	
• Supports	push	and	pull	models	
• Recovery/recording	

a	rolling	log	of	fine-grained	history	
• Visual	command	and	control	
• Flow	templates	
• Pluggable,	multi-tenant	security	
• Designed	for	extension	
• Clustering
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved10
Flowfiles	Are	Like	HTTP	Data
HTTP	Data FlowFile
HTTP/1.1	200	OK	
Date:	Sun,	10	Oct	2010	23:26:07	GMT	
Server:	Apache/2.2.8	(CentOS)	OpenSSL/0.9.8g	
Last-Modified:	Sun,	26	Sep	2010	22:04:35	GMT	
ETag:	"45b6-834-49130cc1182c0"	
Accept-Ranges:	bytes	
Content-Length:	13	
Connection:	close	
Content-Type:	text/html	
Hello	world!
Standard	FlowFile	Attributes	
Key:	'entryDate’	 Value:	'Fri	Jun	17	17:15:04	EDT	2016'	
Key:	'lineageStartDate’			Value:	'Fri	Jun	17	17:15:04	EDT	2016'	
Key:	'fileSize’	 Value:	'23609'	
FlowFile	Attribute	Map	Content	
Key:	'filename’	 Value:	'15650246997242'	
Key:	'path’	 Value:	'./’	
Binary	Content	*
Header
Content
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved11
User	Interface
Less	of	this…
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved11
User	Interface
Less	of	this… …	more	of	this
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved12
Deeper	Ecosystem	Integration:	260+	Processors,	48	Controller	Services
Hash
Extract
Merge
Duplicate
Scan
GeoEnrich
Replace
ConvertSplit
Translate
Route	Content
Route	Context
Route	Text
Control	Rate
Distribute	Load
Generate	Table	Fetch
Jolt	Transform	JSON
Prioritized	Delivery
Encrypt
Tail
Evaluate
Execute
All	Apache	project	logos	are	trademarks	of	the	ASF	and	the	respective	projects.
Fetch
HTTP
Syslog
Email
HTML
Image
HL7
FTP
UDP
XML
SFTP
AMQP
WebSocket
Parse	Records Convert	Records
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved13
What	are	the	IoT	challenges?
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved14
IoT	Challenges
• Limited	computing	capability	
• Limited	power/network	
• Restricted	software	library/platform	
availability	
• No	UI	
• Physically	inaccessible	
• Not	frequently	updated	
• Competing	standards/protocols	
• Scalability	
• Privacy	&	Security
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved15
• When	the	Mirai	attack	has	its	own	
Wikipedia	page,	that’s	not	good	
• Hackers	stole	high-roller	database	from	
casino	via	aquarium	thermometer	
connected	to	internet	(04/2018)
Recent	Examples
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved16
• Runs	on	JVM	
• Provides	UI	for	flow	design	&	monitoring	
• Security	built-in	
• TLS,	authentication/authorization,	encrypted	data	
• Handles	practically	any	format/protocol
NiFi	Solves	Everything*
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved17
• NiFi	supports	AMQP,	MQTT,	UDP,	TCP,	
HTTP(S),	CEF,	JMS,	(S)FTP,	AWSIoT	
• With	a	little	pruning,	NiFi	can	run	on	a	
Raspberry	Pi
NiFi	for	IoT
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved18
• NiFi	is	designed	to	“own	the	box”	
• NiFi	0.7.x	started	up	in	about	10-15	minutes	on	RP3	(593	MB)	
• NiFi	1.x	started	up	in	about	30	minutes	on	RP3	(760	MB)	
• 33	new	processors	
• Rewrite	for	multi	tenant	authorization	
• Complete	UI	overhaul
So	Why	Do	We	Need	A	Different	Solution?
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved19
Enter	Apache	MiNiFi
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved20
• Get	the	key	parts	of	NiFi	close	to	where	data	begins	and	provide	bidirectional	
communication	
• NiFi	lives	in	the	data	center	—	give	it	an	enterprise	server	or	a	cluster	of	them	
• MiNiFi	lives	as	close	to	where	data	is	born	and	is	a	guest	on	that	device	or	system	
• IoT	
• Connected	car	
• Legacy	hardware
Apache	NiFi	Subproject:	MiNiFi
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved21
• NiFi	is	big	
• 1.6.0	release	is	1.2	GB	compressed	
• Can	be	modified	to	run	in	restricted	environments,	but	requires	manual	surgery	
• Provides	UI,	provenance	query,	etc.	
• Runs	on	dedicated	machines/clusters	—	“owns	the	box”	
• MiNiFi	lives	at	the	edge	
• No	UI	
• 0.4.0	Java	binary	is	65	MB,	C++	binary	is	4.5	MB	(0.2.0	fits	on	a	floppy	disk)	
• “Good	guest”
Why	build	MiNiFi?
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved22
• MiNiFi	Java	(v0.4.0)	
• Modified	version	of	NiFi	
• No	UI	
• YAML	configuration	
• Reduced	processor	count	
• 110+	by	default,	more	

available	with	

additional	NARs	
• MiNiFi	C++	(v0.4.0)	
• Written	from	scratch	
• 28	processors	by	default	
• Bi-directional	site-to-site	&	provenance	data
Flavors	of	MiNiFi
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved23
NiFi	vs	MiNiFi	Java	Processes
NiFi	Framework
Components
MiNiFi
NiFi	Framework
User	Interface
Components
NiFi
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved24
• NiFi	
• Design	flows	
• Aggregate	data	from	many	
sources	
• Perform	routing/analysis/SEP	
• MiNiFi	
• Receive	flows	
• Collect	data	
• Send	for	processing
How	Does	MiNiFi	Interact	With	NiFi?
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved25
• We’ve	been	imagining	EDGE	to	CORE	as	a	bi-directional	linear	system	
• Let’s	expand	

that	to	the	real	

world
Let’s	Add	Dimensionality
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved26
• Data	tagging/provenance	
• Governance	from	edge	(geopolitical	
restrictions)	
• Security	(encryption,	certificate-based	
authentication)	
• Low	latency	(immediate	reactions	&	
decision-making)
What	does	MiNiFi	provide? Connected Car Reference Platform Box
Tuner + DSRC CardConnectivity Card
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved27
MiNiFi	on	a	Connected	Car
Comprehension
Collection
CAN	Bus
Gateway
MCU MCU MCU
Ethernet	/	
Ethernet	AVB
Local	
Interconnect	
Network
Yet	to	be	
established	
protocol
Listen	Ethernet Listen	LINListen	CAN Listen	<>
Parse	CAN Parse	Ethernet Parse	LIN Parse	<>
Processing	/	Synthesis
Route
Transmit Execute PrioritizeFilter
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved28
MiNiFi	on	a	Connected	Car
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved29
• Site-to-Site	
• NiFi	protocol	
• Two	implementations	
• Raw	socket	
• HTTP(S)	(Java	only)	
• Secured	with	mutual	authentication	TLS	
• HTTP(S),	(S)FTP,	JMS,	Syslog,	File,	Email,	Process	(Java	only)
MiNiFi	Exfil
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved30
Edge	Data	Exploration
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved31
• IoT	Device	generating	log	messages	
• Need	to	encrypt	data	on	device	
• Need	to	prioritize	some	data	for	
unreliable	network	connectivity	
• Transmit	data	to	central	node	
• Decrypt	data	and	analyze	
• Make	determinations	and	modify	live	
flow
Scenario
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved32
• Simulate	the	log	generation	
• Schedule	is	customizable	
• Script	can	write	to	dynamic	location
NiFi	As	Test	Harness/Environment
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved33
• Tails	a	log	file	
• Logs	the	raw	contents	(can	be	
multiple	lines	in	time	window)	
• Splits	into	individual	lines	
• Filters	the	content	
• Using	parity	of	the	timestamp	
• Prioritizes	
• Encrypts	using	AES/GCM	
• Exfils	to	remote	NiFi
Build	the	MiNiFi	Flow
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved34
• Save	as	template	from	NiFi	
• Run	$	./bin/config.sh	transform	
template.xml	config.yml	
• MiNiFi	flow	ready	to	run*

				*Still	need	to	set	up	TLS	&	encrypted	properties
Export	from	NiFi	to	MiNiFi
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved35
• NiFi	TLS	Toolkit	makes	certificates	&	
keystores	simple	(and	secure)	
• Copy	encrypted	property	value	from	
flow.xml.gz	to	config.yml	(flow	repo)
Setting	Up	Crypto
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved36
• All	data	transmitted	over	TLS	is	
encrypted	
• On	NiFi,	automatically	
decrypted	
• Attributes	visible	
• Content	still	encrypted	because	
of	EncryptContent	processor	
• Can	serve	as	secure	route	for	
follow-on	systems
If	We	Really	Have	TLS,	Why	Encrypt?
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved37
• Receive	the	data	over	S2S	
• Log	the	incoming	messages	
• Decrypt	content	
• Log	again
Process	Data	In	NiFi
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved38
Does	It	Work?
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved39
• Increase	the	write	frequency	
• Check	that	newer	records	(within	tail	
window)	with	higher	priority	arrive	first
Prioritization?
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved40
• Window	Aggregator	
• If	>60%	odd	in	window,	switch	prioritization	
• Encrypt	with	different	keys	for	different	tags	&	send	to	different	follow-on	systems	
• Spotty	network?	Tell	MiNiFi	to	cache	low	priority	and	send	in	batches	
• MiNiFi	rollover	&	pruning	of	monitored	log	
• Exfil	MiNiFi	provenance	data	to	NiFi
Next	Steps
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved41
Community
©	Hortonworks	Inc.	2011	–	2017.	All	Rights	Reserved42
Community	Example
⬢ Jeremy	Dyer	
⬢ Alexa	+	MiNiFi	+	Dyer	2.0
http://www.opensourcedad.com/apache/minifi-cpp/2016/12/18/poop-scale.html
©	Hortonworks	Inc.	2011	–	2017.	All	Rights	Reserved42
Community	Example
⬢ Jeremy	Dyer	
⬢ Alexa	+	MiNiFi	+	Dyer	2.0
http://www.opensourcedad.com/apache/minifi-cpp/2016/12/18/poop-scale.html
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved43
What’s	Next?
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved44
• NiFi	1.6.0	—	08	April	2018	
• MongoDB,	InfluxDB,	Druid,	HBase	
components	
• Granular	@Restricted	components	
• MiNiFi	C++	0.4.0	—	27	January	2018	
• MiNiFi	Java	0.4.0	—	22	January	2018	
• NiFi	Registry	0.1.0	—	1	January	2018
Introducing	Apache	NiFi	Registry
New	Announcements
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved45
• Previously,	flows	were	exported	via	
XML	templates	
• Didn’t	contain	sensitive	values	
• Couldn’t	be	updated	in-place	
• No	tracking	system	
• NiFi	Registry	brings	asset	management	
as	first-class	citizen	to	NiFi	
• Flows	can	be	versioned	
• Flows	can	be	promoted	between	
environments
Introducing	Apache	NiFi	Registry	0.1.0
NiFi	Registry	for	Dataflows
Learn	more	at	Forget	Duplicating	Local	Changes:	Apache	
NiFi	and	the	Flow	Development	Lifecycle	(FDLC)	
Thursday	19/4	@	1600,	Room	II
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved46
• Centralized	C2	for	classes	of	
agents	
• Tag	devices,	platforms,	
capabilities	
• Distribute	flows	to	edge	via	
multiple	mechanisms
Command	&	Control	for	MiNiFi
MiNiFi	C2	Server
IN DEVELOPMENT
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved47
Community	Health
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved48
Apache NiFi site

https://nifi.apache.org
Subproject MiNiFi site
https://nifi.apache.org/minifi/
Subscribe to and collaborate at

dev@nifi.apache.org
users@nifi.apache.org
Submit Ideas or Issues

https://issues.apache.org/jira/browse/NIFI
Follow us on Twitter
@apachenifi
Learn	more	and	join	us
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved49
More	NiFi	This	Week…
Title Room Time Speaker(s)
Apache	NiFi	Crash	Course Hall	I	-	D 1115	-	1345 Andy	LoPresto,	Tim	Spann
IoT	with	Apache	MXNet	and	Apache	NiFi	and	
MiNiFi
Hall	I	-	C 1150	-	1230 Tim	Spann
Best	practices	and	lessons	learnt	from	Running	
Apache	NiFi	at	Renault
Europe 1650	-	1730 Adel	Gacem,	Abdelkrim	Hadjidj
From	an	experiment	to	a	real	production	
environment
Room	V 1650	-	1730 Jeroen	Wolffensperger,	Martijn	Groen
IoT,	Streaming,	and	Dataflow	Birds	of	a	Feather Room	I 1740	-	1855 George	Vetticaden,	Davor	Bonaci,	Andy	
LoPresto,	Stephan	Ewen
Intelligently	Collecting	Data	at	the	Edge	—	
Intro	to	Apache	MiNiFi
Room	II 1100	-	1140 Andy	LoPresto
The	Power	of	Intelligent	Flows:	Realtime	IoT	
Botnet	Classification	with	Apache	NiFi
Hall	I	-	C 1400	-	1440 Andy	LoPresto
Forget	Duplicating	Local	Changes:	Apache	NiFi	
and	the	Flow	Development	Lifecycle	(FDLC)
Room	II 1600	-	1640 Andy	LoPresto
©	Hortonworks	Inc.	2011–2018.	All	rights	reserved50
Thank	you
alopresto@hortonworks.com	|	alopresto@apache.org	|	@yolopey	
github.com/alopresto/slides

More Related Content

What's hot (20)

PDF
Nifi
Julio Castro
 
PPTX
HBase and HDFS: Understanding FileSystem Usage in HBase
enissoz
 
PPTX
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks
 
PDF
Advanced Streaming Analytics with Apache Flink and Apache Kafka, Stephan Ewen
confluent
 
PPTX
Security Considerations for API Gateway Aggregation
Hitachi, Ltd. OSS Solution Center.
 
PDF
Apache pulsar - storage architecture
Matteo Merli
 
PDF
Introduction to Apache NiFi dws19 DWS - DC 2019
Timothy Spann
 
PDF
Introduction to Apache Kafka and Confluent... and why they matter
confluent
 
PPTX
Best practices and lessons learnt from Running Apache NiFi at Renault
DataWorks Summit
 
PPTX
Apache Phoenix and Apache HBase: An Enterprise Grade Data Warehouse
Josh Elser
 
PPTX
Integrating Apache Spark and NiFi for Data Lakes
DataWorks Summit/Hadoop Summit
 
PDF
Beyond the Brokers: A Tour of the Kafka Ecosystem
confluent
 
PPTX
Monitoring Apache Kafka
confluent
 
PDF
Apache Pinot Case Study: Building Distributed Analytics Systems Using Apache ...
HostedbyConfluent
 
PPTX
Apache Hadoop YARN: best practices
DataWorks Summit
 
PDF
Apache NiFi Record Processing
Bryan Bende
 
PPTX
Overview of new features in Apache Ranger
DataWorks Summit
 
PDF
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Kai Wähner
 
PDF
Owning Your Own (Data) Lake House
Data Con LA
 
PDF
Apache Nifi Crash Course
DataWorks Summit
 
HBase and HDFS: Understanding FileSystem Usage in HBase
enissoz
 
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks
 
Advanced Streaming Analytics with Apache Flink and Apache Kafka, Stephan Ewen
confluent
 
Security Considerations for API Gateway Aggregation
Hitachi, Ltd. OSS Solution Center.
 
Apache pulsar - storage architecture
Matteo Merli
 
Introduction to Apache NiFi dws19 DWS - DC 2019
Timothy Spann
 
Introduction to Apache Kafka and Confluent... and why they matter
confluent
 
Best practices and lessons learnt from Running Apache NiFi at Renault
DataWorks Summit
 
Apache Phoenix and Apache HBase: An Enterprise Grade Data Warehouse
Josh Elser
 
Integrating Apache Spark and NiFi for Data Lakes
DataWorks Summit/Hadoop Summit
 
Beyond the Brokers: A Tour of the Kafka Ecosystem
confluent
 
Monitoring Apache Kafka
confluent
 
Apache Pinot Case Study: Building Distributed Analytics Systems Using Apache ...
HostedbyConfluent
 
Apache Hadoop YARN: best practices
DataWorks Summit
 
Apache NiFi Record Processing
Bryan Bende
 
Overview of new features in Apache Ranger
DataWorks Summit
 
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Kai Wähner
 
Owning Your Own (Data) Lake House
Data Con LA
 
Apache Nifi Crash Course
DataWorks Summit
 

Similar to Intelligently collecting data at the edge—intro to Apache MiNiFi (20)

PDF
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFi
DataWorks Summit
 
PDF
Intelligently Collecting Data at the Edge – Intro to Apache MiNiFi
DataWorks Summit
 
PDF
The First Mile – Edge and IoT Data Collection with Apache NiFi and MiNiFi
DataWorks Summit
 
PDF
Dataflow Management From Edge to Core with Apache NiFi
DataWorks Summit
 
PDF
Apache Nifi Crash Course
DataWorks Summit
 
PPTX
Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks
 
PPTX
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
Haimo Liu
 
PDF
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
DataWorks Summit
 
PDF
The First Mile -- Edge and IoT Data Collection with Apache NiFi and MiNiFi
DataWorks Summit
 
PDF
Dataflow Management From Edge to Core with Apache NiFi
DataWorks Summit
 
PPTX
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Aldrin Piri
 
PDF
Apache Nifi Crash Course
DataWorks Summit
 
PPTX
Connecting the Drops with Apache NiFi & Apache MiNiFi
DataWorks Summit
 
PPTX
State of the Apache NiFi Ecosystem & Community
Accumulo Summit
 
PDF
Enterprise IIoT Edge Processing with Apache NiFi
Timothy Spann
 
PPTX
Introduction to Apache NiFi - Seattle Scalability Meetup
Saptak Sen
 
PDF
Hortonworks DataFlow & Apache Nifi @Oslo Hadoop Big Data
Mats Johansson
 
PPTX
HDF Powered by Apache NiFi Introduction
Milind Pandit
 
PPTX
Apache NiFi Crash Course - San Jose Hadoop Summit
Aldrin Piri
 
PPTX
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Data Con LA
 
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFi
DataWorks Summit
 
Intelligently Collecting Data at the Edge – Intro to Apache MiNiFi
DataWorks Summit
 
The First Mile – Edge and IoT Data Collection with Apache NiFi and MiNiFi
DataWorks Summit
 
Dataflow Management From Edge to Core with Apache NiFi
DataWorks Summit
 
Apache Nifi Crash Course
DataWorks Summit
 
Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks
 
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
Haimo Liu
 
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
DataWorks Summit
 
The First Mile -- Edge and IoT Data Collection with Apache NiFi and MiNiFi
DataWorks Summit
 
Dataflow Management From Edge to Core with Apache NiFi
DataWorks Summit
 
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Aldrin Piri
 
Apache Nifi Crash Course
DataWorks Summit
 
Connecting the Drops with Apache NiFi & Apache MiNiFi
DataWorks Summit
 
State of the Apache NiFi Ecosystem & Community
Accumulo Summit
 
Enterprise IIoT Edge Processing with Apache NiFi
Timothy Spann
 
Introduction to Apache NiFi - Seattle Scalability Meetup
Saptak Sen
 
Hortonworks DataFlow & Apache Nifi @Oslo Hadoop Big Data
Mats Johansson
 
HDF Powered by Apache NiFi Introduction
Milind Pandit
 
Apache NiFi Crash Course - San Jose Hadoop Summit
Aldrin Piri
 
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Data Con LA
 
Ad

More from DataWorks Summit (20)

PPTX
Data Science Crash Course
DataWorks Summit
 
PPTX
Floating on a RAFT: HBase Durability with Apache Ratis
DataWorks Summit
 
PPTX
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
DataWorks Summit
 
PDF
HBase Tales From the Trenches - Short stories about most common HBase operati...
DataWorks Summit
 
PPTX
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
DataWorks Summit
 
PPTX
Managing the Dewey Decimal System
DataWorks Summit
 
PPTX
Practical NoSQL: Accumulo's dirlist Example
DataWorks Summit
 
PPTX
HBase Global Indexing to support large-scale data ingestion at Uber
DataWorks Summit
 
PPTX
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
DataWorks Summit
 
PPTX
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
DataWorks Summit
 
PPTX
Supporting Apache HBase : Troubleshooting and Supportability Improvements
DataWorks Summit
 
PPTX
Security Framework for Multitenant Architecture
DataWorks Summit
 
PDF
Presto: Optimizing Performance of SQL-on-Anything Engine
DataWorks Summit
 
PPTX
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
DataWorks Summit
 
PPTX
Extending Twitter's Data Platform to Google Cloud
DataWorks Summit
 
PPTX
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
DataWorks Summit
 
PPTX
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
DataWorks Summit
 
PPTX
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
DataWorks Summit
 
PDF
Computer Vision: Coming to a Store Near You
DataWorks Summit
 
PPTX
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
DataWorks Summit
 
Data Science Crash Course
DataWorks Summit
 
Floating on a RAFT: HBase Durability with Apache Ratis
DataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
DataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
DataWorks Summit
 
Managing the Dewey Decimal System
DataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
DataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
DataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
DataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
DataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
DataWorks Summit
 
Security Framework for Multitenant Architecture
DataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
DataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
DataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
DataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
DataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
DataWorks Summit
 
Computer Vision: Coming to a Store Near You
DataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
DataWorks Summit
 
Ad

Recently uploaded (20)

PPTX
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PDF
Future-Proof or Fall Behind? 10 Tech Trends You Can’t Afford to Ignore in 2025
DIGITALCONFEX
 
PDF
SIZING YOUR AIR CONDITIONER---A PRACTICAL GUIDE.pdf
Muhammad Rizwan Akram
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PDF
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
PPT
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
PDF
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
PPTX
Designing_the_Future_AI_Driven_Product_Experiences_Across_Devices.pptx
presentifyai
 
PDF
AI Agents in the Cloud: The Rise of Agentic Cloud Architecture
Lilly Gracia
 
PDF
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
Edge AI and Vision Alliance
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
DOCX
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
PDF
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
PPTX
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
PPTX
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
PDF
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PPTX
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
PDF
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
Future-Proof or Fall Behind? 10 Tech Trends You Can’t Afford to Ignore in 2025
DIGITALCONFEX
 
SIZING YOUR AIR CONDITIONER---A PRACTICAL GUIDE.pdf
Muhammad Rizwan Akram
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
Designing_the_Future_AI_Driven_Product_Experiences_Across_Devices.pptx
presentifyai
 
AI Agents in the Cloud: The Rise of Agentic Cloud Architecture
Lilly Gracia
 
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
Edge AI and Vision Alliance
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 

Intelligently collecting data at the edge—intro to Apache MiNiFi