What is Apache NiFi?

Extract, Transform, Load (ETL)


by Alooma Team
Updated Jan 3, 2018

Apache NiFi is an open source project that automates the flow of data between
systems, a task sometimes called "data logistics". The project is based on the
concepts of flow-based programming and provides a web-based user interface for
managing data flows in real time.

The project was created by the United States National Security Agency (NSA) and
was originally named NiagaraFiles. In 2014 the NSA released it as open-source
software. Apache NiFi continued to be developed at Onyara, Inc., which was
subsequently acquired by Hortonworks.

What Apache NiFi Does

Apache NiFi is an integrated data logistics platform for automating the movement of
data between disparate systems. It is data source agnostic and supports sources of
different formats, schemas, protocols, speeds, and sizes. Common sources include
geolocation devices, click streams, files, social feeds, log files, and more. NiFi
provides a configurable plumbing platform for moving data, and enables tracing
data in real time. It is not an interactive ETL tool, but it can be part of an ETL
solution.

Apache NiFi is designed from the ground up to be enterprise-ready: flexible,
extensible, and suitable for a range of devices, from network edge devices such as a
Raspberry Pi to enterprise data clusters and the cloud. Apache NiFi can also adjust
to fluctuating network connectivity that could impact the delivery of data.
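
The "plumbing" idea in this section, where data from different sources is
transformed and then routed to a destination, can be illustrated with a small
sketch. This is plain Python, not NiFi code or the NiFi API: the Record class,
the tag_source and route steps, and the two destination names are all
hypothetical and exist only to show the shape of a flow.

```python
from dataclasses import dataclass, field


@dataclass
class Record:
    """Hypothetical stand-in for one unit of data moving through a flow."""
    content: str
    attributes: dict = field(default_factory=dict)


def tag_source(record):
    """Transform step: label the record by inspecting its content."""
    kind = "log" if record.content.startswith("[") else "click"
    record.attributes["kind"] = kind
    return record


def route(record):
    """Routing step: choose a destination name from the record's attributes."""
    return "log_store" if record.attributes["kind"] == "log" else "click_store"


def run_flow(contents):
    """Send each input through tag_source, then deliver it where route says."""
    destinations = {"log_store": [], "click_store": []}
    for content in contents:
        record = tag_source(Record(content))
        destinations[route(record)].append(record.content)
    return destinations


result = run_flow(["[INFO] started", "user=7 page=/home", "[WARN] slow"])
print(result)
```

Each step only reads or writes the record it is handed, so steps can be
rearranged or replaced without touching the others. That loose coupling is the
property a flow-based design relies on.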

Apache NiFi Features

NiFi supports directed graphs of data routing, transformation, and system
mediation. Features include:

Web-based user interface - covers design, control, feedback, and monitoring.

Highly Configurable - enables a balance between loss tolerance and
guaranteed delivery, and between low latency and high throughput. Enables
dynamic prioritization of flows, modification of flows at runtime, and back
pressure thresholds, which specify the amount of data that may exist in a
queue, to avoid overrunning the system with data.

Data Provenance - enables tracking data flows from beginning to end.

Extensible - enables users to build their own processors and more. Enables
rapid development and effective testing.

Secure - supports SSL, SSH, HTTPS, encrypted content, and more. Provides
multi-tenant authorization and internal policy management.
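
To make the back pressure threshold concrete, here is a minimal sketch of a
queue that refuses new items once it holds a set number of objects, so a fast
producer has to wait for a slower consumer. It is plain Python written for
illustration, not NiFi's implementation: the Connection class, the simulate
function, and the tick-based timing are assumptions.

```python
from collections import deque


class Connection:
    """Hypothetical queue between two steps with an object-count threshold."""

    def __init__(self, threshold):
        self.threshold = threshold
        self.queue = deque()

    def is_full(self):
        return len(self.queue) >= self.threshold

    def offer(self, item):
        """Accept the item unless the threshold is reached; report success."""
        if self.is_full():
            return False
        self.queue.append(item)
        return True

    def poll(self):
        """Remove and return the oldest item, or None if the queue is empty."""
        return self.queue.popleft() if self.queue else None


def simulate(items, threshold, drain_every):
    """Producer offers one item per tick; consumer takes one every drain_every ticks."""
    conn = Connection(threshold)
    pending = deque(items)
    delivered, stalled_ticks, tick = [], 0, 0
    while pending or conn.queue:
        tick += 1
        if pending:
            if conn.offer(pending[0]):
                pending.popleft()
            else:
                stalled_ticks += 1  # back pressure: producer must retry later
        if tick % drain_every == 0:
            item = conn.poll()
            if item is not None:
                delivered.append(item)
    return delivered, stalled_ticks


delivered, stalled = simulate(list(range(6)), threshold=2, drain_every=2)
print(delivered, stalled)
```

In this sketch nothing is dropped when the queue is full. The producer simply
holds its item and retries, which trades latency for guaranteed delivery, one
of the balances the feature list mentions.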
