StreamSets Data Integration Blog

Where change is welcome.

Elasticsearch plus StreamSets for Reliable Data Ingestion

By Arvind Prabhakar, November 18, 2015

StreamSets Data Collector is open source software that lets you easily build continuous data ingestion pipelines for Elasticsearch. By being resistant to "data drift", StreamSets minimizes ingest-related data loss and helps ensure optimized indexes so that Elasticsearch and Kibana users…

What Is StreamSets?

By October 5, 2015

This 2015 blog post has been updated. The original post is preserved below. StreamSets is a modern data integration platform dedicated to building the smart data pipelines needed to power DataOps across hybrid and multi-cloud architectures. StreamSets was founded in…

State of the Art Data Ingestion

By September 29, 2015

Forward-looking, data-driven enterprises increasingly leverage Big Data platforms, such as Hadoop, Elasticsearch and Amazon Web Services, to derive insights from non-transactional, machine-generated data. Many tools have emerged to power next-generation data pipelines and provide specialized analytic capabilities. To get value…

Start with Why: Data Drift

By September 24, 2015

Today, after a year of working in stealth mode with a number of enterprise charter customers, we are excited to launch StreamSets. Arvind and I started StreamSets in June 2014 because, as they say in French, “plus ça change, plus…
