skip to Main Content

The DataOps Blog

Where Change Is Welcome

Binlog Processing Using Maxwell, Kafka & StreamSets

Engineering, Use Cases

This is a nice example of Kafka enablement using Maxwell (a mysql-to-kafka binlog processor) and StreamSets Data Collector from the folks at B23.   It includes a schema change listener for handling data drift.  Enjoy! Innovate on Your Data - Maxwell Meets StreamSets  

By March 2, 2016

Announcing StreamSets Data Collector ver 1.2.1.0

StreamSets News

We’re happy to announce a new version of the StreamSets Data Collector. This version has a number of bug fixes and - most importantly - support for Elasticsearch 2.x.

By February 18, 2016

Continuous Ingest in the Face of Data Drift – Part 2 (from the Cloudera Vision Blog)

Industry, StreamSets News

In my previous post I discussed the causes and impacts of data drift, a natural consequence of Big Data which creates serious data quality and data pipeline operational issues. Now I will describe the features of StreamSets Data Collector, how they address ingesting data in a “drifty” environment and describe some common use cases. StreamSets was founded to deliver a…

By February 9, 2016
Back To Top