
The DataOps Blog

Where Change Is Welcome

Retrieving Metrics via the StreamSets Data Collector REST API

StreamSets News

Last week, I explained how I was able to run StreamSets Data Collector Engine on a Raspberry Pi 3, ingesting sensor data and writing it to Cassandra. With that working, I wanted to show pipeline metrics across data pipelines on Adafruit's awesome PiTFT Plus 2.8" screen. In this blog post, I'll explain how I was able to write a Python…

By July 8, 2016

Announcing Data Collector ver 1.5.0.0

StreamSets News

We are excited to announce the release of the next version of StreamSets Data Collector. This release brings a number of new features and enhancements and more than 40 bug fixes. Automatic creation and updates to Hive schemas: this new functionality automatically creates schemas for Hive tables, and if it detects schema changes in the incoming data set (schema drift)…

Struggling with Bad Data? What to Do From the Enterprise

Industry, StreamSets News

Last week we announced the results of a survey of over 300 enterprise data professionals conducted by Dimensional Research and sponsored by StreamSets. We were trying to understand the market’s state of play for managing their big data flows. What we discovered was that there is an alarming issue at hand: companies are struggling to detect and keep bad data…

By June 28, 2016