Dynamic Outlier Detection with StreamSets and Cassandra
This blog post concludes a short series building up a IoT sensor testbed with StreamSets Data Collector (SDC), a Raspberry Pi and Apache Cassandra. Previously, I covered:
- Part 1: Ingesting Sensor Data on the Raspberry Pi with StreamSets Data Collector
- Part 2: Retrieving Metrics via the StreamSets Data Collector REST API
- Part 3: Standard Deviations on Cassandra – Rolling Your Own Aggregate Function
To wrap up, I’ll show you how I retrieved statistics from Cassandra, fed them into SDC, and was able to filter out ‘outlier’ values.