Why & How to Use Data Enrichment to Activate Your Data Lake for Analytics
We all know the shift to the cloud is massive and has been accelerated by COVID; however, I think many of us (myself included) don’t take enough time to really…
We all know the shift to the cloud is massive and has been accelerated by COVID; however, I think many of us (myself included) don’t take enough time to really…
StreamSets Privacy Policy
We know, we know. We went a little long this time, but you are going to love every minute of this week's guest. This week we are so excited to…
Imagine asking Amazon Alexa or Google Home to run your ETL, data processing, and automate your data pipelines. For example, "Start my data pipeline on Amazon EMR", “How many active…
Data protection is an integral part of working with data today. 2020 saw a huge increase in records exposed in data breaches, and no company wants to leave themselves open…
Although the recent public preview of Amazon Managed Streaming for Kafka (MSK) certainly made headlines, Kinesis remains Amazon's supported, production, real-time streaming service. In this blog post, I'll show you how to…
StreamSets Data Collector’s HTTP pipeline stages allow a wide range of API integrations. I recently built a data pipeline to integrate customer data from a MySQL database: retrieving, creating and…
The emerging practice of DataOps encompasses many activities that an enterprise may execute today including ingestion, ETL, and real-time stream processing. The trick to DataOps is to execute these things…
Yannick Jaquier is a Database Technical Architect at STMicroelectronics in Geneva, Switzerland. Recently, Yannick started experimenting with StreamSets Data Collector Engine's Oracle CDC Client origin, building pipelines for Oracle replication--replicating data…