skip to Main Content

The DataOps Blog

Where Change Is Welcome

From Zero to Production ETL in 30 minutes with StreamSets

StreamSets News

Jeff Schmitz has been working with big data for over a decade: at Shell, Sanchez Energy, MapR and, currently, as a senior solutions architect at MongoDB. Here, in a guest post reposted with permission from the original, Jeff shares his early experience with StreamSets Data Collector. Now that I work for MongoDB I work with StreamSets quite a bit however a while back…

By June 14, 2019

Announcing StreamSets Data Collector 3.9.0 and StreamSets Data Collector Edge 3.9.0

StreamSets News

StreamSets is excited to announce the immediate availability of StreamSets Data Collector 3.9.0 and StreamSets Data Collector Edge 3.9.0. StreamSets Data Collector is open source under Apache License 2.0 and a powerful design and execution engine. It enables moving data between any source and destination, performing transformations, and push down analytics along the way. To download, click here. StreamSets Data…

By June 6, 2019

Ingesting Data from Apache Kafka to TimescaleDB

StreamSets News

The Glue Conference (better known as GlueCon) is always a treat for me. I've been speaking there since 2012, and this year I presented a session explaining how I use StreamSets Data Collector to ingest content delivery network (CDN) data from compressed CSV files in S3 to MySQL for analysis, using the Kickfire API to turn IP addresses into company…

By May 28, 2019

Replicating Oracle to MySQL and JSON

StreamSets News

Yannick Jaquier is a Database Technical Architect at STMicroelectronics in Geneva, Switzerland. Recently, Yannick started experimenting with StreamSets Data Collector's Oracle CDC Client origin, building pipelines to replicate data to a second Oracle database, a JSON file, and a MySQL database. Yannick documented his journey very thoroughly, and kindly gave us permission to repost his findings from his original blog entry.…

By May 10, 2019
Back To Top