StreamSets Data Integration Blog
Where change is welcome.
AWS Reference Architecture Guide for StreamSets
Using StreamSets DataOps Platform To Integrate Data from PostgreSQL to AWS S3 and Redshift: A Reference Architecture. This document describes…
Transformer for Snowflake: Snowflake Transformation that Meets Cloud-First Expectations
Transformer for Snowflake is the first enterprise data transformation engine built on Snowpark. Want to learn how the engine makes advanced, native data transformations for your Data Cloud possible? Join our technical experts on Office Hours. It's likely you remember (or have at least heard) that databases in the past, beyond SQL, were extremely complicated to stay in a performant…
The JSON Validator: A Custom Processor to Ensure Your JSON Payload is Syntactically Accurate
Before we get to building your custom JSON validator, let’s talk about the author and their thoughts on why JSON has become so essential in data engineering. The author, Joel Klo, is a consultant at Bigspark, UK's engineering powerhouse, delivering next-level data platforms and solutions to their clients. Their clients use modern cloud platforms and open-source technologies, prioritize…
Get Ready for Private Preview! Transformer Engine for Snowflake
It’s no surprise when organizations implement the Snowflake Data Cloud as their internal standard for their overall data strategy. Especially as Snowflake continues to…
Manage File Updates with Automated Drift Detection in Your Kafka Topics
This article is the second part of a three-part series, Conducting the Chaos of Data Drift. StreamSets’ automated drift detection, a piece of its patented Data Drift technology, allows users to reduce break-fix by 90%. In this second part of the series, we will cover data drift specifically as it pertains to Kafka topics. Read Part One: Manage File…
Snowflake and StreamSets’ Partnership Means Accelerated Innovation for the Future of Data Integration
Data. It makes the world go round. An overstatement perhaps, but in some pockets of the globe you wouldn’t know it by the glacial…