StreamSets Data Integration Blog

Where change is welcome.

Transformer for Snowflake

By June 29, 2021

The StreamSets platform provides an end-to-end enterprise solution to maximize the value of your Snowflake Data Cloud. The platform can ingest data into Snowflake using batch, streaming, and change data capture pipelines. With the preview of the StreamSets Engine for…

Alexa, Start My Data Pipeline

By April 29, 2021

Imagine asking Amazon Alexa or Google Home to run your ETL and data processing jobs and to automate your data pipelines. For example, "Start my data pipeline on Amazon EMR", "How many active jobs do I have running on Databricks?", or "Stop my…

13 Data Engineering Best Practices At DNB

By November 17, 2020

DNB is Norway's largest financial services group and has a reputation as a trusted financial institution throughout the region. In this guest post, the DNB Data Engineering Centre of Practice team--Saleem Pothiwala, Operations Lead - Customer Insights, Jones Mabea Agwata,…

Demystifying Kerberos Authentication on Hadoop Clusters

By September 29, 2020

Guest post by Rishi Jain, Technical Support Engineer III, StreamSets. In this blog post, you'll learn the recommended way of enabling and using Kerberos authentication when running StreamSets Transformer, a modern data transformation engine, on Hadoop clusters. Generally speaking, the --proxy-user…