Sample Data Pipelines

Jumpstart your pipeline design with intent-driven data pipelines and sample data

Choose a Design Pattern for Your Data Pipeline

StreamSets has created a library of free sample data pipelines covering the most common ingestion and transformation design patterns. Start from a sample to build your own data pipeline, and try it for free.

Dev data origin with sample data for testing
Drift synchronization for Apache Hive and Apache Impala
MySQL and Oracle to cloud change data capture pipelines
MySQL schema replication to cloud data platforms
Machine learning data pipelines using PySpark or Scala
Slowly changing dimensions data pipelines

The StreamSets Data Integration Platform

Build smart data pipelines in minutes and deploy across hybrid and multi-cloud platforms from a single login.

Data Engineering for DataOps on Google Cloud

Data Engineering for DataOps on Snowflake

Why Use Sample Data Pipelines?

With pre-built data pipelines, you don’t have to spend a lot of time building a pipeline just to find out how it works. StreamSets has created a rich data pipeline library that is available inside the StreamSets Platform. Simply log in, choose your design pattern, then open the sample pipeline. Add your own data or use the sample data, then preview and run the pipeline.

StreamSets smart data pipelines use intent-driven design. That means the “how” of implementation details is abstracted away from the “what” of the data, and it becomes easy to convert sample data pipelines into essential data pipelines. Instead of rewriting the same pipeline over and over, let StreamSets do the work. You’ve got more important problems to solve.
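To make the separation of “what” from “how” concrete, here is a minimal sketch in plain Python. It is not the StreamSets API; the `Pipeline` class, `sample_origin`, and the transform functions are hypothetical names invented for illustration. The pipeline definition only describes what happens to each record, while the origin and destination are supplied at run time, so the same definition runs unchanged on sample data and on your own data.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, Iterator, List

Record = Dict[str, object]


@dataclass
class Pipeline:
    """Hypothetical pipeline: holds only the 'what' (record transformations)."""
    transforms: List[Callable[[Record], Record]]

    def run(self, origin: Iterable[Record],
            destination: Callable[[Record], None]) -> int:
        """Read from any origin, apply each transform in order, write out."""
        count = 0
        for record in origin:
            for transform in self.transforms:
                record = transform(record)
            destination(record)
            count += 1
        return count


def sample_origin() -> Iterator[Record]:
    """Stand-in for a development origin that emits sample records."""
    yield {"id": 1, "name": " ada "}
    yield {"id": 2, "name": "grace"}


def trim_name(record: Record) -> Record:
    # Return a copy with surrounding whitespace removed from "name".
    return {**record, "name": str(record["name"]).strip()}


def upper_name(record: Record) -> Record:
    # Return a copy with "name" upper-cased.
    return {**record, "name": str(record["name"]).upper()}


# The pipeline definition says nothing about where data comes from or goes.
pipeline = Pipeline(transforms=[trim_name, upper_name])

# Run 1: sample data in, in-memory list out.
sample_out: List[Record] = []
pipeline.run(sample_origin(), sample_out.append)

# Run 2: the same definition with your own data swapped in.
own_data = [{"id": 3, "name": "  linus"}]
own_out: List[Record] = []
pipeline.run(own_data, own_out.append)
```

Only the arguments to `run` change between the two runs, which is the property that lets a sample pipeline be converted rather than rewritten.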

Getting Started with StreamSets

How-to Videos

Set Up and Run

Documentation

Community

Share Your Success