Get Started With StreamSets

OPEN SOURCE DATA PIPELINES

StreamSets
Data Collector

Use the open source StreamSets Data Collector to build, test, run and maintain batch and streaming pipelines. Data Collector pipelines require no schema specification and handle data drift automatically.

  • Visual UI with pre-built sources, destinations and transformations.
  • Runs standalone or scales via YARN, Mesos or Kubernetes
  • Build, preview, run and monitor real-time performance.
  • Lightweight Edge version runs in <10MB

ENTERPRISE DATA INTEGRATION

StreamSets
DataOps Platform

The StreamSets DataOps Platform provides a centralized cloud-native environment for the design, deployment, monitoring and maintenance of architectures comprising dozens or hundreds of pipelines.

  • Collaborative design of multi-pipeline topologies.
  • Live data movement map with real-time monitoring and drill down.
  • High-availability, access control and enterprise-grade security.
  • Detect and protect sensitive data and PII in-stream.

ENTERPRISE PLATFORM INTEGRATION

StreamSets
Enterprise Connectors

StreamSets Enterprise Connectors can be used to add extra connectivity and features for enterprise data platforms like Snowflake, Teradata, and MemSQL. StreamSets Enterprise Connectors require agreement to enterprise terms and conditions concerning use of these connectors in production deployments.

  • Load change data capture (CDC) data for Snowflake or use Snowpipe to optimize loads
  • Move data quickly with Teradata Fast Export
  • Insert data quickly with the MemSQL Fast Loader

MODERN ETL

StreamSets
Transformer

Use StreamSets Transformer to create data processing pipelines that execute on Spark. Using a simple to use drag and drop UI users can create pipelines for performing ETL, stream processing and machine learning operations.

  • Visibility into Apache Spark application execution
  • Runs in both batch and streaming modes
  • Progressive error handling without the need for Apache Spark skills to decipher complex log files
  • Works anywhere; in the cloud, Kubernetes or on premises 
  • Sets-based processing — For ETL, machine learning and complex event processing
Receive Updates

Receive Updates

Join our mailing list to receive the latest news from StreamSets.

You have Successfully Subscribed!