Products
StreamSets Platform
Power your modern analytics and digital transformation with continuous data.

Data Collector
Transformer for Spark
Transformer for Snowflake
Mainframe Collector
Control Hub
Connectors
Super iPaaS
DEMO
Spend Less Time Fixing and More Time Doing

Talk to an expert and eliminate data integration friction.

Request a Demo
Solutions
StreamSets Solutions
Powerful data engineering solutions for modern data integration across multiple cloud platforms.

Agile Reporting
Cloud Data Lake Integration
Cloud Data Warehouse Integration
Mainframe Data Modernization
Power Real-time Applications
Talk With StreamSets
Contact Us

Learn more about how StreamSets can help your organization harness the power of data.

Get in Touch
Partners
StreamSets Partners
Use data in more ways with a modern approach to data integration.

Amazon Web Services
Databricks
Google Cloud Platform
Hewlett Packard Enterprise
Microsoft Azure
Snowflake
Resources
Resources
Best practices and technical how-tos for modern data integration.

Getting Started
The Data Integration Blog
Webinars
Learn Data Management
Case Studies
Events
Community
WHITE PAPER
Data Integration Advantage

Building a Foundation for Scalable AI. See the state of AI in the enterprise.

Download Now
About Us
About Us
Modernizing data integration for continuous data under constant change.

Careers
Leadership
News
Software AG
Start Free Trial
Search

StreamSets Data Integration Blog

Where change is welcome.

Replicating Relational Databases with StreamSets Data Collector

Cloud Data Migration

Data Transformation

Data Integration

By Pat Patterson February 3, 2017

StreamSets Data Collector Engine has long supported both reading and writing data from and to relational databases via Java Database Connectivity (JDBC). While it was straightforward to configure pipelines to read data from individual tables, ingesting records from an entire database was cumbersome, requiring a pipeline per table. StreamSets Data Collector Engine Now introduces the JDBC Multitable Consumer, a new pipeline origin that can read data from multiple tables through a single database connection. In this blog entry, I’ll explain how the JDBC Multitable Consumer can implement a typical use case – replicating relational databases (an entire one) into Hadoop.

Announcing Data Collector ver 2.3.0.0

By Kirit Basu, Head of Strategy February 3, 2017

We’re excited to release the next version of the StreamSets Data Collector. This release has 80+ new features and improvements, and 150+ bug fixes.