The DataOps Blog

Where Change Is Welcome

Bulk Ingest of Salesforce Data Into Databricks Delta Lake

Engineering, StreamSets Partners

StreamSets is proud to announce an expansion of its partnership with Databricks by participating in Databricks’ newly launched Data Ingestion Network. A key component of this integration is the recently released Databricks COPY command, which StreamSets for Databricks uses for bulk ingest into Databricks Delta Lake. Let’s consider a simple example of ingesting account information from Salesforce and storing it…

By June 27, 2020

Oracle 19c Bulk Ingest And Change Data Capture Into Databricks Delta Lake

Engineering

In this post, we will explore how to bulk ingest and process change data capture (CDC) information from an Oracle 19c database into Databricks Delta Lake, using the enhanced Oracle CDC Client origin in StreamSets Data Collector, a fast data ingestion engine. You’ll also learn one way to automate and orchestrate the two jobs using StreamSets Control Hub REST APIs. Introduction…

By May 28, 2020

Announcing StreamSets Data Collector 3.16.0

Engineering, StreamSets News

StreamSets is excited to announce the immediate availability of StreamSets Data Collector 3.16.0. StreamSets Data Collector is a powerful design and execution engine that enables you to build and deploy intent-driven, data-drift-resilient smart data pipelines. Highlights: There are some great new features and enhancements included in this release, and below I’ve reviewed some of the highlights. For a…

By May 22, 2020