StreamSets News

Replicating Oracle to MySQL and JSON

Yannick Jaquier is a Database Technical Architect at STMicroelectronics in Geneva, Switzerland. Recently, Yannick started experimenting with StreamSets Data Collector’s Oracle CDC Client origin, building pipelines to replicate data to a second Oracle database, a JSON file, and a MySQL database. Yannick documented his journey very thoroughly, and kindly gave us permission to repost his findings […]

Binary Classification of Streaming Data using TensorFlow to ADLS Gen1 and ADLS Gen2

Over the past decade, digital transformation has evolved such that every system and device has a digital trail: from IT servers, to factory equipment, to consumer electronics, to buildings, to cars. Increasing data volumes, rates, and variety have created increasing complexity, not to mention that these new datasets must be analyzed in real time. Fit-for-purpose data platforms […]

Sensor Data from Azure Event Hub to ADLS Gen2 and Azure SQL DWH

Data that’s in flight is perceived to be more challenging to work with than “landed” data sitting quietly in some storage platform. With the high volume and variety of data constantly streaming in from IoT sensors, you need a holistic DataOps approach that goes beyond traditional, stationary transactional data. The StreamSets DataOps platform provides an […]

A Cost Comparison of a Cloudera Hadoop Cluster with StreamSets Ingestion Framework on Oracle Cloud Infrastructure

Introduction: It should come as no surprise that a Hadoop cluster and the public cloud go together like peanut butter and jelly because of scale, agility, and economy. It should come as even less of a surprise that a software provider like Oracle is now providing an enterprise-grade cloud via its bare metal compute offering […]

Join Us at the First Annual DataOps Summit

StreamSets is proud to be hosting the first annual DataOps Summit in San Francisco, California at the Hilton San Francisco Financial District on September 3rd-5th. The summit will feature a full day of data operations training and two days of comprehensive content featuring major brands, high-scale use cases, ecosystem partners, and community heroes. Keynote […]

Ingesting Data from Relational Databases to Cassandra with StreamSets

This post is summarized content from a full tutorial at https://academy.datastax.com/content/ingesting-data-relational-databases-cassandra-streamsets How do you ingest from an existing relational database (RDBMS) to an Apache Cassandra or DataStax Enterprise cluster? What about a one-time batch loading of historical data vs. streaming changes? I know what some of you are […]

Snowflake + StreamSets: DataOps Accelerating Cloud Adoption

StreamSets is proud to announce their new partnership with Snowflake and the general availability release of StreamSets for Snowflake. As enterprises move more of their big data workloads to the cloud, it becomes imperative that Data Operations are more resilient and adaptive to continue to serve the business’s needs. This is why StreamSets has partnered with […]

How to Bulk Load Amazon RedShift from Relational Databases with StreamSets

Overview: You have options when bulk loading data into RedShift from relational database (RDBMS) sources. These options include manual processes or using one of the numerous hosted as-a-service options. But, if you have broader requirements than simply importing, you need another option. Your company may have requirements such as adhering to enterprise security policies which […]

StreamSets Helps You Build Your Data Warehouse in the Cloud

StreamSets is excited to announce the immediate availability of StreamSets for Snowflake, the first DataOps platform for Snowflake. Now users can extend their DataOps environments to the popular Snowflake service. StreamSets makes copying data from databases, streams, and event processing directly into your cloud EDW simple, without complex schema design and hand-coding. Users get high […]

Accelerate Your Journey To The Cloud Data Warehouse: StreamSets For Snowflake

Introduction: Data warehouses are a critical component of modern data architecture in enterprises that leverage massive amounts of data to drive quality of their products and services. A data warehouse is an OLAP (Online Analytical Processing) database that collects data from transactional databases such as Billing, CRM, ERP, etc. and provides a layer on top […]
