StreamSets Data Integration Blog
Where change is welcome.
AWS Reference Architecture Guide for StreamSets
Using StreamSets DataOps Platform To Integrate Data from PostgreSQL to AWS S3 and Redshift: A Reference Architecture This document describes…
Cloud Data Migration – Knowing When, Why, and How To Move Your Data
Cloud data migration is on the rise, with cloud adoption expected to nearly double in the next five years. It's no surprise that cloud data migration is increasing, as many business benefits exist. In this piece, we’ll explore why organizations are moving to the cloud, different migration strategies, the steps involved, how long cloud migration takes, and the everyday challenges…
Reverse ETL to Marketo: A Real-Life Example
Standing for Extract, Transform, and Load, the acronym ETL describes the process of extracting data from a target, transforming it, and sending it on to load into a destination. The barest definition of ETL doesn’t include details about the nature of the source or destination. For this reason, I think there is an excellent case to be made that reverse…
How To Quickly Support Diverse LOBs With Scarce Data Engineering Resources
In a highly competitive environment, making smarter decisions faster dramatically impacts both the top and bottom lines. According to Forrester, advanced insights-driven businesses (IDBs) — firms that use data, analytics, and software in closed, continuously optimized loops to differentiate and compete — are eight times more likely than beginners to say they grew by 20% or more. That kind of…
The Basics of Data Pipeline Architecture for Machine Learning
Machine learning has become an integral part of organizations looking to do everything from improve customer experience to make product recommendations to target advertisements. The machine learning (ML) pipeline defines the steps that help create the models used for ML predictions. Each step involved in an ML pipeline is distinct and can be broken into modules to increase reusability for…
8 Data Governance Principles To Live By
Data governance is essential for all businesses, but especially for enterprise companies with their petabytes of data. Properly governing your data can ensure it is accurate, consistent, and secure. This helps to protect your company from data breaches and other security threats. This blog post will discuss eight data governance principles that you should live by. Data Governance Principles Let’s…