skip to Main Content

StreamSets Data Integration Blog

Where change is welcome.

The Next Chapter for StreamSets

By April 19, 2022

Arvind Prabhakar and I co-founded StreamSets in 2014 with an audacious vision: data should be the lifeblood of the enterprise. Not just gathered in warehouses and lakes, but to drive the next advances in digital transformation with operationalized data analytics.…

Series C Funding
Right Where We Knew We’d Be

By September 11, 2018


Today, we announced that StreamSets raised $35 million in a Series C funding round, led by new investor Harmony Partners. I met Mark Lotke, Harmony’s Managing Partner, over 18 months ago, and we immediately hit it off because it was clear that he really got both Data and Operations, exemplified by his investments in AppDynamics, Alation and InfluxDB. Our other new investor is Paul Drews of Tenaya Capital. Paul and StreamSets go back a long way; he was a Board Observer in his past life at Battery Ventures and must have liked what he saw. I’m also delighted that our existing investors, Dharmesh Thakker from Battery Ventures and Pete Sonsini at NEA, also participated to their fullest, validating our “say what we’ll do, do what we said” doctrine.

Data in Motion Evolution: Where We’ve Been…Where We Need to Go

By January 17, 2017

data-in-motionToday we hear a lot about streaming data, fast data, and data in motion. But the truth is that we have always needed ways to move our data. Historically, the industry has been pretty inventive about getting this done. From the early days of data warehousing and extract, transform, and load (ETL) to now, we have continued to adapt and create new data movement methods, even as the characteristics of the data and data processing architectures have dramatically changed.

Exerting firm control over data in motion is a critical competency which has become core to modern data integration and operations. Based on more than 20 years in enterprise data, here is my take on the past, present and future of data in motion.

Introducing StreamSets DPM – Operational Control of Your Data in Motion

By September 12, 2016

Friends of StreamSets,

Today I am delighted to announce our new product, StreamSets Dataflow Performance Manager, or DPM, the industry’s first solution for managing operations of a company’s end-to-end dataflows within a single pane of glass. The result of a year’s worth of innovative engineering and collaboration with key customers, DPM will be generally available on or before September 27, in time for Strata. We invite you to come by our booth (#451) for a live demonstration.

DPM is a natural follow-on to our first product, StreamSets Data Collector, which is open source software for building and deploying any-to-any dataflow pipelines. That product has enjoyed a great deal of success in its first year in market, with an accelerating number of weekly downloads, which now total in the tens of thousands across hundreds of enterprises, and numerous production use cases in Fortune 500 companies across a variety of industries.

State of the Art Data Ingestion

By September 29, 2015

Forward-looking, data-driven enterprises increasingly leverage Big Data platforms, such as Hadoop, Elasticsearch and Amazon Web Services, to derive insights from non-­transactional, machine­-generated data. Many tools have emerged to power next ­generation data pipelines and provide specialized analytic capabilities. To get value…

Start with Why: Data Drift

By September 24, 2015

Today, after a year of working in stealth mode with a number of enterprise charter customers, we are excited to launch StreamSets. Arvind and I started StreamSets in June 2014 because, as they say in French, “plus ça change, plus…

Back To Top