StreamSets Data Integration Blog

Where change is welcome.

What’s the Biggest Lot in the City of San Francisco?

StreamSets News

After building my first pipeline with StreamSets Data Collector, I wanted to give the framework more of a workout. I've spent a lot of time working with JSON data over the past few years, and the biggest, baddest JSON data set I can easily get hold of is a 181MB file containing the address and coordinates of all 206,560 city lots in San Francisco. Not…

By March 16, 2016

Getting Started with StreamSets Data Collector

Engineering, StreamSets News

Hi, I'm Pat Patterson, newly minted 'community champion' here at StreamSets. As I get up to speed with big data in general and StreamSets Data Collector (SDC) in particular, I'll write up my exploits here on the StreamSets blog to help other novices as they get started with open source big data ingest. I'm going to assume you know the…

By March 14, 2016

Announcing StreamSets Data Collector ver 1.2.2.0

StreamSets News

We’re happy to announce a new version of the StreamSets Data Collector.

By March 11, 2016

Building a Real-Time Retail Analytics Solution with StreamSets, MapR Streams and MapR FS

StreamSets News

Today’s complex retail applications have changed dramatically, and to compete, enterprises must adopt new strategies for working with data. Big data and Hadoop enable retailers to connect with customers through multiple channels at new levels by leveraging traditional and real-time data sources for stream processing and analytics. These data sources often have the characteristics of varying volumes, velocity…

By March 10, 2016