Download Award-Winning Open Source StreamSets Data Collector

Current Release : 2.5.1.1

Release Date : 5/12/2017 | Release Notes | Archives

Download Core (TGZ)
This tarball contains the core SDC software with a minimum set of connectors. You can download additional stages manually using the steps shown on the right hand side

Extract Tarball:

$ tar xvzf streamsets-datacollector-core-2.5.1.1.tgz

Run Data Collector:

$ streamsets-datacollector-2.5.1.1/bin/streamsets dc

Browse to http://<system-ip>:18630/

Install packages using UI

Use the package manager in the StreamSets UI to install stages.

– or –

Install packages using CLI
# extract the tar file
$ tar xvzf streamsets-datacollector-core-2.5.1.1.tgz
# list all downloadable stage libraries
$ ./bin/streamsets stagelibs -list
# install stage libraries as required
$ ./bin/streamsets stagelibs -install=<stageid1>,<stageid2>


Want the full packages?

Download Tarball, RPM or Cloudera Manager Parcel.

Extract Tarball:

$ tar xvzf streamsets-datacollector-all-2.5.1.1.tgz

Run Data Collector:

$ streamsets-datacollector-2.5.1.1/bin/streamsets dc

Browse to http://<system-ip>:18630/

Install RPM:

$ tar -xzf streamsets-datacollector-2.5.1.1-all-rpms.tgz

$ yum localinstall streamsets*.rpm

Run Data Collector:

$ service sdc start

Browse to http://<system-ip>:18630/

Copy CSD to:

$ mv STREAMSETS-2.5.1.1.jar /opt/cloudera/csd/

Change file owner + permissions:

$ sudo chown cloudera-scm:cloudera-scm STREAMSETS-2.5.1.1.jar && sudo chmod 644 STREAMSETS-2.5.1.1.jar

Restart Cloudera Manager:

$ sudo /etc/init.d/cloudera-scm-server restart


Using Docker?

We are on DockerHub.

$ docker run --restart on-failure -p 18630:18630 -d --name streamsets-dc streamsets/datacollector

Building from the Source?

Data Collector is developed using the Apache v2 License. Join our community of contributors.

We are on GitHub – check out the build instructions.


Getting Started with StreamSets

https://vimeo.com/139165571

Need a deeper dive? Read the docs or check out our tutorials.

CDN Services provided by Fastly.

Shona DavidsonDownload StreamSets Open Source