Download Open Source StreamSets Data Collector™

The award-winning open source software for development of data pipelines.

Current Release: 3.3.0

Release Date: 5/24/2018 | Release Notes | Archives

Core Tarball
Download Core (TGZ)
This tarball contains the core SDC software with a minimal set of connectors. You can install additional stage libraries using the steps below.

mac, linux

Extract Tarball:

$ tar xvzf streamsets-datacollector-core-3.3.0.tgz

Run Data Collector:

$ streamsets-datacollector-3.3.0/bin/streamsets dc

Browse to http://<system-ip>:18630/

The default username and password are “admin” and “admin”.
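Data Collector can take a little while to start, so it may help to check that the UI port answers before opening a browser. The following is a minimal sketch, assuming `curl` is installed; the helper names `sdc_url` and `wait_for_sdc`, the 30-attempt default, and the 2-second delay are illustrative choices and not part of the StreamSets distribution.

```shell
# Build the Data Collector UI URL for a host; the port defaults to 18630.
sdc_url() {
  printf 'http://%s:%s/\n' "$1" "${2:-18630}"
}

# Poll the UI until it answers or the attempts run out.
# Hypothetical helper: $1 is the host, $2 the number of attempts (default 30).
wait_for_sdc() {
  _url=$(sdc_url "$1")
  _tries=${2:-30}
  _i=0
  while [ "$_i" -lt "$_tries" ]; do
    # -s hides progress output, -f treats HTTP errors (4xx/5xx) as failure,
    # -o /dev/null discards the response body
    if curl -sf -o /dev/null "$_url"; then
      echo "Data Collector is reachable at $_url"
      return 0
    fi
    _i=$((_i + 1))
    sleep 2
  done
  echo "Data Collector did not respond at $_url" >&2
  return 1
}
```

For example, `wait_for_sdc localhost` polls http://localhost:18630/ and returns a non-zero status if nothing answers.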

Install packages using UI

Use the package manager in the StreamSets UI to install stages.

– or –

Install packages using CLI
# extract the tar file
$ tar xvzf streamsets-datacollector-core-3.3.0.tgz
# change to the extracted directory
$ cd streamsets-datacollector-3.3.0
# list all downloadable stage libraries
$ ./bin/streamsets stagelibs -list
# install stage libraries as required
$ ./bin/streamsets stagelibs -install=<stageid1>,<stageid2>
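The `-install` option takes a comma-separated list of stage library IDs. When the IDs come from a script or a file, a small helper can build that argument. This is a sketch only: `join_stage_ids` and `stagelibs_install_cmd` are hypothetical names, and the command is printed, not executed, so it can be reviewed first. It assumes the command will be run from inside the extracted Data Collector directory.

```shell
# Join stage library IDs with commas, the form the -install option expects.
join_stage_ids() {
  _joined=""
  for _id in "$@"; do
    # Prefix a comma only when something has already been collected.
    _joined="${_joined:+$_joined,}$_id"
  done
  printf '%s\n' "$_joined"
}

# Print (do not run) the install command for the given IDs.
stagelibs_install_cmd() {
  printf './bin/streamsets stagelibs -install=%s\n' "$(join_stage_ids "$@")"
}
```

Replace the arguments with IDs taken from the output of `./bin/streamsets stagelibs -list`, for example `stagelibs_install_cmd <stageid1> <stageid2>`.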
Cloudera Parcel
linux

Move the CSD to the Cloudera CSD directory:

$ mv STREAMSETS-3.3.0.jar /opt/cloudera/csd/

Change the file owner and permissions:

$ sudo chown cloudera-scm:cloudera-scm /opt/cloudera/csd/STREAMSETS-3.3.0.jar && sudo chmod 644 /opt/cloudera/csd/STREAMSETS-3.3.0.jar

Restart Cloudera Manager:

$ sudo /etc/init.d/cloudera-scm-server restart
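Before or after the restart, it can be worth confirming that the CSD file ended up with the intended permissions. The helper below is a sketch under stated assumptions: `check_mode` is a hypothetical name, and it relies on GNU `stat` with the `-c` option, as found on Linux hosts.

```shell
# Report whether a file has the expected permission bits (default 644).
# $1 is the file path, $2 the expected octal mode.
check_mode() {
  # GNU stat: %a prints the permission bits in octal.
  _mode=$(stat -c '%a' "$1") || return 2
  if [ "$_mode" = "${2:-644}" ]; then
    # %U:%G prints the owning user and group.
    echo "OK: $1 has mode $_mode, owner $(stat -c '%U:%G' "$1")"
  else
    echo "MISMATCH: $1 has mode $_mode, expected ${2:-644}" >&2
    return 1
  fi
}
```

For example, `check_mode /opt/cloudera/csd/STREAMSETS-3.3.0.jar` should report mode 644 and owner cloudera-scm:cloudera-scm if the steps above succeeded.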

Full Tarball
mac, linux

Extract Tarball:

$ tar xvzf streamsets-datacollector-all-3.3.0.tgz

Run Data Collector:

$ streamsets-datacollector-3.3.0/bin/streamsets dc

Browse to http://<system-ip>:18630/

The default username and password are “admin” and “admin”.

Full RPM
linux

Install RPM:

Use the appropriate operating system (el6 or el7) in the following command:

$ tar xf streamsets-datacollector-3.3.0-[operating system]-all-rpms.tar

$ yum localinstall streamsets*.rpm

Run Data Collector on an EL6 operating system:

$ service sdc start

Run Data Collector on an EL7 operating system:

$ systemctl start sdc

Browse to http://<system-ip>:18630/

The default username and password are “admin” and “admin”.
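The RPM steps differ only in the operating system tag (el6 or el7) and the matching service command. The sketch below captures that mapping; `rpm_bundle_name` and `sdc_start_cmd` are hypothetical helper names, and both functions only print text, they do not install or start anything.

```shell
# Name of the RPM bundle for a release and operating system (el6 or el7).
rpm_bundle_name() {
  printf 'streamsets-datacollector-%s-%s-all-rpms.tar\n' "$1" "$2"
}

# Print the service start command that matches the operating system.
sdc_start_cmd() {
  case "$1" in
    el6) echo "service sdc start" ;;
    el7) echo "systemctl start sdc" ;;
    *)   echo "unsupported operating system: $1" >&2; return 1 ;;
  esac
}
```

For example, `tar xf "$(rpm_bundle_name 3.3.0 el7)"` extracts the EL7 bundle, and `sdc_start_cmd el7` prints the command to start the service.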

Docker Image

Using Docker?

We are on DockerHub.

StreamSets Data Collector

$ docker run --restart on-failure -p 18630:18630 -d --name streamsets-dc streamsets/datacollector
The default username and password are "admin" and "admin".

StreamSets Data Collector Edge

$ docker run --publish 18633:18633 --name edge --rm streamsets/datacollector-edge
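If port 18630 is already in use on the host, the host side of the `-p` mapping in the Data Collector command can be changed while the container port stays the same. A minimal sketch, where `sdc_docker_cmd` is a hypothetical helper that only prints the command:

```shell
# Print (do not run) the docker run command with a chosen host port.
# The container port stays 18630; only the host side of -p changes.
sdc_docker_cmd() {
  printf 'docker run --restart on-failure -p %s:18630 -d --name streamsets-dc streamsets/datacollector\n' "${1:-18630}"
}
```

For example, `sdc_docker_cmd 8080` prints a command that exposes the UI at http://<system-ip>:8080/. Once the container is running, `docker logs streamsets-dc` shows its startup output.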
Source Code

Building from the Source?

StreamSets Data Collector (SDC) and StreamSets Data Collector Edge (SDC Edge) are developed under the Apache License, Version 2.0. Join our community of contributors.

We are on GitHub: check out the build instructions for SDC and SDC Edge.

StreamSets Open Source Data Collector Edge

Download Data Collector Edge for the following platforms:
