Announcing Data Collector ver

Announcing Data Collector ver

With this release we have a number of exciting new features and integrations. And as usual, we've addressed a number of bug fixes.



  • Late directory support for File Tail and Directory. You can configure the origins to read from directories and files that show up after you start the pipeline.
  • With external JMX tools, you can view additional metrics for the File Tail origin that let you know how many files are pending in the directory, and how much of the active file remains to be read.
  • The Field Hasher processor now allows hashing in place, hashing to a target field or header, and hashing the entire record.
  • A couple new processors that support Encoding and Decoding Base64 data.
  • The HBase destination now supports implicit field mappings.
  • The Kinesis Consumer origin now supports AWS proxy settings.
  • The JMS Consumer origin provides configurable custom JNDI properties.
  • Users with the Admin role can restart Data Collector from the console.
  • Configurable timeout for inactive user sessions.
  • REST API support for cross-origin resource sharing (CORS).

Download the Data Collector to get started now.

Related Resources

Check out StreamSets white papers, videos, webinars, report and more.

Visit the Resource Library

Related Blog Posts

Receive Updates

Receive Updates

Join our mailing list to receive the latest news from StreamSets.

You have Successfully Subscribed!