skip to Main Content

The DataOps Blog

Where Change Is Welcome

Demystifying Kerberos Authentication on Hadoop Clusters

By September 29, 2020

Guest post by Rishi Jain, Technical Support Engineer III, StreamSets. In this blog post, you'll learn the recommended way of enabling and using kerberos authentication when running StreamSets Transformer, a modern transformation engine, on Hadoop clusters. Generally speaking, the --proxy-user argument…

What are Grok Patterns?

By August 10, 2020

Grok leverages regular expression language that allows you to name existing patterns and/or combine them into more complex patterns. Because Grok is based on regular expressions, any valid regular expressions (regexp) are also valid in grok. In StreamSets Data Collector,…

Using Kubernetes Secrets in Data Collector Pipelines

By May 19, 2020

In addition to StreamSets Data Collector's support for a variety of Credential Stores, one can also use Kubernetes Secrets as a mechanism for securely managing credentials and environment-specific properties within Data Collector pipelines. This post shows an example of using…

Back To Top