
The DataOps Blog

Where Change Is Welcome

Vendors Merge. DataOps Wins.

By October 9, 2018

The big data world was shocked last week when Apache Hadoop™ vendors Cloudera and Hortonworks announced they would be merging. Anyone familiar with this space knows that these two vendors have hardly been friends, so suffice it to say this announcement…

StreamSets Wins Cloudera Partner of the Year Award… Again!

By October 1, 2018

Partners are an important part of driving our business here at StreamSets. The StreamSets DataOps Platform is adopted by leading companies across industries, and in many cases is used directly in conjunction with Cloudera Enterprise. Jointly, customers use StreamSets and Cloudera to build new and innovative solutions that help unlock customer insight, reduce fraud and risk, and reduce operational costs. One such customer, Voya Financial, recently won a Data Impact Award for its solution to better protect its customers' data while identifying fraud incidents and proactively addressing them. Joint customers Vodafone (Business Impact) and RingCentral (Protect Your Business) were also nominated as finalists.

Streamline Hybrid Cloud Data Integration with DataOps

By September 11, 2018

As cloud adoption grows, so does the complexity of the data architectures that serve as the backbone for modern enterprise applications. This complexity, if not planned for, can cripple any cloud initiative. According to research firm Gartner (subscription required), by 2021, at least 75% of large and global organizations will implement a multi-cloud capable hybrid integration platform, up from less than 25% in 2018. Taking a DataOps approach to data infrastructure can help streamline how data is moved around the business and ensure integration initiatives support the cloud-oriented goals of any organization.

DataOps Starts to Get Attention (thanks, Gartner!)

By August 7, 2018

For a while now, StreamSets has been touting the merits of DataOps as a new method for disciplined management of data topologies. In the same way DevOps changed how applications are developed, deployed and operated, DataOps will greatly improve data integration in the modern era. You can read our point of view here.

The fact is, we were a DataOps company before the term was coined in late 2015, as our founders recognized the serious operational challenges that unstructured, streaming data and hybrid cloud infrastructures would pose to enterprises used to static, batch-oriented, structured data integration. Since our inception, we’ve been focused on enabling teams to operationalize data movement, and we created the StreamSets DataOps Platform to empower customers to capitalize on a DataOps approach.

Introducing StreamSets Data Protector

By March 6, 2018

Detect, Secure and Govern Sensitive Data Upon Ingestion

StreamSets is excited to announce a new product for protecting data in motion. StreamSets Data Protector, as the name implies, extends the value of the StreamSets DataOps Platform so that you can detect, secure and govern sensitive data as it flows around your business. Leveraging StreamSets’ unique “Dataflow Sensors”, Data Protector can spot and act on personally identifiable information (PII) at the point of ingest, further strengthening your ability to meet new and changing regulatory requirements such as the General Data Protection Regulation (GDPR) and the Health Insurance Portability and Accountability Act (HIPAA).

Introducing StreamSets Control Hub

By December 13, 2017

Control. We always want it, regularly don’t get it, yet in business it’s a must-have to ensure things run as expected. Control is particularly critical when it comes to moving data around your company. Without it, it’s difficult to know where data is coming from, where it’s going and how it’s been manipulated (and by whom!) along the way. At StreamSets, we specialize in helping customers effectively move data around their business, from any source to any destination. Over time, we’ve observed that most organizations lack the controls needed to ensure that dataflow pipelines are built, executed and operated in a manner that meets the needs of the application or business process they support.

How to Convert Apache Sqoop™ Commands Into StreamSets Data Collector Pipelines

By October 26, 2017

When it comes to loading data into Apache Hadoop™, the de facto choice for bulk loads of data from leading relational databases is Apache Sqoop™. After entering the Apache Incubator in 2011, it quickly saw widespread adoption and development, graduating to a Top-Level Project (TLP) in 2012.

In StreamSets Data Collector (SDC) 2.7 we added capabilities that enable SDC to behave in a manner almost identical to Sqoop. Now customers can use SDC to modernize Sqoop-like workloads, performing the same load functions while getting the ease of use and flexibility that SDC delivers.

Straight from Our Customers: The Benefits of Modern Ingestion

By October 20, 2017

Three months into my journey here at StreamSets, I’ve had a chance to talk with many of our customers and prospects to understand how they are using the open source StreamSets Data Collector (SDC) across a number of different use cases. As it turns out, behind solving technical problems in areas such as cybersecurity, IoT or plain old data lake ingestion lies a treasure trove of value that IT teams realize as part of a typical deployment.
While this is not an exhaustive list, let’s take a quick look at some of the more common benefits our customers call out.
