skip to Main Content

Customer Story

GSK: How Self-service Data Advances Drug Discovery

“GSK has more than 10,000 scientists who need access to millions of diverse data elements, from genome sequences to experiment, clinical trial, and even insurance claim data. With StreamSets, we were able to deploy a million pipelines for thousands of data sources.”
Mark Ramsey, former Chief Data & Analytics Officer, GSKWatch the Video
999 +
Scientists consuming data
999 Pb
of stored data
999 +
data sources

Driving Analytics with DataOps Center of Excellence

GlaxoSmithKline (GSK) is a science-led global healthcare company with a special purpose: to help people do more, feel better, and live longer. 

Pharmaceutical companies spend years discovering, developing, and testing new drugs before bringing them to market. GSK set out to build a Data Center of Excellence to accelerate delivery of clean data from 1,000s of data sources to more than 10,000+ scientists involved in R&D around the world. And to accelerate time-to-market for life changing healthcare solutions.

Using StreamSets, the team has automated data pipeline creation and data drift handling with the flexibility to push technology boundaries without interrupting the critical flow of self-service data for scientists.

Learn How
StreamSets Customer Story - GSK

Ready to Get Started?

We’re here to help you start building pipelines or see the platform in action.

Back To Top