Customer Story

GSK: How StreamSets Increased Compliance Efficiency 10X

“In the pharma industry, it is very important for us to trace our data. Keeping track through StreamSets logs was a huge benefit for us because moving, transforming, and tracing the data brought high efficiency with compliance and scaling data engineering.”
Arun Reddipalli, Sr Director R&D Data Platform, GSK

Bringing R&D data into a single system quickly and efficiently was a top priority for GSK as it scaled to bring new data systems on board. With StreamSets, GSK automated data pipelines, identified patterns across multiple data systems, and reused those patterns as new systems were onboarded, cutting the time required from 3-6 months to 1 week.

999+ scientists consuming data
999 PB of stored data
999+ data sources

Driving Analytics with DataOps Center of Excellence

GlaxoSmithKline (GSK) is a science-led global healthcare company with a special purpose: to help people do more, feel better, and live longer. 

Pharmaceutical companies spend years discovering, developing, and testing new drugs before bringing them to market. GSK set out to build a Data Center of Excellence to accelerate delivery of clean data from thousands of data sources to more than 10,000 scientists involved in R&D around the world, and to accelerate time-to-market for life-changing healthcare solutions.

Using StreamSets, the team has automated data pipeline creation and data drift handling with the flexibility to push technology boundaries without interrupting the critical flow of self-service data for scientists.

Self-service Data Advances Drug Discovery
“GSK has more than 10,000 scientists who need access to millions of diverse data elements, from genome sequences to experiment, clinical trial, and even insurance claim data. With StreamSets, we were able to deploy a million pipelines for thousands of data sources.”
Mark Ramsey, former Chief Data & Analytics Officer, GSK


