GlaxoSmithKline (GSK) is a science-led global healthcare company with a special purpose: to help people do more, feel better, and live longer.
Pharmaceutical companies spend years discovering, developing, and testing new drugs before bringing them to market. GSK set out to build a Data Center of Excellence to accelerate delivery of clean data from 1,000s of data sources to more than 10,000+ scientists involved in R&D around the world. And to accelerate time-to-market for life changing healthcare solutions.
Using StreamSets, the team has automated data pipeline creation and data drift handling with the flexibility to push technology boundaries without interrupting the critical flow of self-service data for scientists.