The DataOps Blog
Where Change Is Welcome
The Next Chapter for StreamSets
Arvind Prabhakar and I co-founded StreamSets in 2014 with an audacious vision: data should be the lifeblood of the enterprise.…
Data Warehouse vs. Database
People and machines create an unfathomable amount of data every day. That data enables businesses to operate efficiently and understand where they are heading. But without proper management and solid architectural design practices, valuable data soon becomes lost or unusable. Databases and data warehouses are critical to a data management architecture that ensures this doesn’t happen. The Difference Between Databases…
When, Why, and How to Use Change Data Capture (CDC)
The digital world moves fast. To keep up with customers (and competitors), organizations need to make split-second decisions and take action in real time. As Nucleus Research pointed out, data has a half-life when it comes to tactical, operational, and strategic decisions. But using change data capture to support real-time analytics can change the game. Instead of providing weekly or…
Data Mart vs. Data Warehouse
What is a Data Warehouse? Data warehouses are centralized repositories used to store data for an entire organization. Data warehouses contain data from many disparate data sources and can often be quite large. Data warehouses are different from other data repositories in that they are relational, meaning they store data in tables with rows and columns. What is a Data…
Streaming Kafka to Snowflake: A Strategic & Technical Walkthrough
Message-oriented middleware (MOM) applications have been around for a very long time, and Kafka is one of the latest and, lately, one of the most popular. With Snowflake also widely adopted, it’s no surprise that streaming Kafka to Snowflake is a hot topic. It used to be that messaging applications like Kafka would mostly be used…
PostgreSQL vs MySQL: A Head to Head Comparison
What is PostgreSQL? PostgreSQL is a relational database that stores data in tables, rows, and columns with pre-defined relationships. This contrasts with NoSQL or document storage solutions, which lack these features and give up advanced analytical capabilities in favor of ease of use. It is also open source. What does this mean? There is no fee, even for commercial…