StreamSets Data Integration Blog
Where change is welcome.
Where change is welcome.
One of the first steps of the data lifecycle is data extraction, which involves collecting data of various types from multiple sources for use in data processes. Because data sources are numerous and bring in data of various types and…
As a data engineer, you likely have an opinion on coding. The ongoing debate between pro-code and no-code approaches has touched most of us. But the reality is that this is not an all-or-nothing choice. In fact, there is great…
Most data collected from sources like database systems, websites, applications, and internal systems undergo either online transaction processing (OLTP) or online analytical processing (OLAP). These processing systems carry out different functions, with OLAP performing complex queries on aggregated, historical data,…
Good news data warriors! After a complete redesign that makes the StreamSets platform even easier to use, we’re thrilled to announce you can now try it free for a full month. While clients have always enjoyed StreamSets’ ease of use…
At last week's CDOIQ Symposium in Cambridge, MA, Tom Redman of Data Quality Solutions and Tom Davenport, who has a ubiquitous presence in the data field, jointly presented a session called “The Rise of Tweener Roles in Data Science.” The…
Calling all data professionals! Are you looking to streamline and automate reporting? StreamSets Academy has the perfect solution for you: the "Agile Reporting with StreamSets" self-paced course. Agile reporting refers to automated analytics creation that fuels reporting for internal…
In a 2013 Ted Talk, Dr. Kirk Borne, a data scientist and astrophysicist, spoke about the presence of known knowns, known unknowns, and unknown unknowns. Dr. Borne described them like this: Known knowns: Things you know about already. Known unknowns:…