
Data Pipeline Development just got easier ...
EASILY AUDIT YOUR DATA AND ACCESS IT FROM ANY POINT IN TIME.
In a modern data environment, the data runs through various stops. To still provide continuous data quality, it must always be clear where data has come from. Data Vault has made it easier to do so by inherently enabling auditing, as load times and record sources are required for every row. It also tracks all historical changes as satellites, including the load time as part of the primary key. When an attribute is updated, a new record is created.
All of this auditing enables you to easily provide auditability for both regulatory and data governance purposes. And because you store all of your history, you can access data from any point in time.
Data Vault enables quicker data loading simply because a number of tables can be loaded at the same time in parallel. The model decreases dependencies between tables during the load process and simplifies the ingestion process by leveraging inserts only, which load quicker than upserts or merges. This also leads to less complexity.
A perfect fit for multi-source systems or those that have constantly changing relationships.
The reason why it works well for these systems is its ability to make adding attributes simple. If there is a change to one source system, that change does not need to show up within all source systems. Similarly, you can limit the number of places changes are made, as attributes are stored separately from structural data in satellites.