data-architecture-a

(coco) #1

What Is Data Vault 1.0?


Data Vault 1.0 (DV1) is highly focused on the data vault modeling components and
relational database technology. A DV1 data model attaches surrogate sequence keys as
its primary key selection for each of the entity types. Unfortunately, surrogate sequences
exhibit the following problems:



  • Introduce dependencies on the ETL/ELT loading paradigm

  • Contain an upper bound/upper limit, when reached can cause issues

  • Are meaningless numbers (mean absolutely nothing to the business)

  • Cause performance problems (due to dependencies) on load of big data sets

  • Reduce parallelism (again due to dependencies) of loading processes

  • Cannot be utilized as MPP partition keys for data placement, to do so would potentially cause hot
    spots in the MPP platform

  • Cannot be reliably rebuilt or reassigned (reattached to old values) during recovery loads

  • Are disparate across multiple source applications that are housing the same data sets


Chapter 6.1: Introduction to Data Vault 2.0
Free download pdf