data-architecture-a

(coco) #1

DV1 does not meet the needs of big data, unstructured data, semistructured data, or very
large relational data sets. DV1 is highly focused on just the data modeling section and
relational databases.


Are surrogate sequences a bad thing to utilize? No, if the data set is small (less than 100M
records per table) or if the platform is capable of scaling compute power beyond
traditional methods (reducing the cost of a lookup on load). Sequences do work very well
for high-performance queries, and most traditional relational engines utilize this to their
advantage when data are partitioned by range.


There are platforms where sequences are discouraged and in fact not even available. In
those platforms, alternative key structures are needed. The alternative key structure
proposed is in fact a hash key, which is discussed in detail, later in this chapter. A third
alternative is to utilize the natural business key directly from the source system. This too
has its pros and cons and will also be addressed later on in this chapter.


Chapter 6.1: Introduction to Data Vault 2.0
Free download pdf