
analyst.


Fig. 10.1.21 Sequencing the many functions of textual ETL.

Internal Referential Integrity


To keep track of its many variables and the relationships among them, textual ETL
maintains an elaborate internal structure. For any given iteration of textual ETL to
execute properly, these internal relationships must be defined properly. If they are not,
textual ETL will not execute properly, and the results obtained will not be valid or
accurate.


As an example of these internal relationships, a document must first be defined. Once a
document is defined, the different indexes that can be created for the document can be
defined. Once the indexes are defined, the delimiters that define each index must be
defined. This entire infrastructure must be in place before textual ETL can operate
accurately.
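
The dependency chain described above (document, then index, then delimiter) can be illustrated with a small integrity check. This is only a sketch under stated assumptions: the class names `Document`, `Index`, and `Delimiter`, their fields, and the function `check_integrity` are hypothetical and do not come from any textual ETL product. It shows the general idea of verifying that every index refers to a defined document and that every index has at least one delimiter before processing begins.

```python
from dataclasses import dataclass


# All names below are hypothetical, chosen only to illustrate the idea.
@dataclass(frozen=True)
class Document:
    name: str


@dataclass(frozen=True)
class Index:
    name: str
    document: str  # name of the document this index belongs to


@dataclass(frozen=True)
class Delimiter:
    index: str  # name of the index this delimiter helps define
    start: str  # text that opens the indexed span
    end: str    # text that closes the indexed span


def check_integrity(documents, indexes, delimiters):
    """Return a list of problems; an empty list means the definitions are consistent."""
    problems = []
    doc_names = {d.name for d in documents}
    index_names = {i.name for i in indexes}
    delimited = {d.index for d in delimiters}

    for idx in indexes:
        # An index may only be defined for a document that already exists.
        if idx.document not in doc_names:
            problems.append(
                f"index {idx.name!r} refers to undefined document {idx.document!r}"
            )
        # Every defined index needs at least one delimiter.
        if idx.name not in delimited:
            problems.append(f"index {idx.name!r} has no delimiters")

    for d in delimiters:
        # A delimiter may only be defined for an index that already exists.
        if d.index not in index_names:
            problems.append(f"delimiter refers to undefined index {d.index!r}")

    return problems


if __name__ == "__main__":
    docs = [Document("contract")]
    idxs = [Index("party", "contract")]
    # No delimiters are defined yet, so the check reports a problem.
    print(check_integrity(docs, idxs, []))
```

In this sketch, processing would go ahead only when `check_integrity` returns an empty list, which mirrors the requirement that the entire infrastructure be in place first.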


Chapter 10.1: Nonrepetitive Data