data-architecture-a

(coco) #1

the data are managed.


This difference—the “great divide”—is shown in Fig. 1.3.2.


Fig. 1.3.2 Different types of unstructured data.

It is seen then that there is a very different focus between the two types of unstructured
data.


Repetitive Unstructured Data


The repetitive unstructured data are said to be “Hadoop” centric. Being “Hadoop”
centric means that processing of repetitive unstructured data revolves around processing
and managing the Hadoop/big data environment. The centricity of the repetitive
unstructured data is seen in Fig. 1.3.3.


Chapter 1.3: The “Great Divide”
Free download pdf