Chapter 1.3


The “Great Divide”


Abstract


Corporate data include all of the data found in the corporation. The most
basic division of corporate data is between structured data and unstructured data. As a rule,
there are far more unstructured data than structured data. Unstructured data have two
basic divisions: repetitive data and nonrepetitive data. Big data is made up of
unstructured data. Nonrepetitive big data has a fundamentally different form from
repetitive big data. In fact, the differences between nonrepetitive big data
and repetitive big data are so large that the boundary between them can be called the “great
divide.” Many professionals are not even aware that this divide exists. As a rule,
nonrepetitive big data has much greater business value than
repetitive big data.


Keywords


Structured data; Unstructured data; Corporate data; Repetitive data; Nonrepetitive data;
Business value; The great divide of data; Big data


Classifying Corporate Data


Corporate data can be classified in many different ways. One of the major classifications
is structured versus unstructured data. Unstructured data can be further divided
into two categories: repetitive unstructured data and nonrepetitive unstructured data.
This division of data is shown in Fig. 1.3.1.
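
The classification described above can also be written down as a small tree. The following Python sketch is only an illustration of that hierarchy; the names TAXONOMY, leaves, and render are hypothetical and do not come from the text or from Fig. 1.3.1.

```python
# A minimal sketch of the classification described in the text.
# TAXONOMY, leaves, and render are illustrative names (assumptions),
# not part of the original chapter.

TAXONOMY = {
    "corporate data": {
        "structured data": {},
        "unstructured data": {
            "repetitive unstructured data": {},
            "nonrepetitive unstructured data": {},
        },
    }
}


def leaves(tree):
    """Return the categories that have no further subdivision, in order."""
    result = []
    for name, children in tree.items():
        if children:
            result.extend(leaves(children))
        else:
            result.append(name)
    return result


def render(tree, depth=0):
    """Return the tree as indented lines, one category per line."""
    lines = []
    for name, children in tree.items():
        lines.append("  " * depth + name)
        lines.extend(render(children, depth + 1))
    return lines


if __name__ == "__main__":
    print("\n".join(render(TAXONOMY)))
```

Calling leaves(TAXONOMY) lists the three categories at the bottom of the hierarchy: structured data, repetitive unstructured data, and nonrepetitive unstructured data. The "great divide" discussed in this chapter lies between the last two.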

