data-architecture-a

(coco) #1

the repetitive records have the same identifying information in exactly the same
structure.


Fig. 4.2.5 Repetitive records have the same context.

From the standpoint of repetitiveness and predictability, big data indeed has very
structured data inside it.


So in answer to the question does big data have structure?—if you look at the question
from the standpoint of structure meaning a structured DBMS infrastructure, then big data
does not contain structured data. But if you look at big data from the standpoint of
containing repetitive data with predictable context, then big data can be said to be
structured.


The answer to the question then is neither yes nor no. The answer to the question
depends entirely on the definition of what is meant by structured and unstructured.


Nonrepetitive Data


Even if big data can contain structured data, big data can also contain what is called
“nonrepetitive” data as well. Nonrepetitive records of data are records where the
structure and content of the records are entirely independent of each other. Where there
is nonrepetitive data, it is entirely an accident if any two records resemble each other,
either in content or structure.


There are many examples of nonrepetitive data. Some examples of nonrepetitive data
include the following:


E-mails
Call center information
Health-care records
Insurance claim information
Warranty claim information

Nonrepetitive information contains indicative information. But the indicative information


Chapter 4.2: What Is Big Data?
Free download pdf