
Fig. 1.3.3 Hadoop centric unstructured data.

The center of the Hadoop environment, naturally enough, is Hadoop. Hadoop is one of the
technologies by which very large amounts of data can be managed. Hadoop is at the center
of what is known as “big data” and is one of the primary storage mechanisms for big
data. The essential characteristics of Hadoop are that Hadoop



  • is capable of managing very large volumes of data,

  • manages data on less expensive storage,

  • manages data by the “Roman census” method,

  • stores data in an unstructured manner.


Because of these operating characteristics, Hadoop can manage very large volumes of
data, significantly larger than the volumes that standard relational database
management systems can handle.
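
The “Roman census” method listed above is not spelled out in this passage. As it is usually understood, the processing is sent to where the data resides, and only small results travel back to a central point, rather than all the data being shipped to one place. The following Python sketch illustrates that idea only; it is not Hadoop code. The node names, sample records, and the functions count_locally and roman_census are hypothetical and chosen purely for illustration.

```python
from collections import Counter

# Hypothetical data: each "node" keeps its own unstructured records locally.
nodes = {
    "node_a": ["error disk full", "login ok", "error timeout"],
    "node_b": ["login ok", "login ok"],
    "node_c": ["error timeout", "logout"],
}


def count_locally(records):
    """Work that runs where the data lives; returns only a small summary."""
    counts = Counter()
    for record in records:
        # Tally each record by its first word (an assumed, simplistic rule).
        counts[record.split()[0]] += 1
    return counts


def roman_census(nodes):
    """Send the counting work to every node, then merge the small summaries."""
    total = Counter()
    for records in nodes.values():
        # Only the per-node summary moves to the center, never the raw records.
        total += count_locally(records)
    return total


totals = roman_census(nodes)
print(dict(totals))
```

In this sketch the raw records never leave their node; only the per-node tallies are combined, which is why the approach can scale to volumes that would be impractical to centralize.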


The big data technology of Hadoop is depicted in Fig. 1.3.4.


Chapter 1.3: The “Great Divide”