data-architecture-a
Fig. 9.1.17 Most nonrepetitive data is non exceptional. The retailer is interested in such issues as the following: How often d ...
have been linked together tell an even larger picture. When records are linked together where there is a logical reason for the ...
Log Tape Records It is common in examining big data to encounter log tapes. Many organizations create log tapes only to wake up ...
Fig. 9.1.19 Log tape records are very irregular. Fig. 9.1.19 shows that on the log tape, many different kinds of records are fou ...
Fig. 9.1.20 Two different perspectives of the same thing. In Fig. 9.1.20, it is seen that logically, a log tape is merely a sequ ...
Outliers On occasion, there is a point of reference that does not seem to fit with all the other points. If this is the case, th ...
insight that would otherwise not be possible. One of the standard ways to look at data over time is through a Pareto chart. Fig. ...
This effect—of looking at data over limited moments of time—is illustrated by a simple example. Suppose there is an examination ...
The fact is that the dollar and inflation are well-understood phenomena. What is not so well understood is that there are other ...
Chapter 9.2 Analyzing Repetitive Data Abstract There are many facets to the analysis of repetitive data. One type of data where ...
Fig. 9.2.1 Repetitive data. Repetitive data can be thought of as being organized into blocks, records, and attributes. Fig. 9.2. ...
A block of data is a large allocation of space. The system knows how to find a block of data. The block of data is loaded with u ...
Fig. 9.2.3 Parsing is done to find out what is in a block. Log Data One of the most common forms of big data is log data. Indeed ...
Fig. 9.2.4 The difference between log tape data and repetitive data. In Fig. 9.2.4, repetitive data do not look like log data at ...
Even though a log tape record is made up of multiple records and must be parsed, the good news is that there typically are a fin ...
Fig. 9.2.6 Typical contents of a log tape. The analysis of repetitive data starts with access to the means by which big data is ...
The management of large amounts of data The management of large amounts of data is a consuming issue because there are indeed la ...
Fig. 9.2.8 Two approaches to accessing repetitive data. With any index, there is a cost. There is the cost of initially building ...
Fig. 9.2.9 The costs of building and maintaining an index. Summary/Detailed Data Another issue that arises is whether detailed a ...
the related summary data that can be stored in big data. Fig. 9.2.10 shows this relationship of data inside big data. Fig. 9.2.1 ...
«
14
15
16
17
18
19
20
21
22
23
»
Free download pdf