data-architecture-a
Fig. 4.3.3 Processors executing independently. An interesting thing about parallelization is that the total number of machine cy ...
Fig. 4.3.4 An MPP—massively parallel processor. The form of parallelization seen in Fig. 4.3.4 is called the “massively parallel ...
Fig. 4.3.5 Text is parsed then placed in the appropriate processor. Fig. 4.3.5 shows that in the MPP architecture, the parsing o ...
the system knows it has found the data that were being sought. Fig. 4.3.6 shows the parsing that occurs. Fig. 4.3.6 Parsing is d ...
It is seen from Fig. 4.3.6 that in order to find a single instance of data, quite a bit of work has to be done by the system. Bu ...
Fig. 4.3.8 shows the parsing of nonrepetitive data. Fig. 4.3.8 Parsing nonrepetitive data. The parsing of nonrepetitive is an en ...
works only for repetitive data, not nonrepetitive data. Once the index for the repetitive data is created, it can be scanned muc ...
Fig. 4.3.10 The application nature of building an index on repetitive data. Chapter 4.3: Parallel Processing ...
Chapter 4.4 Unstructured Data Abstract There are different definitions of big data. The definition used here is that big data en ...
Fig. 4.4.1 Unstructured data can be repetitive or nonrepetitive. Decisions Based on Structured Data For a variety of reasons, th ...
corporation. The challenge then is unlocking that potential. The Business Value Proposition Fig. 4.4.3 shows that there is a dif ...
These cases represent merely the most obvious tip of the iceberg for finding and using nonrepetitive unstructured information. R ...
Fig. 4.4.5 Analysis on nonrepetitive data is like fitting a square peg in a round hole. Ease of Analysis Fig. 4.4.5 shows that a ...
unstructured records are the following: Very nonuniform in shape. Sometimes small, sometimes large, and sometimes very large. Th ...
Punctuation Grammar Proper sentence construction It cannot be argued that there are no rules that govern the creation of proper ...
lady passes by—“She's hot.” Now, what is being said here? One interpretation is that the gentleman finds the young lady to be at ...
earliest attempt to trying to contextualize text is a technology called “NLP.” NLP stands for natural language processing (or so ...
Fig. 4.4.9 NLP does not do a good job of finding and managing context of text. In later chapters, much more will be said about t ...
Fig. 4.4.10 Map reduce can be used to address text. MapReduce is a language for the technician that can be used to do all sorts ...
Fig. 4.4.11 Manual analysis is appealing for small, one time only projects. The great appeal of doing analysis manually is that ...
«
3
4
5
6
7
8
9
10
11
12
»
Free download pdf