data-architecture-a

(coco) #1

sophisticated analysis of text.


Note that on output of the processed text, the analyst can now create a query on “car”
and find all mentions of any type of car. Also note that the term “car” appears nowhere
in the raw text. This is just a glimpse at the value added by taxonomies when taxonomies
are applied to text.


The ability to classify data externally is extremely useful when disambiguating
nonrepetitive unstructured data.


Taxonomies and Textual Disambiguation—Separate


Technologies


Taxonomies—the gathering, classification, and maintenance of the taxonomy—require
their own care and handling. Usually, it makes sense to build and manage the taxonomy
external to the technology for textual disambiguation. Fig. 4.7.7 shows that arrangement.


Chapter 4.7: Taxonomies
Free download pdf