data-architecture-a

(coco) #1

The fourth element of homographic resolution is that each of the homographic classes
must have typical words assigned to the class. For example, a cardiologist may be
associated with words like “aorta,” “stent,” “bypass,” and “valve.”


There are then four elements to homographic resolution:


The homograph
The homograph class
The homograph resolution
Words associated with the homograph class

Fig. 10.1.9 shows how homographic processing is done against raw text.


Fig. 10.1.9 Homographic processing.

Suppose the raw text looks as follows—“...120/68, 168 lbs, ha, 72 bpm, f, 38,...”


Upon processing the raw text, the entry into the database might look like the following:


Document name, byte, context—head ache, value—ha

Care must be taken with the specification of homographs. The underlying work done by
the system to resolve the homograph is considerable. So, system overhead is a concern.


In addition, the analyst can specify a default homographic class should none of the
homographic classes be qualified. In this case, the system will default to the homograph
class specified by the analyst.


Chapter 10.1: Nonrepetitive Data
Free download pdf