Fig. 4.6.4 Mapping.

In almost every case, the mapping process is iterative. The analyst creates a first
mapping, processes a few documents, and reviews the results. The analyst then makes a
few changes and reruns the documents through textual disambiguation with the new
mapping specifications. This gradual refinement of the mapping continues until the
analyst is satisfied.


The mapping is created iteratively because documents are notoriously complex, with
many nuances that are not immediately apparent. Even for an experienced analyst,
creating the mapping is an iterative process.


Because the mapping is created iteratively, it NEVER makes sense to create a mapping
and then process thousands of documents using that initial mapping. Such a practice is
wasteful because the initial mapping is almost guaranteed to need refinement.
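
The refine-on-a-sample practice described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the mapping is modeled as a plain dictionary from term to category, and the names `apply_mapping`, `refine_on_sample`, and `scripted_review` are hypothetical, not part of any textual disambiguation product. The review function stands in for the analyst, who in practice inspects the results by hand.

```python
import re
from typing import Callable

# Assumption: a mapping is modeled as term -> category.
Mapping = dict[str, str]
Results = list[list[tuple[str, str]]]


def apply_mapping(document: str, mapping: Mapping) -> list[tuple[str, str]]:
    """Return (term, category) pairs for each mapped term found in the document."""
    words = re.findall(r"[a-z]+", document.lower())
    return [(word, mapping[word]) for word in words if word in mapping]


def refine_on_sample(
    sample: list[str],
    mapping: Mapping,
    review: Callable[[list[str], Results], Mapping],
    max_rounds: int = 10,
) -> Mapping:
    """Rerun a small sample until the review step proposes no more changes."""
    mapping = dict(mapping)  # leave the caller's mapping untouched
    for _ in range(max_rounds):
        results = [apply_mapping(doc, mapping) for doc in sample]
        changes = review(sample, results)  # stands in for the analyst
        if not changes:
            break  # the analyst is satisfied
        mapping.update(changes)
    return mapping


def scripted_review(sample: list[str], results: Results) -> Mapping:
    """Hypothetical analyst: suggest one missing term per round."""
    wanted = {"invoice": "document_type", "overdue": "payment_status"}
    seen = {term for doc_result in results for term, _ in doc_result}
    for term, category in wanted.items():
        if term not in seen and any(term in doc.lower() for doc in sample):
            return {term: category}
    return {}


# Refine on a handful of documents first, not on the full collection.
sample = ["Invoice 17 is overdue.", "Second invoice sent."]
final_mapping = refine_on_sample(sample, {}, scripted_review)
```

Only once `final_mapping` has stabilized on the sample would it be applied to the thousands of remaining documents, which avoids reprocessing the whole collection after every refinement.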


Fig. 4.6.5 shows the iterative nature of the mapping process.


Chapter 4.6: Textual Disambiguation