definition of 292–293
levels of 195 , 196 f
repetitive and nonrepetitive records of data 291
repetitive text 291–292
textual disambiguation 102–104, 293
variables name selection 292
MapReduce 96 , 96 f
Massively parallel processing (MPP) approach 70 , 83–84, 83 f
Master file 207 , 208 f
Mechanical speed 312
Medical records 304–307, 308 f
Metadata
in big data 257–259, 259 f
in end-state data architecture 54 , 55 f
Metrics, repetitive analysis 267–268
“Million in one” syndrome 366
MPP approach See Massively parallel processing (MPP) approach
Multipart source business keys 155–156
Multiple processors 41–42
N
Named value processing 105–106
Narrative data, classification of 112–113
Narrative information 304
Native metadata 258
Natural language processing (NLP) 17 , 95 , 374
Negation analysis, nonrepetitive data 277–278
Networked metadata 55 , 56 f
Net-worked relationship 203 , 203 f
NLP See Natural language processing (NLP)
Nonrepetitive data 86 , 269 , 269 f
acronym resolution 277 , 277 f
analytics from 295
call center information 295–303, 304 f
medical records 304–307, 308 f
associative word processing 281–282
in big data 78–79, 269
context in 79–80, 79–80f
Index