436 JOURNAL OF LAW AND POLICY
The models based on word features achieve their best
performance with about 400 features, far fewer than for the character
n-gram representation. However, the performance of the models
based on more features does not drop dramatically, as it does in the
cross-topic experiments. Again, this confirms the above
conclusion about the usefulness of thematic-related information
Figure 5: Performance of the cross-topic attribution models
(training on Politics, test on UK).
Figure 6: Performance of the cross-genre attribution models
(training on Politics, test on Book reviews).
[Both plots: accuracy (%), 30 to 100, against number of features, 0 to 15,000; series: Words, Char 3-grams.]
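As a rough illustration of what varying the number of features means for the two representations compared in Figures 5 and 6, the following minimal sketch builds a word vocabulary and a character 3-gram vocabulary of a chosen size k from training texts. It assumes that features are ranked by raw frequency in the training set and that texts are represented by relative frequencies; this excerpt does not state the exact procedure. The tokenizer, the function names, and the toy texts are hypothetical.

```python
from collections import Counter
import re


def word_tokens(text):
    """Lowercased word tokens (a simple regex tokenizer, assumed here)."""
    return re.findall(r"[a-z']+", text.lower())


def char_ngrams(text, n=3):
    """All overlapping character n-grams, spaces and punctuation included."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def top_k_features(texts, extract, k):
    """Return the k most frequent features over the training texts.

    Ranking by raw frequency is an assumption of this sketch.
    """
    counts = Counter()
    for text in texts:
        counts.update(extract(text))
    return [feature for feature, _ in counts.most_common(k)]


def vectorize(text, extract, vocabulary):
    """Relative frequency of each vocabulary feature in a single text."""
    counts = Counter(extract(text))
    total = sum(counts.values()) or 1  # avoid division by zero on empty text
    return [counts[feature] / total for feature in vocabulary]


# Toy training texts (hypothetical, for illustration only).
train = ["the vote was close", "the budget vote failed"]
vocab_words = top_k_features(train, word_tokens, k=2)
vocab_chars = top_k_features(train, char_ngrams, k=5)
```

Repeating the vocabulary construction for increasing k and training a classifier on the resulting vectors would give one accuracy value per feature-set size, which is the kind of curve the figures plot.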