Higher Good Thresholds



Next: Good Threshold = Up: Results and Analysis Previous: Error Analysis

Higher Good Thresholds

Until now, a page has been considered ``Good'' for OCR purposes if it produces an OCR output with at least 90%accuracy. In [4] it is suggested that a page should be considered ``good'' only if it is in the 95%-98%accuracy range, depending on its textual contents' difficulty. The classifier was therefore run twice, assuming a ``good threshold'' of both 95%and 98%, respectively. The results follow.