From: Corpus Refactoring: a Feasibility Study
Type of error
Percentage
Count
Text blocks requiring no manual correction
57.6%
163/283
Text blocks requiring at least one boundary correction
22.3%
63/283
Text blocks with at least one unmappable entity
20.1%
57/283
Total
100%
283/283