Document type recognition using evidence theory
In Proceedings of the 5th IAPR International Workshop on Graphics Recognition (GREC)
Abstract
This paper presents a method for recognizing the type of a document when a database of models (document types) is given. For instance, when all documents are forms and all the different types of forms are known, we want to be able to assign an input document its form type. To that end, we define each model by a set of characteristics whose nature can vary from one to another. For instance, a characteristic can be the presence of a flower-shaped logo in the top-left corner, or equally the use of fonts of about 12pt. This paper does not intend to explain how to extract such knowledge from documents; rather, it describes how to use such information to decide what the type of a given document is when the different document types are described by characteristics.