Abstract
This paper describes the specifications for three ground-truthed mathematical character and symbol image databases, called InftyCDB-1, InftyCDB-2, and InftyCDB-3. In the former two databases, the ground-truth of each character is composed of type, font, quality (touching/broken) and link (relative position), etc. InftyCDB-1 includes all the characters and symbols of 30 articles on mathematics, and is organized so that it can be used as word image database or as mathematical formula image database. InftyCDB-2, which is a continuation of InftyCDB-1, includes 37 articles including French and German articles and is organized like InftyCDB-1. InftyCDB-3 is a single character database for training and evaluating single-character recognition engines.
Original language | English |
---|---|
Pages (from-to) | 7-14 |
Number of pages | 8 |
Journal | Research Reports on Information Science and Electrical Engineering of Kyushu University |
Volume | 12 |
Issue number | 1 |
Publication status | Published - Mar 2007 |
Externally published | Yes |
All Science Journal Classification (ASJC) codes
- Computer Science(all)
- Electrical and Electronic Engineering