Optical Character Recognition - Former activities - wxMoleOCR


For current research on algorithm- and software developments you may contact me via email.

wxWidgets and OCR related topics

  • Plattformunabhängige Softwareentwicklung mit wxWidgets
    2. Leipziger Linux Workshop, FH Telekom Leipzig (15. 11. 2003)
    Vortragsfolien als PDF
  • Platform independent plugin example with wxWidgets (runs under Linux & MS-Win)
    This code is based on wxMinimal example and some hints given on the mailing list - thanks.
    Source code as tar gz (Plugin example + min. Application) You may also take a look to the readme first.
  • Moment based Methods for Character Recognition (OCR)
    Diplomarbeit, Institut für Informatik, Universität Leipzig, 2002
    moments_da.pdf Sources of the implemented and analysed methods on request.
  • FL3 Handwritten Symbol Database - Subset of the NIST Special Database 1

    One of the rawdata sets used within my diploma thesis (not easy to find in the www). I converted the original data from sun-ras format to pbm and did some sorting. This database consists of 3471 pre segmented (slightly normalised) numerical symbols in 10 classes from different writers.
    NIST SUB-DB FL3 (part of special database SD1)

  • Collection of handwritten alpha numerical symbols - taken with a standardized questionnaire at the University of Leipzig in 2001

    This is a very short collection with numeric and alphabetic characters from different writers. All symbols were segmented and are given with 300dpi in 3 channels as well as b/w. Additional the symbols are given class wise on one image with corresponding segment description files (.seg). The set consists of 72 classes with 27 items per class. Collection of alpha numerical symbols

  • MNIST A numerical symbols database

    Link to another dataset with 70000 numerical symbols which I have recently found Collection of numerical symbols from MNIST

  • Initial version of a wxWidgets OCR framework for handwritten character recognition called MOLE

    This work is in a very early alpha stage. The classes implement an application which can be used as an OCR framework. All necessary OCR algorithm parts are available and can be easily substituted by own algorithms due to the plugin concept behind. Currently only a model for the classification of numerical symbols is available. For further information take a look at the readme. The project has been moved recently to GNU Savannah - you are encouraged to become a member.

Valid HTML 4.0 Transitional