Machine Learning in
Document Analysis and Recognition
by Simone Marinai and Hiromichi Fujisawa (Eds.)
Reviewed by: L. Venkata Subramaniam (India)
This book is a collection of research papers and reviews linking together document analysis and recognition (DAR) research with machine learning research. Stated goals of the book’s editors are: the identification of good practices for the use of learning strategies in DAR, identification of DAR tasks more appropriate for learning strategies, and highlighting new learning algorithms that may be successfully applied to DAR. The papers in this book cover different topics in DAR including layout analysis, text recognition, and classification.
Document analysis and recognition is a mature field of research. The first papers in this area appeared in the 1960’s. This book has sixteen papers covering pretty much the most recent research in this area. The editors mention that they have deliberately not grouped the papers so that readers can choose their own path through the book. However, the first paper gives an introduction to DAR and ties the whole book together by citing the papers in the book under appropriate sections. This is the must read chapter of the book.
Several papers cover physical layout analysis, with one covering logical layout analysis. Text recognition is a widely studied topic that has resulted in many applications and products. Still there are challenges in dealing with noisy documents and non-standard fonts. There are several papers covering both online and offline recognition of characters and words. Supervised and unsupervised classifiers have been considered for various tasks like pixel and region classification, reading order detection, text recognition, character segmentation, script identification, signature verification, writer identification, and document categorization.
Neural networks, inductive logic programming, support vector machines, latent semantic indexing, and a host of other machine learning techniques have been applied to the various DAR tasks in this book. Indeed this book is about learning methods that can be used in DAR. Each of the papers has an experiments section where the proposed approaches have been evaluated on actual datasets including several public ones.
The collection of papers in this book will prove useful for an advanced researcher in the field or graduate students planning to do a thesis in DAR. The book would also be very useful for researchers in machine learning to understand key applications of learning approaches.
Click above to go to the publisher’s web page where there is a description of the book, a link to the Table of Contents, and sample pages.
Book Reviews Published in
the IAPR Newsletter
Close Range Photogrammetry: Principles, Methods, and Applications
by Luhmann, Robson, Kyle, and Harley
Classification and Learning Using Genetic Algorithms: Applications in Bioinformatics and Web Intelligence
by Bandyopadhyay and Pal
An Approximation Theory Viewpoint
by Cucker and Zhou
Character Recognition Systems—A Guide for Students and Practitioners
by Cheriet, Kharma, Liu, and Suen
Geometry of Locally Finite Spaces
From Gestalt Theory to Image Analysis—A Probabilistic Approach
By Desolneux, Moisan, and Morel
Numerical Recipes: The art of scientific computing, 3rd ed.
by Press, Teukolsky, Vetterling and Flannery
Feature Extraction and Image Processing, 2nd ed.
by Nixon and Aguado
Digital Watermarking and Steganography:
Fundamentals and Techniques
Springer Handbook of Speech Processing
by Benesty, Sondhi, and Huang, eds.
Digital Image Processing: An Algorithmic Introduction Using Java
by Burger and Burge
Bézier and Splines in Image Processing and Machine Vision
by Biswas and Lovell
Practical Algorithms for Image Analysis, 2 ed.
by O’Gorman, Sammon and Seul
The Dissimilarity Representation for Pattern Recognition: Foundations and Applications
by Pekalska and Duin
Handbook of Biometrics
by Jain, Flynn, and Ross (Editors)
Advances in Biometrics –
Sensors, Algorithms, and Systems
by Ratha and Govindaraju, (Editors)
Dynamic Vision for Perception and Control of Motion
by Polanski and Kimmel
Introduction to clustering large and high-dimensional data
The Text Mining Handbook
by Feldman and Sanger
Information Theory, Inference,
and Learning Algorithms
“Foundations and Trends in Computer Graphics and Vision”
Curless, Van Gool, and Szeliski., Editors
Applied Combinatorics on Words
by M. Lothaire
Human Identification Based on Gait
by Nixon, Tan and Chellappar
Mathematics of Digital Images
by Stuart Hogan
Advances in Image and Video Segmentation
Graph-Theoretic Techniques for Web Content Mining
by Schenker, Bunke, Last and Kandel
Handbook of Mathematical Models in Computer Vision
by Paragios, Chen, and Faugeras (Editors)
The Geometry of Information Retrieval
by van Rijsbergen
Biometric Inverse Problems
by Yanushkevich, Stoica, Shmerko and Popel
Correlation Pattern Recognition
by Kumar, Mahalanobis, and Juday
Pattern Recognition 3rd Edition
by Theodoridis and Koutroumbas
Dictionary of Computer Vision and
by R.B. Fisher, et. Al
Kernel Methods for Pattern Analysis
by Shawe-Taylor and Cristianini
Machine Vision Books
CVonline: an overview
The Guide to Biometrics by Bolle, et al
Pattern Recognition Books
Jul. ‘04 [pdf]