BOOKSBOOKSBOOKS
Machine Learning in Document Analysis and Recognition
by Simone Marinai and Hiromichi Fujisawa (Eds.) Springer, 2008
Reviewed by: L. Venkata Subramaniam (India) |
This book is a collection of research papers and reviews linking together document analysis and recognition (DAR) research with machine learning research. Stated goals of the book’s editors are: the identification of good practices for the use of learning strategies in DAR, identification of DAR tasks more appropriate for learning strategies, and highlighting new learning algorithms that may be successfully applied to DAR. The papers in this book cover different topics in DAR including layout analysis, text recognition, and classification. Document analysis and recognition is a mature field of research. The first papers in this area appeared in the 1960’s. This book has sixteen papers covering pretty much the most recent research in this area. The editors mention that they have deliberately not grouped the papers so that readers can choose their own path through the book. However, the first paper gives an introduction to DAR and ties the whole book together by citing the papers in the book under appropriate sections. This is the must read chapter of the book. Several papers cover physical layout analysis, with one covering logical layout analysis. Text recognition is a widely studied topic that has resulted in many applications and products. Still there are challenges in dealing with noisy documents and non-standard fonts. There are several papers covering both online and offline recognition of characters and words. Supervised and unsupervised classifiers have been considered for various tasks like pixel and region classification, reading order detection, text recognition, character segmentation, script identification, signature verification, writer identification, and document categorization. Neural networks, inductive logic programming, support vector machines, latent semantic indexing, and a host of other machine learning techniques have been applied to the various DAR tasks in this book. Indeed this book is about learning methods that can be used in DAR. Each of the papers has an experiments section where the proposed approaches have been evaluated on actual datasets including several public ones. The collection of papers in this book will prove useful for an advanced researcher in the field or graduate students planning to do a thesis in DAR. The book would also be very useful for researchers in machine learning to understand key applications of learning approaches. |
Click above to go to the publisher’s web page where there is a description of the book, a link to the Table of Contents, and sample pages. |
Book Reviews Published in the IAPR Newsletter
Close Range Photogrammetry: Principles, Methods, and Applications by Luhmann, Robson, Kyle, and Harley
Classification and Learning Using Genetic Algorithms: Applications in Bioinformatics and Web Intelligence by Bandyopadhyay and Pal
Learning Theory: An Approximation Theory Viewpoint by Cucker and Zhou
Character Recognition Systems—A Guide for Students and Practitioners by Cheriet, Kharma, Liu, and Suen
Geometry of Locally Finite Spaces by Kovalevsky
From Gestalt Theory to Image Analysis—A Probabilistic Approach By Desolneux, Moisan, and Morel
Numerical Recipes: The art of scientific computing, 3rd ed. by Press, Teukolsky, Vetterling and Flannery
Feature Extraction and Image Processing, 2nd ed. by Nixon and Aguado
Digital Watermarking and Steganography: Fundamentals and Techniques by Shih
Springer Handbook of Speech Processing by Benesty, Sondhi, and Huang, eds.
Digital Image Processing: An Algorithmic Introduction Using Java by Burger and Burge
Bézier and Splines in Image Processing and Machine Vision by Biswas and Lovell
Practical Algorithms for Image Analysis, 2 ed. by O’Gorman, Sammon and Seul
The Dissimilarity Representation for Pattern Recognition: Foundations and Applications by Pekalska and Duin
Handbook of Biometrics by Jain, Flynn, and Ross (Editors)
Advances in Biometrics – Sensors, Algorithms, and Systems by Ratha and Govindaraju, (Editors)
Dynamic Vision for Perception and Control of Motion by Dickmanns
Bioinformatics by Polanski and Kimmel
Introduction to clustering large and high-dimensional data by Kogan
The Text Mining Handbook by Feldman and Sanger
Information Theory, Inference, and Learning Algorithms by Makay
Geometric Tomography by Gardner
“Foundations and Trends in Computer Graphics and Vision” Curless, Van Gool, and Szeliski., Editors
Applied Combinatorics on Words by M. Lothaire
Human Identification Based on Gait by Nixon, Tan and Chellappar
Mathematics of Digital Images by Stuart Hogan
Advances in Image and Video Segmentation Zhang, Editor
Graph-Theoretic Techniques for Web Content Mining by Schenker, Bunke, Last and Kandel
Handbook of Mathematical Models in Computer Vision by Paragios, Chen, and Faugeras (Editors)
The Geometry of Information Retrieval by van Rijsbergen
Biometric Inverse Problems by Yanushkevich, Stoica, Shmerko and Popel
Correlation Pattern Recognition by Kumar, Mahalanobis, and Juday
Pattern Recognition 3rd Edition by Theodoridis and Koutroumbas
Dictionary of Computer Vision and Image Processing by R.B. Fisher, et. Al
Kernel Methods for Pattern Analysis by Shawe-Taylor and Cristianini
Machine Vision Books
CVonline: an overview
The Guide to Biometrics by Bolle, et al
Pattern Recognition Books Jul. ‘04 [pdf] |