Author: "Patrice Y. Simard" / Journal: seventh international conference on document analysis and recognition, 2003. proceedings. - Searchworks@Jio Institute Digital Library Search Results

Your search for "Patrice Y. Simard" returned 3 results

1. Best practices for convolutional neural networks applied to visual document analysis

Author: John Platt, David W. Steinkraus, and Patrice Y. Simard
Subjects: Training set, Artificial neural network, Computer science, Machine learning, Convolutional neural network, Support vector machine, Set (abstract data type), Handwriting recognition, Artificial intelligence, MNIST database
Abstract: Neural networks are a powerful technology for classification of visual inputs arising from documents. However, there is a confusing plethora of different neural network methods that are used in the literature and in industry. This paper describes a set of concrete best practices that document analysis researchers can use to get good results with neural networks. The most important practice is getting a training set as large as possible: we expand the training set by adding a new form of distorted data. The next most important practice is that convolutional neural networks are better suited for visual document tasks than fully connected networks. We propose that a simple "do-it-yourself" implementation of convolution with a flexible architecture is suitable for many visual document problems. This simple convolutional neural network does not require complex methods, such as momentum, weight decay, structure-dependent learning rates, averaging layers, tangent prop, or even finely-tuning the architecture. The end result is a very simple yet general architecture which can yield state-of-the-art performance for document analysis. We illustrate our claims on the MNIST set of English digit images.
Published: 2005

2. Discerning structure from freeform handwritten notes

Author: Zile Wei, D. Jones, Michael Shilman, Patrice Y. Simard, and Sashi Raghupathy
Subjects: Structure (mathematical logic), Parsing, Computer science, Feature extraction, Integrated approach, Text mining, Handwriting, Document and text processing, Artificial intelligence, Natural language processing, Document layout analysis
Abstract: This paper presents an integrated approach to parsing textual structure in freeform handwritten notes. Text-graphics classification and text layout analysis are classical problems in printed document analysis, but the irregularity in handwriting and content in freeform notes reveals limitations in existing approaches. We advocate an integrated technique that solves the layout analysis and classification problems simultaneously: the problems are so tightly coupled that it is not possible to solve one without the other for real user notes. We tune and evaluate our approach on a large corpus of unscripted user files and reflect on the difficult recognition scenarios that we have encountered in practice.
Published: 2004

3. Using character recognition and segmentation to tell computer from humans

Author: Josh Benaloh, Patrice Y. Simard, Richard Szeliski, Iulian D. Calinov, and Julien D. Couvreur
Subjects: Intelligent character recognition, Computer science, String (computer science), Character encoding, Image segmentation, Optical character recognition, Machine learning, Document processing, Intelligent word recognition, Electronic mail, Segmentation, Artificial intelligence
Abstract: How do you tell a computer from a human? The situation arises often on the Internet, when online polls are conducted, accounts are requested, undesired email is received, and chat-rooms are spammed. The approach we use is to create a visual challenge that is easy for humans but difficult for a computer. More specifically, our challenge is to recognize a string of random distorted characters. To pass the challenge, the subject must type in the correct corresponding ASCII string. From an OCR point of view, this problem is interesting because our goal is to use the vast amount of accumulated knowledge to defeat the state of the art OCR algorithms. This is a role reversal from traditional OCR research. Unlike many other systems, our algorithm is based on the assumption that segmentation is much more difficult than recognition. Our image challenges present hard segmentation problems that humans are particularly apt at solving. The technology is currently being used in MSN's Hotmail registration system, where it has significantly reduced daily registration rate with minimal Consumer Support impact.
Published: 2004
