wenhuach/DAS2016_Tutorial1

DAS2016_Tutorial1

"Scene-Text Localization, Recognition, and Understanding"

DAS2016 Tutorial 1. Slides and sample code.

Albert Gordo (Xerox Research Center Europe) and Lluís Gómez i Bigordà (Computer Vision Center, Universitat Autònoma de Barcelona)

Over the last few years, the computer vision and document analysis communities have paid increasing attention to text localization and recognition in natural images (also referred to as scene text or text in the wild), particularly after the seminal works of Wang et al. More recently, driven by the current deep learning renaissance, architectures based on convolutional and recurrent neural networks have shown outstanding results on localization and recognition tasks, and have allowed researchers to approach more challenging problems such as text understanding in natural images.

This tutorial has three main objectives. First, to familiarize the audience with the problem of localization, recognition, and understanding of text in natural images, highlighting the similarities and differences between these tasks and the same tasks performed on document images. Second, to provide details about the techniques that show the greatest potential for current and future research on the topic and that could be easily transferred or adapted back to the document analysis domain. Third, to present open-source libraries that implement some of the current state-of-the-art methods.
