资源论文State Estimation in a Document Image and Its Application in Text Block Identification and Text Line Extraction

State Estimation in a Document Image and Its Application in Text Block Identification and Text Line Extraction

2020-03-31 | |  61 |   36 |   0

Abstract

This paper proposes a new approach to the estimation of document states such as interline spacing and text line orientation, which facilitates a number of tasks in document image processing. The proposed method can be applied to spatially varying states as well as invariant ones, so that general cases including images of complex layout, camera- captured images, and handwritten ones can also be handled. Specifically, we find CCs (Connected Components) in a document image and assign a state to each of them. Then the states of CCs are estimated using an en- ergy minimization framework, where the cost function is designed based on frequency domain analysis and minimized via graph-cuts. Using the estimated states, we also develop a new algorithm that performs text block identification and text line extraction. Roughly speaking, we can segment an image into text blocks by cutting the distant connections among the CCs (compared to the estimated interline spacing), and we can group the CCs into text lines using a bottom-up grouping along the estimated text line orientation. Experimental results on a variety of doc- ument images show that our method is efficient and provides promising results in several document image processing tasks.

上一篇:Visibility Subspaces: Uncalibrated Photometric Stereo with Shadows

下一篇:The Semi-explicit Shape Model for Multi-ob ject Detection and Classification*

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...