Skip to main navigation Skip to search Skip to main content

Text segmentation for automatic document processing

Research output: Contribution to journalConference articlepeer-review

Abstract

There is a considerable interest in designing automatic systems that can scan a given paper document and store it on electronic media for easier storage, manipulation and access. Most documents contain graphics and images, in addition to text. Thus, the document image has to be segmented to identify text and image regions, so that appropriate techniques may be applied to those regions. In this paper, we have presented a new technique for image segmentation in which text and image regions, in a given document image, are automatically identified. The technique is based on the differential processing text extraction concept. The proposed technique is capable of analyzing complex document image layouts. The document image is processed by using textural feature analysis. Results of the proposed method are presented with test images which demonstrate the robustness of the technique.

Original languageAmerican English
Pages (from-to)30-40
Number of pages11
JournalProceedings of SPIE - The International Society for Optical Engineering
Volume3651
StatePublished - 1999
Externally publishedYes
EventProceedings of the 1999 6th Annual Conference on Document Recognition and Retrieval VI - San Jose, CA, USA
Duration: Jan 27 1999Jan 28 1999

ASJC Scopus subject areas

  • Electronic, Optical and Magnetic Materials
  • Condensed Matter Physics
  • Computer Science Applications
  • Applied Mathematics
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Text segmentation for automatic document processing'. Together they form a unique fingerprint.

Cite this