Restoring 2D content from distorted documents

Micheal S. Brown, Mingxuan Sun, Ruigang Yang, Yun Lin, W. Brent Seales

Research output: Contribution to journalArticlepeer-review

51 Scopus citations

Abstract

This paper presents a framework to restore the 2D content printed on documents in the presence of geometric distortion and non-uniform illumination. Compared with textbased document imaging approaches that correct distortion to a level necessary to obtain sufficiently readable text or to facilitate optical character recognition (OCR), our work targets nontextual documents where the original printed content is desired. To achieve this goal, our framework acquires a 3D scan of the document's surface together with a high-resolution image. Conformal mapping is used to rectify geometric distortion by mapping the 3D surface back to a plane while minimizing angular distortion. This conformal "deskewing" assumes no parametric model of the document's surface and is suitable for arbitrary distortions. Illumination correction is performed by using the 3D shape to distinguish content gradient edges from illumination gradient edges in the high-resolution image. Integration is performed using only the content edges to obtain a reflectance image with significantly less illumination artifacts. This approach makes no assumptions about light sources and their positions. The results from the geometric and photometric correction are combined to produce the final output.

Original languageEnglish
Pages (from-to)1904-1916
Number of pages13
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Volume29
Issue number11
DOIs
StatePublished - Nov 2007

Keywords

  • Conformal mapping
  • Document processing
  • Document restoration
  • Geometric correction
  • Photometric correction
  • Shading correction

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition
  • Computational Theory and Mathematics
  • Artificial Intelligence
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'Restoring 2D content from distorted documents'. Together they form a unique fingerprint.

Cite this