A table-form extraction with artefact removal
Source: Symposium on Applied Computing
Proceedings of the 2007 ACM Symposium on Applied Computing
Seoul, Korea
Session: Document engineering
Pages: 622-626
Year of Publication: 2007
ISBN: 1-59593-480-4
Authors
Luiz Antonio Pereira Neves, PUCPR (Pontifícia Universidade Católica do Paraná), Brazil
João Marques de Carvalho, UFCG (Universidade Federal de Campina Grande), Brazil
Jacques Facon, PUCPR (Pontifícia Universidade Católica do Paraná), Brazil
Flávio Bortolozzi, PUCPR (Pontifícia Universidade Católica do Paraná), Brazil
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/1244002.1244144

ABSTRACT

We present a novel methodology for extracting the structure of table-forms filled in by hand. The method identifies the table-form line intersections, then detects and corrects wrong intersections produced by faulty line segments or by table artefacts such as overlapping data, broken segments, and smudges. A novel method for identifying and deleting artefacts is also proposed. The last step extracts the table-form cells.

The method was evaluated on a database of 350 table-form images. The results show that artefact identification improves the performance of the table-form structure extractor, and the proposed approach reached a success rate of 85%.
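
The pipeline outlined in the abstract (find line intersections, discard artefacts, extract cells) can be illustrated with a toy sketch. This is not the authors' algorithm, which the abstract does not specify. It assumes a clean binary grid with one-pixel-wide ruling lines, and it substitutes a simple run-length filter for the paper's artefact identification. All names (`long_run_mask`, `find_intersections`, `extract_cells`, `demo_grid`) and the `min_len` threshold are hypothetical.

```python
from typing import List, Set, Tuple

Grid = List[List[int]]          # 1 = ink pixel, 0 = background
Point = Tuple[int, int]         # (row, column)


def long_run_mask(grid: Grid, min_len: int, horizontal: bool) -> Set[Point]:
    """Return the ink pixels that lie on a straight run of at least min_len."""
    rows, cols = len(grid), len(grid[0])
    outer, inner = (rows, cols) if horizontal else (cols, rows)
    kept: Set[Point] = set()
    for a in range(outer):
        run: List[Point] = []
        for b in range(inner + 1):          # the extra step flushes the last run
            r, c = (a, b) if horizontal else (b, a)
            if b < inner and grid[r][c] == 1:
                run.append((r, c))
            else:
                if len(run) >= min_len:     # short runs are treated as artefacts
                    kept.update(run)
                run = []
    return kept


def find_intersections(grid: Grid, min_len: int) -> Set[Point]:
    """Pixels shared by a long horizontal run and a long vertical run."""
    return long_run_mask(grid, min_len, True) & long_run_mask(grid, min_len, False)


def extract_cells(points: Set[Point]) -> List[Tuple[int, int, int, int]]:
    """Return (top, left, bottom, right) for each box whose four corners exist."""
    rows = sorted({r for r, _ in points})
    cols = sorted({c for _, c in points})
    cells = []
    for top, bottom in zip(rows, rows[1:]):
        for left, right in zip(cols, cols[1:]):
            corners = {(top, left), (top, right), (bottom, left), (bottom, right)}
            if corners <= points:
                cells.append((top, left, bottom, right))
    return cells


def demo_grid() -> Grid:
    """A 7x9 form with a 2x2 table and a two-pixel smudge inside one cell."""
    grid = [[0] * 9 for _ in range(7)]
    for r in (0, 3, 6):
        for c in range(9):
            grid[r][c] = 1
    for c in (0, 4, 8):
        for r in range(7):
            grid[r][c] = 1
    grid[1][2] = grid[1][3] = 1             # smudge: too short to count as a line
    return grid


if __name__ == "__main__":
    corners = find_intersections(demo_grid(), min_len=5)
    print(len(corners), "intersections")
    print(extract_cells(corners))
```

On the demo grid the smudge never forms a run of five pixels, so it does not contribute a false intersection. Real scanned forms would need thick-line handling, skew tolerance, and repair of broken segments, none of which this sketch attempts.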


