文献类型: 会议论文
第一作者: Maroua Mehri
作者: Maroua Mehri 1 ; Pierre Heroux 2 ; Nabil Sliti 3 ; Petra Gomez-Kramer 1 ; Najoua Essoukri Ben Amara 3 ; Remy Mullot 2 ;
作者机构: 1.L3i, University of La Rochelle
2.LITIS, University of Rouen, Avenue de l'Universite
3.SAGE, University of Sousse, Ecole Nationale d'Ingenieurs de Sousse
关键词: Historical Document Images;Segmentation;SLIC Superpixels;Gabor Filters;Multi-Scale Analysis;ARLSA
会议名称: International Conference on Computer Vision Theory and Applications
主办单位:
页码: 40-47
摘要: To reach the objective of ensuring the indexing and retrieval of digitized resources and offering a structured access to large sets of cultural heritage documents, a raising interest to historical document image segmentation has been generated. In fact, there is a real need for automatic algorithms ensuring the identification of homogenous regions or similar groups of pixels sharing some visual characteristics from historical documents (i.e. distinguishing graphic types, segmenting graphical regions from textual ones, and discriminating text in a variety of situations of different fonts and scales). Indeed, determining graphic regions can help to segment and analyze the graphical part in historical heritage, while finding text zones can be used as a pre-processing stage for character recognition, text line extraction, handwriting recognition, etc. Thus, we propose in this article an automatic segmentation method for historical document images based on extraction of homogeneous or similar content regions. The proposed algorithm is based on using simple linear iterative clustering (SLIC) superpixels, Gabor filters, multi-scale analysis, majority voting technique, connected component analysis, color layer separation, and an adaptive run-length smoothing algorithm (ARLSA). It has been evaluated on 1000 pages of historical documents and achieved interesting results.
分类号: TP391.41-53
- 相关文献
[1]Texture approach for nets extraction Application to old Arab newspapers images structuring. Mohamed Aymen Charrada,Najoua Essoukri Ben Amara. 2012
[2]Texture approach for nets extraction application to old Arab newspapers images structuring. Charrada, Mohamed Aymen,Ben Amara, Najoua Essoukri. 2012
[3]Fast Pulmonary Contour Extraction in X-ray CT Images: A Methodology and Quality Assessment. Augusto Silva,Jose S. Silva,Beatriz S. Santos,Carlos Ferreira. 2001
[4]Biological cell tracking with implicit active contours: preventing object fusions. Marion Feral,Christophe Zimrner,Jean-Christophe Olivo-Marin,SPIE-The International Society for Optical Engineering. 2003
[5]Performance Evaluation and Benchmarking of Six Texture-based Feature Sets for Segmenting Historical Documents. Maroua Mehri,Mohamed Mhiri,Pierre Heroux,Petra Gomez-Kramer,Mohamed Ali Mahjoub,Remy Mullot. 2014
作者其他论文 更多>>
-
Transductive Transfer Learning to Specialize a Generic Classifier Towards a Specific Scene
作者:Houda Maamatou;Thierry Chateau;Sami Gazzah;Yann Goyat;Najoua Essoukri Ben Amara
关键词:Transductive Transfer Learning;Specialization;Generic Classifier;Pedestrian Detection;Sequential Monte Carlo Filter (SMC)
-
A Dataset for Arabic Text Detection, Tracking and Recognition in News Videos- AcTiV
作者:Oussama Zayene;Jean Hennebert;Sameh Masmoudi Touj;Rolf Ingold;Najoua Essoukri Ben Amara
关键词:Video OCR;Video database;Benchmark;Arabic text
-
Interactive Content-Based Document Retrieval Using Fuzzy Attributed Relational Graph Matching
作者:Ramzi CHAIEB;Karim KALTI;Najoua ESSOUKRI BEN AMARA
关键词:Fuzzy Attributed Relational Graph indexing;Graph matching distance;Document image retrieval;User interaction
-
Modalities Combination for Italian Sign Language Extraction and Recognition
作者:Bassem Seddik;Sami Gazzah;Najoua Essoukri Ben Amara
关键词:Motion spotting;Action recognition;Fisher vector;Modalities combination;Classification fusion
-
Performance Evaluation and Benchmarking of Six Texture-based Feature Sets for Segmenting Historical Documents
作者:Maroua Mehri;Mohamed Mhiri;Pierre Heroux;Petra Gomez-Kramer;Mohamed Ali Mahjoub;Remy Mullot
关键词:Historical digitized document images;Segmentation;Texture;Multiscale approach
-
SID Signature Database: A Tunisian Off-line Handwritten Signature Database
作者:Imen Abroug Ben Abdelghani;Najoua Essoukri Ben Amara
关键词:Off-line handwritten signature;SID-Signature database;Tunisian signature;Planar modeling signature
-
Texture approach for nets extraction Application to old Arab newspapers images structuring
作者:Mohamed Aymen Charrada;Najoua Essoukri Ben Amara
关键词:Segmentation;degradations;historical Arab periodicals;Gabor filters;post-processing