Library on-shelf book segmentation and recognition based on deep visual features

文献类型: 外文期刊

第一作者: Zhou, Shuo

作者: Zhou, Shuo;Sun, Tan;Xia, Xue;Zhang, Ning;Xian, Guojian;Chai, Xiujuan;Zhou, Shuo;Sun, Tan;Xia, Xue;Zhang, Ning;Xian, Guojian;Chai, Xiujuan;Huang, Bo

作者机构:

关键词: Library information management; On-shelf book recognition; Book spine segmentation; Deep learning; Library robot

期刊名称:INFORMATION PROCESSING & MANAGEMENT ( 影响因子:7.466; 五年影响因子:7.036 )

ISSN: 0306-4573

年卷期: 2022 年 59 卷 6 期

页码:

收录情况: SCI

摘要: On-shelf book segmentation and recognition are crucial steps in library inventory management and daily operation. In this paper, a detailed investigation of related work is conducted. RFID and barcode-based solutions suffer from expensive hardware facilities and long-term maintenance. Digital Image processing and OCR techniques are flawed due to a lack of accuracy and robustness. On this basis, we propose a visual and non-character system utilizing deep learning methods to accomplish on-shelf book segmentation and recognition tasks. Firstly, book spine masks are extracted from the image of on-shelf books by instance segmentation model, followed by affine transformation to rectangle images. Secondly, a spine feature encoder is trained to learn the deep visual features of spine images. Finally, the book inventory search space is constructed and the similarity metric between spine visual representations is calculated to recognize the target book identity. To train the models we collect high-resolution datasets of 10k-level and develop a data annotation software accordingly. For validation, we design simulated scenarios of recognizing 3.6k IDs from 5.6k book spines and achieve a best top1 accuracy of 99.18% and top5 accuracy of 99.91%. Furthermore, we develop a prototype of a mobile library management robot with embedded edge intelligence. It can automatically perform on-shelf book image capturing, spine segmentation and recognition, and target book grasping workflow.

分类号:

  • 相关文献
作者其他论文 更多>>