Exploring Object Detection and Recognition Methods for Automated Book Inventory
(2019) In Master's Theses in Mathematical Sciences FMAM05 20192Mathematics (Faculty of Engineering)
- Abstract
- As more modern technical devices with cameras are accessible to people, more photos are taken and the amount of data that is available has vastly increased. Researchers have learnt to utilise this and use images for various purposes with Deep Learning based Computer Vision. For instance, one can teach a computer to perform an automated inventory, which could be a very time consuming task. Librarians are some of the affected people, they spend a lot of their time to arrange and go through the books on the bookshelves. In an attempt to automate the inventory of books, different object detection and recognition methods have been explored in this Master Thesis. The state of the art object detection method Mask Region Convolutional Neural... (More)
- As more modern technical devices with cameras are accessible to people, more photos are taken and the amount of data that is available has vastly increased. Researchers have learnt to utilise this and use images for various purposes with Deep Learning based Computer Vision. For instance, one can teach a computer to perform an automated inventory, which could be a very time consuming task. Librarians are some of the affected people, they spend a lot of their time to arrange and go through the books on the bookshelves. In an attempt to automate the inventory of books, different object detection and recognition methods have been explored in this Master Thesis. The state of the art object detection method Mask Region Convolutional Neural Network (Mask R-CNN) was used to find book spines in images and two Convolutional Neural Networks (CNN) with different loss functions; categorical cross-entropy and triplet loss, were used to recognise each book the Mask R-CNN had found. In addition, various data augmentation techniques were used to artificially expand and diversify the training dataset. The Mask R-CNN reached a mean Average Precision (mAP) of 94% on one of the test sets and the recognition networks each got 90% accuracy on the books that had been found. (Less)
Please use this url to cite or link to this publication:
http://lup.lub.lu.se/student-papers/record/8998626
- author
- Svensson, Anna LU
- supervisor
- organization
- course
- FMAM05 20192
- year
- 2019
- type
- H2 - Master's Degree (Two Years)
- subject
- keywords
- Object Detection, Deep Learning, Automated Book Inventory, Computer Vision, Data Augmentation, Mask R-CNN, Triplet Loss
- publication/series
- Master's Theses in Mathematical Sciences
- report number
- LUTFMA-3394-2019
- ISSN
- 1404-6342
- other publication id
- 2019:E62
- language
- English
- id
- 8998626
- date added to LUP
- 2020-01-14 11:24:49
- date last changed
- 2020-01-14 11:24:49
@misc{8998626, abstract = {{As more modern technical devices with cameras are accessible to people, more photos are taken and the amount of data that is available has vastly increased. Researchers have learnt to utilise this and use images for various purposes with Deep Learning based Computer Vision. For instance, one can teach a computer to perform an automated inventory, which could be a very time consuming task. Librarians are some of the affected people, they spend a lot of their time to arrange and go through the books on the bookshelves. In an attempt to automate the inventory of books, different object detection and recognition methods have been explored in this Master Thesis. The state of the art object detection method Mask Region Convolutional Neural Network (Mask R-CNN) was used to find book spines in images and two Convolutional Neural Networks (CNN) with different loss functions; categorical cross-entropy and triplet loss, were used to recognise each book the Mask R-CNN had found. In addition, various data augmentation techniques were used to artificially expand and diversify the training dataset. The Mask R-CNN reached a mean Average Precision (mAP) of 94% on one of the test sets and the recognition networks each got 90% accuracy on the books that had been found.}}, author = {{Svensson, Anna}}, issn = {{1404-6342}}, language = {{eng}}, note = {{Student Paper}}, series = {{Master's Theses in Mathematical Sciences}}, title = {{Exploring Object Detection and Recognition Methods for Automated Book Inventory}}, year = {{2019}}, }