Advanced

Exploring Object Detection and Recognition Methods for Automated Book Inventory

Svensson, Anna LU (2019) In Master's Theses in Mathematical Sciences FMAM05 20192
Mathematics (Faculty of Engineering)
Abstract
As more modern technical devices with cameras are accessible to people, more photos are taken and the amount of data that is available has vastly increased. Researchers have learnt to utilise this and use images for various purposes with Deep Learning based Computer Vision. For instance, one can teach a computer to perform an automated inventory, which could be a very time consuming task. Librarians are some of the affected people, they spend a lot of their time to arrange and go through the books on the bookshelves. In an attempt to automate the inventory of books, different object detection and recognition methods have been explored in this Master Thesis. The state of the art object detection method Mask Region Convolutional Neural... (More)
As more modern technical devices with cameras are accessible to people, more photos are taken and the amount of data that is available has vastly increased. Researchers have learnt to utilise this and use images for various purposes with Deep Learning based Computer Vision. For instance, one can teach a computer to perform an automated inventory, which could be a very time consuming task. Librarians are some of the affected people, they spend a lot of their time to arrange and go through the books on the bookshelves. In an attempt to automate the inventory of books, different object detection and recognition methods have been explored in this Master Thesis. The state of the art object detection method Mask Region Convolutional Neural Network (Mask R-CNN) was used to find book spines in images and two Convolutional Neural Networks (CNN) with different loss functions; categorical cross-entropy and triplet loss, were used to recognise each book the Mask R-CNN had found. In addition, various data augmentation techniques were used to artificially expand and diversify the training dataset. The Mask R-CNN reached a mean Average Precision (mAP) of 94% on one of the test sets and the recognition networks each got 90% accuracy on the books that had been found. (Less)
Please use this url to cite or link to this publication:
author
Svensson, Anna LU
supervisor
organization
course
FMAM05 20192
year
type
H2 - Master's Degree (Two Years)
subject
keywords
Object Detection, Deep Learning, Automated Book Inventory, Computer Vision, Data Augmentation, Mask R-CNN, Triplet Loss
publication/series
Master's Theses in Mathematical Sciences
report number
LUTFMA-3394-2019
ISSN
1404-6342
other publication id
2019:E62
language
English
id
8998626
date added to LUP
2020-01-14 11:24:49
date last changed
2020-01-14 11:24:49
@misc{8998626,
  abstract     = {As more modern technical devices with cameras are accessible to people, more photos are taken and the amount of data that is available has vastly increased. Researchers have learnt to utilise this and use images for various purposes with Deep Learning based Computer Vision. For instance, one can teach a computer to perform an automated inventory, which could be a very time consuming task. Librarians are some of the affected people, they spend a lot of their time to arrange and go through the books on the bookshelves. In an attempt to automate the inventory of books, different object detection and recognition methods have been explored in this Master Thesis. The state of the art object detection method Mask Region Convolutional Neural Network (Mask R-CNN) was used to find book spines in images and two Convolutional Neural Networks (CNN) with different loss functions; categorical cross-entropy and triplet loss, were used to recognise each book the Mask R-CNN had found. In addition, various data augmentation techniques were used to artificially expand and diversify the training dataset. The Mask R-CNN reached a mean Average Precision (mAP) of 94% on one of the test sets and the recognition networks each got 90% accuracy on the books that had been found.},
  author       = {Svensson, Anna},
  issn         = {1404-6342},
  keyword      = {Object Detection,Deep Learning,Automated Book Inventory,Computer Vision,Data Augmentation,Mask R-CNN,Triplet Loss},
  language     = {eng},
  note         = {Student Paper},
  series       = {Master's Theses in Mathematical Sciences},
  title        = {Exploring Object Detection and Recognition Methods for Automated Book Inventory},
  year         = {2019},
}