First Advisor
Melanie Mitchell
Date of Award
5-24-2019
Document Type
Thesis
Degree Name
Bachelor of Science (B.S.) in Computer Science and University Honors
Department
Computer Science
Subjects
Image processing -- Digital techniques -- Evaluation, Neural networks (Computer science), Computer vision, Machine learning
DOI
10.15760/honors.703
Abstract
This thesis evaluates the accuracy and performance of VGG16, a convolutional neural network (CNN), and YOLO v3, an object detector, on a dataset of 1000 hand-drawn images. Unlike with photographs, which possess high amounts of detail, sketches tend to lack much detail aside from the freehand lines that comprise them. This is further detailed in prior works about Sketch-based Image Retrieval (SBIR), a classification task to map photographs to sketches; and SketchParse, a CNN that analyzes sketch features and assigns captions. In this paper, I show the differences in classification accuracy between VGG16 and YOLO v3. The former model, pretrained on ImageNet, showed a test accuracy as high as 79.6%. On the other hand, YOLO v3, pretrained on MS COCO, performed worse; it misclassified objects in the dog category and across all categories, made no detections on several images.
Rights
In Copyright. URI: http://rightsstatements.org/vocab/InC/1.0/ This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
Persistent Identifier
https://archives.pdx.edu/ds/psu/28780
Recommended Citation
Hoang, Lee, "An Evaluation of VGG16 and YOLO v3 on Hand-drawn Images" (2019). University Honors Theses. Paper 693.
https://doi.org/10.15760/honors.703