First Advisor

Wayne Wakeland

Date of Publication

Fall 1-23-2014

Document Type

Thesis

Degree Name

Master of Science (M.S.) in Systems Science

Department

Systems Science

Language

English

Subjects

Neural networks (Computer science), Artificial intelligence -- Data processing -- Mathematical models, Machine learning -- Mathematical models, Computer vision -- Mathematical models

DOI

10.15760/etd.1549

Physical Description

1 online resource (iv, 37 pages)

Abstract

One of the most impressive qualities of the brain is its neuro-plasticity. The neocortex has roughly the same structure throughout its whole surface, yet it is involved in a variety of different tasks from vision to motor control, and regions which once performed one task can learn to perform another. Machine learning algorithms which aim to be plausible models of the neocortex should also display this plasticity. One such candidate is the stacked denoising autoencoder (SDA). SDA's have shown promising results in the field of machine perception where they have been used to learn abstract features from unlabeled data. In this thesis I develop a flexible distributed implementation of an SDA and train it on images and audio spectrograms to experimentally determine properties comparable to neuro-plasticity. Specifically, I compare the visual-auditory generalization between a multi-level denoising autoencoder trained with greedy, layer-wise pre-training (GLWPT), to one trained without. I test a hypothesis that multi-modal networks will perform better than uni-modal networks due to the greater generality of features that may be learned. Furthermore, I also test the hypothesis that the magnitude of improvement gained from this multi-modal training is greater when GLWPT is applied than when it is not. My findings indicate that these hypotheses were not confirmed, but that GLWPT still helps multi-modal networks adapt to their second sensory modality.

Rights

In Copyright. URI: http://rightsstatements.org/vocab/InC/1.0/ This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).

Persistent Identifier

http://archives.pdx.edu/ds/psu/10572

Recommended Citation

Nifong, Nathaniel H., "Learning General Features From Images and Audio With Stacked Denoising Autoencoders" (2014). Dissertations and Theses. Paper 1550.
https://doi.org/10.15760/etd.1549

Download

Included in

Artificial Intelligence and Robotics Commons

COinS

Dissertations and Theses

Learning General Features From Images and Audio With Stacked Denoising Autoencoders

First Advisor

Date of Publication

Document Type

Degree Name

Department

Language

Subjects

DOI

Physical Description

Abstract

Rights

Persistent Identifier

Recommended Citation

Included in

Find

Connect

Dissertations and Theses

Learning General Features From Images and Audio With Stacked Denoising Autoencoders

Author

Sponsor

First Advisor

Date of Publication

Document Type

Degree Name

Department

Language

Subjects

DOI

Physical Description

Abstract

Rights

Persistent Identifier

Recommended Citation

Included in

Share

Find

Connect