A Novel Machine Learning Algorithm to Reduce Prediction Error and Accelerate Learning Curve for Very Large Datasets

Published In

2019 IEEE 49th International Symposium on Multiple-Valued Logic (ISMVL)

Document Type

Citation

Publication Date

7-11-2019

Abstract

This paper presents a novel machine learning algorithm with an improved accuracy and a faster learning curve, for very large datasets. Previously, an algorithm using lr-partitions was designed to improve upon C4.5. However, this algorithm has a relatively high percentage of undefined combinations of attribute values in its final results, increasing the learning error. In this paper, a new type of clustering algorithm was proposed to generate output values for those undefined combinations, thus accelerating the learning curve and reducing the prediction error by several percentage points on various popular datasets from the UCI Machine Learning Database.

Description

© Copyright 2019 IEEE - All rights reserved.

DOI

10.1109/ISMVL.2019.00025

Persistent Identifier

https://archives.pdx.edu/ds/psu/30109

Share

COinS