Document Type


Publication Date



Machine learning, Water quality -- Forecasting, Support vector machines, Supervised learning (Machine learning)


The objective of this research is to explore prediction accuracy of water quality factors, with techniques and algorithms in machine learning consisting of a variation of support vector machines - Support Vector Regression (SVR) and the gradient boosting algorithm Extreme Gradient Boosting (XGBoost). Both the XGBoost and SVR algorithms were used to predict nine different factors with success rates ranging from 79% to 99%. Parameters of these algorithms were also explored to test the prediction accuracy levels of individual water quality factors. These parameters included normalizing the data, filling missing data points, and training and testing on a large set of data.


© Copyright the author(s)

This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).

The purpose of this statement is to help the public understand how this Item may be used. When there is a (non-standard) License or contract that governs re-use of the associated Item, this statement only summarizes the effects of some of its terms. It is not a License, and should not be used to license your Work. To license your own Work, use a License offered at


Presentations associated with the report are available below in the Additional Files.

Persistent Identifier

joslyn-symposium.pdf (2170 kB)
2018 Symposium presentation

joslyn-ignite.pdf (3067 kB)
Ignite presentation