Predicting Residential Water Demand with Machine-Based Statistical Learning

Dongwoo Lee, Sybil Derrible

Research output: Contribution to journalArticlepeer-review

36 Scopus citations

Abstract

Predicting residential water demand is challenging because of two technical questions: (1) which data and variables should be used and (2) which modeling technique is most appropriate for high prediction accuracy. To address these issues, this article investigates 12 statistical techniques, including parametric models and machine learning (ML) models, to predict daily household water use. In addition, two data scenarios are adopted, one with only 6 variables, generally available to cities and water utilities (general scenario), and one with all 19 variables available from the Residential End-Use 2016 database (REU 2016 scenario). The results for the REU 2016 scenario indicate that ML models outperform linear models. In particular, gradient boosting regression (GBR) performs best with an Radj2 of 0.69 compared to 0.54 for linear regression. The performance gap between ML and linear models becomes even wider for the general scenario with an Radj2 of 0.60 for GBR compared to 0.33 for linear regression. The finding in this article can be useful to researchers, municipalities, and utilities seeking novel modeling techniques that can provide consistent modeling performance-i.e., high prediction accuracy-depending on data availability. Future work could include the development of new measures to increase the interpretability of ML models to better understand causal relationships between independent variables and daily household water use.

Original languageEnglish
Article number04019067
JournalJournal of Water Resources Planning and Management - ASCE
Volume146
Issue number1
DOIs
StatePublished - 1 Jan 2020

Fingerprint

Dive into the research topics of 'Predicting Residential Water Demand with Machine-Based Statistical Learning'. Together they form a unique fingerprint.

Cite this