Multiple imputations by chained equations for recovering missing daily streamflow observations: A case study of Langat River basin in Malaysia

Document Type

Article

Publication Date

1-2-2022

Abstract

Missing values in hydrological studies are a common issue for hydrologists, especially in statistical analyses as a complete dataset is required. This work evaluates the performance of the multiple imputations by chained equations (MICE) approach to predicting recurrence in streamflow datasets. To evaluate and verify the effectiveness of the MICE approach in treating missing streamflow data, complete historical daily streamflow data from 2012 to 2014 were used. Later, MICE methods coupled with multiple linear regression (MLR) were applied to restore streamflow rates in Malaysia's Langat River basin from 1978 to 2016. The best estimation methods are validated with tests such as adjusted R-squared (Adj R-2), residual standard error (RSE), and mean absolute percentage error (MAPE). The findings revealed that the classification and regression tree (CART) method combined with MLR outperformed the other approaches tested, with the highest Adj R-2 value and the lowest RSE and MAPE values observed regardless of missing conditions.

Keywords

A, Castellarin, A, Castellarin & A, Requena, Missing data, Multiple imputations, Chained equations, Streamflow, CART, MICE

Divisions

fac_eng

Funders

Geran Universiti Penyelidikan[GUP-2020-013]

Publication Title

Hydrological Sciences Journal-Journal des Sciences Hydrologiques

Volume

67

Issue

1

Publisher

Taylor & Francis

Publisher Location

2-4 PARK SQUARE, MILTON PARK, ABINGDON OR14 4RN, OXON, ENGLAND

This document is currently not available here.

Share

COinS