您的位置: 首页 > 外文期刊论文 > 详情页

Water potability classification based on hybrid stacked model and feature selection

作   者:
Ahmed M.,ElsheweyRasha Y.,YoussefHazem M.,El-BakryAhmed M.,Osman
作者机构:
Mansoura UniversitySuez University
关键词:
Feature selectionMachine learningStacking ensembleWater potability classificationWater potability
期刊名称:
Environmental Science and Pollution Research
i s s n:
0944-1344
年卷期:
2025 年 32 卷 13 期
页   码:
7933-7949
页   码:
摘   要:
Abstract Clean water requires accurate water quality categorization. A water potability (WP) dataset with pH, hardness, solids, chloramines, sulfate, conductivity, and other metrics for 3276 water bodies was used in this paper. After median imputation for missing values, normalization for feature scaling, and class imbalance correction using SMOTE, the Kaggle public dataset was prepared. With binary particle swarm optimization (BPSO) and binary whale optimization algorithm (BWAO), feature selection (FS) was used to determine the most important features for classification. A subset of seven essential characteristics is selected with the lowest average error of 0.3745 by the BPSO. Random forest (RF), gradient boosting (GB), support vector machine (SVM), Extra Tree (ET), decision tree (DT), and XGBoost are tested for WP prediction. The ET classifier ranked first, with 70.63% accuracy and 71.17% F1-score. Predictive performance was improved by stacking random forest, extra trees, and XGBoost base learners with Logistic Regression meta-learner. The stacking model improved with 69.53% accuracy, 70.23% F1-score, and 77.62% AUC. We found that stacking uses high-performing models to create a strong and balanced categorization framework. This paper shows that ensemble learning can improve WP categorization and that stacking may be a feasible way for measuring and managing water quality.
相关作者
载入中,请稍后...
相关机构
    载入中,请稍后...
应用推荐

意 见 箱

匿名:登录

个人用户登录

找回密码

第三方账号登录

忘记密码

个人用户注册

必须为有效邮箱
6~16位数字与字母组合
6~16位数字与字母组合
请输入正确的手机号码

信息补充