Please use this identifier to cite or link to this item:
Title: Comparison of machine learning approaches with a general linear model to predict personal exposure to benzene
Authors: Aquilina, Noel
Delgado-Saborit, Juana Maria
Bugelli, Stefano
Padovani Ginies, Jason
Harrison, Roy M.
Keywords: Machine learning
Air -- Pollution -- Measurement
Benzene in the body
Issue Date: 2018
Publisher: American Chemical Society
Citation: Aquilina, N., Delgado-Saborit, J. M., Bugelli, S., Padovani Ginies, J., & Harrison, R. M. (2018). Comparison of machine learning approaches with a general linear model to predict personal exposure to benzene. Environmental Science & Technology, 52(19), 11215-11222.
Abstract: Machine learning techniques (MLTs) offer great power in analyzing complex data sets and have not previously been applied to nonoccupational pollutant exposure. MLT models that can predict personal exposure to benzene have been developed and compared with a standard model using a linear regression approach (GLM). The models were tested against independent data sets obtained from three personal exposure measurement campaigns. A correlation-based feature subset (CFS) selection algorithm identified a reduced attribute set, with common attributes grouped under the use of paints in homes, upholstery materials, space heating, and environmental tobacco smoke as the attributes suitable to predict the personal exposure to benzene. Personal exposure was categorized as low, medium, and high, and for big data sets, both the GLM and MLTs show high variability in performance to correctly classify greater than 90 percentile concentrations, but the MLT models have a higher score when accounting for divergence of incorrectly classified cases. Overall, the MLTs perform at least as well as the GLM and avoid the need to input microenvironment concentrations.
Appears in Collections:Scholarly Works - FacSciGeo

Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.