Comparison of machine learning approaches with a general linear model to predict personal exposure to benzene

Aquilina, Noel; Delgado-Saborit, Juana Maria; Bugelli, Stefano; Padovani Ginies, Jason; Harrison, Roy M.

Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/58325

Title:	Comparison of machine learning approaches with a general linear model to predict personal exposure to benzene
Authors:	Aquilina, Noel Delgado-Saborit, Juana Maria Bugelli, Stefano Padovani Ginies, Jason Harrison, Roy M.
Keywords:	Machine learning Air -- Pollution -- Measurement Benzene Benzene in the body
Issue Date:	2018
Publisher:	American Chemical Society
Citation:	Aquilina, N., Delgado-Saborit, J. M., Bugelli, S., Padovani Ginies, J., & Harrison, R. M. (2018). Comparison of machine learning approaches with a general linear model to predict personal exposure to benzene. Environmental Science & Technology, 52(19), 11215-11222.
Abstract:	Machine learning techniques (MLTs) offer great power in analyzing complex data sets and have not previously been applied to nonoccupational pollutant exposure. MLT models that can predict personal exposure to benzene have been developed and compared with a standard model using a linear regression approach (GLM). The models were tested against independent data sets obtained from three personal exposure measurement campaigns. A correlation-based feature subset (CFS) selection algorithm identified a reduced attribute set, with common attributes grouped under the use of paints in homes, upholstery materials, space heating, and environmental tobacco smoke as the attributes suitable to predict the personal exposure to benzene. Personal exposure was categorized as low, medium, and high, and for big data sets, both the GLM and MLTs show high variability in performance to correctly classify greater than 90 percentile concentrations, but the MLT models have a higher score when accounting for divergence of incorrectly classified cases. Overall, the MLTs perform at least as well as the GLM and avoid the need to input microenvironment concentrations.
URI:	https://www.um.edu.mt/library/oar/handle/123456789/58325
Appears in Collections:	Scholarly Works - FacSciGeo

Files in This Item:

File	Description	Size	Format
Comparison_of_machine_learning_approaches_with_a_general_linear_model_to_predict_personal_exposure_to_benzene_2018.pdf Restricted Access		680.38 kB	Adobe PDF	View/Open Request a copy

Show full item record Statistics