TY - JOUR
T1 - Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks
AU - Dennler, Nik
AU - Rastogi, Shavika
AU - Fonollosa, Jordi
AU - von Schaik, André
AU - Schmuker, Michael
N1 - Funding Information:
We thank A. J. Lilienthal, M. Psarrou and S. Sutton for fruitful discussions on multiple occasions, which led to valuable insights. MS was funded by the NSF/CIHR/DFG/FRQ/UKRI-MRC Next Generation Networks for Neuroscience Program (NSF award no. 2014217 , MRC award no. MR/T046759/1 ), and the EU Flagship Human Brain Project SGA3 (H2020 award no. 945539 ). JF acknowledges the Spanish Ministry of Economy and Competitiveness DPI2017-89827-R , Networking Biomedical Research Centre in the subject area of Bioengineering, Biomaterials and Nanomedicine, initiatives of Instituto de Investigación Carlos III, Share4Rare Project (Grant agreement 780262 ), and ACCIÓ (Innotec A CE014/20/000018 ). JF also acknowledges the CERCA Programme/Generalitat de Catalunya and the Serra Húnter Program . B2SLab is certified as 2017 SGR 952.
Funding Information:
We thank A. J. Lilienthal, M. Psarrou and S. Sutton for fruitful discussions on multiple occasions, which led to valuable insights. MS was funded by the NSF/CIHR/DFG/FRQ/UKRI-MRC Next Generation Networks for Neuroscience Program (NSF award no. 2014217, MRC award no. MR/T046759/1), and the EU Flagship Human Brain Project SGA3 (H2020 award no. 945539). JF acknowledges the Spanish Ministry of Economy and Competitiveness DPI2017-89827-R, Networking Biomedical Research Centre in the subject area of Bioengineering, Biomaterials and Nanomedicine, initiatives of Instituto de Investigaci?n Carlos III, Share4Rare Project (Grant agreement 780262), and ACCI? (Innotec ACE014/20/000018). JF also acknowledges the CERCA Programme/Generalitat de Catalunya and the Serra H?nter Program. B2SLab is certified as 2017 SGR 952.
Publisher Copyright:
© 2022
PY - 2022/3/15
Y1 - 2022/3/15
N2 - Metal oxide (MOx) gas sensors are a popular choice for many applications, due to their tunable sensitivity, space efficiency and low cost. Publicly available sensor datasets are particularly valuable for the research community as they accelerate the development and evaluation of novel algorithms for gas sensor data analysis. A dataset published in 2013 by Vergara and colleagues contains recordings from MOx gas sensor arrays in a wind tunnel. It has since become a standard benchmark in the field. Here we report a latent property of this dataset that limits its suitability for gas classification studies. Measurement timestamps show that gases were recorded in separate, temporally clustered batches. Sensor baseline response before gas exposure were strongly correlated with the recording batch, to the extent that baseline response was largely sufficient to infer the gas used in a given trial. Zero-offset baseline compensation did not resolve the issue, since residual short-term drift still contained enough information for gas/trial identification using a machine learning classifier. A subset of the data recorded within a short period of time was minimally affected by drift and suitable for gas classification benchmarking after offset-compensation, but with much reduced classification performance compared to the full dataset. We found 18 publications where this dataset was used without precautions against the circumstances we describe, thus potentially overestimating the accuracy of gas classification algorithms. These observations highlight potential pitfalls in using previously recorded gas sensor data, which may have distorted widely reported results.
AB - Metal oxide (MOx) gas sensors are a popular choice for many applications, due to their tunable sensitivity, space efficiency and low cost. Publicly available sensor datasets are particularly valuable for the research community as they accelerate the development and evaluation of novel algorithms for gas sensor data analysis. A dataset published in 2013 by Vergara and colleagues contains recordings from MOx gas sensor arrays in a wind tunnel. It has since become a standard benchmark in the field. Here we report a latent property of this dataset that limits its suitability for gas classification studies. Measurement timestamps show that gases were recorded in separate, temporally clustered batches. Sensor baseline response before gas exposure were strongly correlated with the recording batch, to the extent that baseline response was largely sufficient to infer the gas used in a given trial. Zero-offset baseline compensation did not resolve the issue, since residual short-term drift still contained enough information for gas/trial identification using a machine learning classifier. A subset of the data recorded within a short period of time was minimally affected by drift and suitable for gas classification benchmarking after offset-compensation, but with much reduced classification performance compared to the full dataset. We found 18 publications where this dataset was used without precautions against the circumstances we describe, thus potentially overestimating the accuracy of gas classification algorithms. These observations highlight potential pitfalls in using previously recorded gas sensor data, which may have distorted widely reported results.
KW - Gas recognition
KW - Metal oxide gas sensors
KW - Sensor drift
KW - Wind tunnel dataset
UR - http://www.scopus.com/inward/record.url?scp=85126630758&partnerID=8YFLogxK
U2 - 10.1016/j.snb.2022.131668
DO - 10.1016/j.snb.2022.131668
M3 - Article
SN - 0925-4005
VL - 361
JO - Sensors and Actuators B: Chemical
JF - Sensors and Actuators B: Chemical
M1 - 131668
ER -