TY - JOUR
T1 - Self-organising maps for the exploration and classification of thin-layer chromatograms
AU - Guggenberger, Matthias
AU - Oberlerchner, Josua T.
AU - Grausgruber, Heinrich
AU - Rosenau, Thomas
AU - Böhmdorfer, Stefan
N1 - Funding Information:
We are grateful for the set of essential oils provided as a gift by Sonnentor Kr?uterhandelsgesellschaft mbH. The support of the Austrian Biorefinery Center Tulln (ABCT) is gratefully acknowledged. We are also grateful for the support by our industry partners in the frame of the Flippr2 Project, Mondi, Sappi, Zellstoff P?ls AG, a member of Heinzel Pulp, and Papierholz Austria. The K-Project Flippr2 is funded as part of COMET-Competence Centers for Excellent Technologies promoted by BMVIT, BMWFJ, the states of Styria (Austria), and Carinthia (Austria). The COMET program is managed by FFG - Austrian Research Promotion Agency (Austria). Open access funding provided by University of Natural Resources and Life Sciences Vienna (BOKU).
Publisher Copyright:
© 2021 The Author(s)
PY - 2021/10/1
Y1 - 2021/10/1
N2 - Thin-layer chromatography (TLC) allows the swift analysis of larger sample sets in almost any laboratory. The obtained chromatograms are patterns of coloured zones that are conveniently evaluated and classified by visual inspection. This manual approach reaches its limit when several dozens or a few hundred samples need to be evaluated. Methods to classify TLCs automatically and objectively have been explored but without a definitive conclusion; established methods, such as principal component analysis, suffer from the variability of the data, while contemporary omics methods were constructed for the analysis of large numbers of highly resolved analyses. Self-organizing maps (SOMs) are an algorithm for unsupervised learning that reduces higher dimensional datasets to a two-dimensional map, locating similar samples close to each other. It tolerates small variations between samples of the same type. We investigated the capability of SOMs for the evaluation of TLCs with two sample sets. With the first one (495 analyses of essential oils), it was confirmed that SOMs arrange the same type of sample in a common region. The obtained multi-class maps were used to classify a test set and to explore the causes for the few misclassifications (<3%). With the second test set (50 extracts of experimental wheats), the effects of a greater variability within substance classes was explored. With SOMs, it was possible to single out the exceptional samples that warranted a more detailed investigation. In addition, the SOM quality control index method was tested. It proved to be considerably stricter than the classification with a SOM of all samples. When this method was unable to classify a sample correctly, it would flag the sample for inspection, as it gave either multiple assignments or none at all. The combination of SOMs and TLC — two accessible analytical tools — can be most useful for the unsupervised classification of samples by TLC, and to identify samples that stand out from a set and are therefore worth the investment into additional analyses with more complex or expensive methods.
AB - Thin-layer chromatography (TLC) allows the swift analysis of larger sample sets in almost any laboratory. The obtained chromatograms are patterns of coloured zones that are conveniently evaluated and classified by visual inspection. This manual approach reaches its limit when several dozens or a few hundred samples need to be evaluated. Methods to classify TLCs automatically and objectively have been explored but without a definitive conclusion; established methods, such as principal component analysis, suffer from the variability of the data, while contemporary omics methods were constructed for the analysis of large numbers of highly resolved analyses. Self-organizing maps (SOMs) are an algorithm for unsupervised learning that reduces higher dimensional datasets to a two-dimensional map, locating similar samples close to each other. It tolerates small variations between samples of the same type. We investigated the capability of SOMs for the evaluation of TLCs with two sample sets. With the first one (495 analyses of essential oils), it was confirmed that SOMs arrange the same type of sample in a common region. The obtained multi-class maps were used to classify a test set and to explore the causes for the few misclassifications (<3%). With the second test set (50 extracts of experimental wheats), the effects of a greater variability within substance classes was explored. With SOMs, it was possible to single out the exceptional samples that warranted a more detailed investigation. In addition, the SOM quality control index method was tested. It proved to be considerably stricter than the classification with a SOM of all samples. When this method was unable to classify a sample correctly, it would flag the sample for inspection, as it gave either multiple assignments or none at all. The combination of SOMs and TLC — two accessible analytical tools — can be most useful for the unsupervised classification of samples by TLC, and to identify samples that stand out from a set and are therefore worth the investment into additional analyses with more complex or expensive methods.
KW - Anthocyanins
KW - Essential oils
KW - Principal component analysis
KW - SOMQC Index
KW - Wheat
UR - http://www.scopus.com/inward/record.url?scp=85106444548&partnerID=8YFLogxK
U2 - 10.1016/j.talanta.2021.122460
DO - 10.1016/j.talanta.2021.122460
M3 - Article
C2 - 34215100
AN - SCOPUS:85106444548
SN - 0039-9140
VL - 233
JO - Talanta
JF - Talanta
M1 - 122460
ER -