학술논문

A Survey of Vectorization Methods in Topological Data Analysis
Document Type
Periodical
Source
IEEE Transactions on Pattern Analysis and Machine Intelligence IEEE Trans. Pattern Anal. Mach. Intell. Pattern Analysis and Machine Intelligence, IEEE Transactions on. 45(12):14069-14080 Dec, 2023
Subject
Computing and Processing
Bioengineering
Measurement
Data analysis
Filtration
Machine learning
Benchmark testing
Task analysis
Electronic mail
Barcodes
persistent homology
topological data analysis
vectorization methods
Language
ISSN
0162-8828
2160-9292
1939-3539
Abstract
Attempts to incorporate topological information in supervised learning tasks have resulted in the creation of several techniques for vectorizing persistent homology barcodes. In this paper, we study thirteen such methods. Besides describing an organizational framework for these methods, we comprehensively benchmark them against three well-known classification tasks. Surprisingly, we discover that the best-performing method is a simple vectorization, which consists only of a few elementary summary statistics. Finally, we provide a convenient web application which has been designed to facilitate exploration and experimentation with various vectorization methods.