Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Maximum Information Coefficient Feature Selection Method for Interval-Valued Data
Ist Teil von
IEEE access, 2024-01, Vol.12, p.1-1
Ort / Verlag
Piscataway: IEEE
Erscheinungsjahr
2024
Quelle
EZB Electronic Journals Library
Beschreibungen/Notizen
The feature selection for interval-valued data(IVD) aims to identify representative features from a large set of features, which can reduce the model complexity, minimize the training time, and enhance the generalization ability of the model. Addressing the inter-feature correlations in IVD, we propose a feature selection method called the maximum information coefficient for interval-valued data(IVD_MIC). First, the method balances the relationship between the midpoint and radius of IVD with an adjustment factor, constructing the interval-valued data unified representation frame (URF). Based on the URF, the method measures the degree of correlation between two features by calculating the maximum information coefficient, and obtains the maximum information coefficient matrix for IVD. Then the features with strong correlation are progressively removed from three perspectives(row, column, and both row and column), generating a series of corresponding candidate feature subsets. Finally, IVD_MIC is validated on candidate feature subsets to obtain the final classification accuracy and optimal feature subset. The experiment results on synthetic and real-world datasets with different classifiers demonstrate that the overall performance of IVD_MIC surpasses other methods. The average accuracy of IVD_MIC is higher, improving by 0.23%, 0.53% and 0.45% compared to the second-best method on LIBSVM, CART Tree and KNN, respectively.