Interpretable classifiers for tabular data via feature selection and discretization

Tutkimustuotos: KonferenssiartikkeliTieteellinenvertaisarvioitu

11 Lataukset (Pure)

Abstrakti

We introduce a method for computing immediately human interpretable yet accurate classifiers from tabular data. The classifiers obtained are short Boolean formulas, computed via first discretizing the original data and then using feature selection coupled with a very fast algorithm for producing the best possible Boolean classifier for the setting. We demonstrate the approach via 12 experiments, obtaining results with accuracies comparable to ones obtained via random forests, XGBoost, and existing results for the same datasets in the literature. In most cases, the accuracy of our method is in fact similar to that of the reference methods, even though the main objective of our study is the immediate interpretability of our classifiers. We also prove a new result on the probability that the classifier we obtain from real-life data corresponds to the ideally best classifier with respect to the background distribution the data comes from.

AlkuperäiskieliEnglanti
OtsikkoDAO-XAI 2024: Data meets Ontologies in Explainable AI 2024
AlaotsikkoProceedings of the 4th International Workshop on Data meets Ontologies in Explainable AI co-located with the 27th European Conference on Artificial Intelligence (ECAI 2024)
KustantajaCEUR-WS
Sivumäärä22
TilaJulkaistu - 2024
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaInternational Workshop on Data meets Ontologies in Explainable AI - Santiago de Compostela, Espanja
Kesto: 19 lokak. 202419 lokak. 2024

Julkaisusarja

NimiCEUR Workshop Proceedings
KustantajaCEUR-WS
Vuosikerta3833
ISSN (painettu)1613-0073

Workshop

WorkshopInternational Workshop on Data meets Ontologies in Explainable AI
Maa/AlueEspanja
KaupunkiSantiago de Compostela
Ajanjakso19/10/2419/10/24

Julkaisufoorumi-taso

  • Jufo-taso 1

!!ASJC Scopus subject areas

  • Yleinen tietojenkäsittelytiede

Sormenjälki

Sukella tutkimusaiheisiin 'Interpretable classifiers for tabular data via feature selection and discretization'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä