Interpretable classifiers for tabular data via feature selection and discretization

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

7 Downloads (Pure)

Abstract

We introduce a method for computing immediately human interpretable yet accurate classifiers from tabular data. The classifiers obtained are short Boolean formulas, computed via first discretizing the original data and then using feature selection coupled with a very fast algorithm for producing the best possible Boolean classifier for the setting. We demonstrate the approach via 12 experiments, obtaining results with accuracies comparable to ones obtained via random forests, XGBoost, and existing results for the same datasets in the literature. In most cases, the accuracy of our method is in fact similar to that of the reference methods, even though the main objective of our study is the immediate interpretability of our classifiers. We also prove a new result on the probability that the classifier we obtain from real-life data corresponds to the ideally best classifier with respect to the background distribution the data comes from.

Original languageEnglish
Title of host publicationDAO-XAI 2024: Data meets Ontologies in Explainable AI 2024
Subtitle of host publicationProceedings of the 4th International Workshop on Data meets Ontologies in Explainable AI co-located with the 27th European Conference on Artificial Intelligence (ECAI 2024)
PublisherCEUR-WS
Number of pages22
Publication statusPublished - 2024
Publication typeA4 Article in conference proceedings
EventInternational Workshop on Data meets Ontologies in Explainable AI - Santiago de Compostela, Spain
Duration: 19 Oct 202419 Oct 2024

Publication series

NameCEUR Workshop Proceedings
PublisherCEUR-WS
Volume3833
ISSN (Print)1613-0073

Workshop

WorkshopInternational Workshop on Data meets Ontologies in Explainable AI
Country/TerritorySpain
CitySantiago de Compostela
Period19/10/2419/10/24

Keywords

  • Boolean logic
  • Interpretable AI
  • Overfitting

Publication forum classification

  • Publication forum level 1

ASJC Scopus subject areas

  • General Computer Science

Fingerprint

Dive into the research topics of 'Interpretable classifiers for tabular data via feature selection and discretization'. Together they form a unique fingerprint.

Cite this