Logical Analysis of Data: Classification with justification

Boros, Endre; Crama, Yves; Hammer, Peter L.; Ibaraki, Toshihide; Kogan, Alexander; Makino, Kazuhisa

doi:10.1007/s10479-011-0916-1

Article (Scientific journals)

2011 • In Annals of Operations Research, 188, p. 33-61

Peer Reviewed verified by ORBi

Permalink
https://hdl.handle.net/2268/90494

DOI
10.1007/s10479-011-0916-1

Files

Full Text

LeukerbadiaRevisedDecember2010.pdf

Author preprint (666.5 kB)

Details

Keywords :

classification; data mining; Boolean functions; LAD

Abstract :

Learning from examples is a frequently arising challenge, with a large number of algorithms proposed in the classification, data mining and machine learning literature. The evaluation of the quality of such algorithms is frequently carried out ex post, on an experimental basis: their performance is measured either by cross-validation on benchmark data sets, or by clinical trials. Few of these approaches evaluate the learning process ex ante, on its own merits. In this paper, we discuss a property of rule-based classifiers which we call "justifiability", and which focuses on the type of information extracted from the given training set in order to classify new observations. We investigate some interesting mathematical properties of justifiable classifiers. In particular, we establish the existence of justifiable classifiers, and we show that several well-known learning approaches, such as decision trees or nearest neighbor based methods, automatically provide justifiable classifiers. We also identify maximal subsets of observations which must be classified in the same way by every justifiable classifier. Finally, we illustrate by a numerical example that using classifiers based on "most justifiable" rules does not seem to lead to overfitting, even though it involves an element of optimization.

Research Center/Unit :

QuantOM

Disciplines :

Computer science
Quantitative methods in economics & management
Mathematics

Author, co-author :

Boros, Endre

Crama, Yves ; Université de Liège - ULiège > HEC-Ecole de gestion > QuantOM

Hammer, Peter L.

Ibaraki, Toshihide

Kogan, Alexander

Makino, Kazuhisa

Language :

English

Title :

Logical Analysis of Data: Classification with justification

Publication date :

2011

Journal title :

Annals of Operations Research

ISSN :

0254-5330

eISSN :

1572-9338

Publisher :

Springer Science & Business Media B.V.

Special issue title :

The Mathematics of Peter L. Hammer (1936-2006): Graphs, Optimization, and Boolean Models

Volume :

188

Pages :

33-61

Peer reviewed :

Peer Reviewed verified by ORBi

Available on ORBi :

since 09 May 2011
