Back to Search Start Over

Data complexity: An FCA-based approach.

Authors :
Buzmakov, Alexey
Dudyrev, Egor
Kuznetsov, Sergei O.
Makhalova, Tatiana
Napoli, Amedeo
Source :
International Journal of Approximate Reasoning. Feb2024, Vol. 165, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

In this paper we propose different indices for measuring the complexity of a dataset in terms of Formal Concept Analysis (FCA). We extend the lines of the research about the "closure structure" and the "closure index" based on minimum generators of intents (aka closed itemsets). We would try to capture statistical properties of a dataset, not just extremal characteristics, such as the size of a passkey. For doing so we introduce an alternative approach where we measure the complexity of a dataset w.r.t. five significant elements that can be computed in a concept lattice, namely intents (closed sets of attributes), pseudo-intents, proper premises, keys (minimal generators), and passkeys (minimum generators). Then we define several original indices allowing us to estimate the complexity of a dataset. Moreover we study the distribution of all these different elements and indices in various real-world and synthetic datasets. Finally, we investigate the relations existing between these significant elements and indices, and as well the relations with implications and association rules. [ABSTRACT FROM AUTHOR]

Subjects

Subjects :
*MEASUREMENT

Details

Language :
English
ISSN :
0888613X
Volume :
165
Database :
Academic Search Index
Journal :
International Journal of Approximate Reasoning
Publication Type :
Periodical
Accession number :
174666546
Full Text :
https://doi.org/10.1016/j.ijar.2023.109084