Back to Search Start Over

Partial Data Querying Through Racing Algorithms

Authors :
Vu-Linh Nguyen
Sébastien Destercke
Marie-Hélène Masson
Heuristique et Diagnostic des Systèmes Complexes [Compiègne] (Heudiasyc)
Université de Technologie de Compiègne (UTC)-Centre National de la Recherche Scientifique (CNRS)
Laboratoire d'Excellence 'Maîtrise des Systèmes de Systèmes Technologiques' (Labex MS2T)
Université de Picardie Jules Verne (UPJV)
ANR-11-IDEX-0004,SUPER,Sorbonne Universités à Paris pour l'Enseignement et la Recherche(2011)
Source :
Lecture Notes in Computer Science ISBN: 9783319490458, IUKM, International Journal of Approximate Reasoning, International Journal of Approximate Reasoning, Elsevier, 2018, 96, pp.36-55. ⟨10.1016/j.ijar.2018.03.005⟩, Lecture Notes in Artificial Intelligence, International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making (IUKM 2016), International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making (IUKM 2016), Nov 2016, Da Nang, Vietnam. pp.163-174, ⟨10.1007/978-3-319-49046-5_14⟩
Publication Year :
2016
Publisher :
Springer International Publishing, 2016.

Abstract

International audience; The paper studies the problem of actively learning from instances characterized by imprecise features or imprecise class labels, where by actively learning we understand the possibility to query the precise value of imprecisely specified data. We differ from classical active learning by the fact that in the later, data are either fully precise or completely missing, while in our case they can be partially specified. Such situations can appear when sensor errors are important to encode, or when experts have only specified a subset of possible labels when tagging data. We provide a general active learning technique that can be applied in principle to any model. It is inspired from racing algorithms, in which several models are competing against each others. The main idea of our method is to identify the query that will be the most helpful in identifying the winning model in the competition. After discussing and formalizing the general ideas of our approach, we illustrate it by studying the particular case of binary SVM in the case of interval valued features and set-valued labels. The experimental results indicate that, in comparison to other baselines, racing algorithms provide a faster reduction of the uncertainty in the learning process, especially in the case of imprecise features.

Details

ISBN :
978-3-319-49045-8
ISSN :
0888613X
ISBNs :
9783319490458
Database :
OpenAIRE
Journal :
Lecture Notes in Computer Science ISBN: 9783319490458, IUKM, International Journal of Approximate Reasoning, International Journal of Approximate Reasoning, Elsevier, 2018, 96, pp.36-55. ⟨10.1016/j.ijar.2018.03.005⟩, Lecture Notes in Artificial Intelligence, International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making (IUKM 2016), International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making (IUKM 2016), Nov 2016, Da Nang, Vietnam. pp.163-174, ⟨10.1007/978-3-319-49046-5_14⟩
Accession number :
edsair.doi.dedup.....ec8402eb07eb992de101516a8c53a19b
Full Text :
https://doi.org/10.1007/978-3-319-49046-5_14