Start Over

Partial Data Querying Through Racing Algorithms

Authors :: Vu-Linh Nguyen
Sébastien Destercke
Marie-Hélène Masson
Heuristique et Diagnostic des Systèmes Complexes [Compiègne] (Heudiasyc)
Université de Technologie de Compiègne (UTC)-Centre National de la Recherche Scientifique (CNRS)
Laboratoire d'Excellence 'Maîtrise des Systèmes de Systèmes Technologiques' (Labex MS2T)
Université de Picardie Jules Verne (UPJV)
ANR-11-IDEX-0004,SUPER,Sorbonne Universités à Paris pour l'Enseignement et la Recherche(2011)
Source :: Lecture Notes in Computer Science ISBN: 9783319490458, IUKM, International Journal of Approximate Reasoning, International Journal of Approximate Reasoning, Elsevier, 2018, 96, pp.36-55. ⟨10.1016/j.ijar.2018.03.005⟩, Lecture Notes in Artificial Intelligence, International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making (IUKM 2016), International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making (IUKM 2016), Nov 2016, Da Nang, Vietnam. pp.163-174, ⟨10.1007/978-3-319-49046-5_14⟩
Publication Year :: 2016
Publisher :: Springer International Publishing, 2016.
Abstract: International audience; The paper studies the problem of actively learning from instances characterized by imprecise features or imprecise class labels, where by actively learning we understand the possibility to query the precise value of imprecisely specified data. We differ from classical active learning by the fact that in the later, data are either fully precise or completely missing, while in our case they can be partially specified. Such situations can appear when sensor errors are important to encode, or when experts have only specified a subset of possible labels when tagging data. We provide a general active learning technique that can be applied in principle to any model. It is inspired from racing algorithms, in which several models are competing against each others. The main idea of our method is to identify the query that will be the most helpful in identifying the winning model in the competition. After discussing and formalizing the general ideas of our approach, we illustrate it by studying the particular case of binary SVM in the case of interval valued features and set-valued labels. The experimental results indicate that, in comparison to other baselines, racing algorithms provide a faster reduction of the uncertainty in the learning process, especially in the case of imprecise features.

Subjects :: querying
Process (engineering)
Active learning (machine learning)
Computer science
[INFO.INFO-OH]Computer Science [cs]/Other [cs.OH]
Value (computer science)
Binary number
02 engineering and technology
ENCODE
partial data
Machine learning
computer.software_genre
01 natural sciences
Theoretical Computer Science
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
Reduction (complexity)
Competition (economics)
data querying
Artificial Intelligence
active learning
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
set-valued labels
0101 mathematics
racing algorithms
Class (computer programming)
business.industry
Applied Mathematics
010102 general mathematics
Support vector machine
data
Line (geometry)
020201 artificial intelligence & image processing
Artificial intelligence
business
Algorithm
Value (mathematics)
computer
Software
interval-valued data

Details

ISBN :: 978-3-319-49045-8
ISSN :: 0888613X
ISBNs :: 9783319490458
Database :: OpenAIRE
Journal :: Lecture Notes in Computer Science ISBN: 9783319490458, IUKM, International Journal of Approximate Reasoning, International Journal of Approximate Reasoning, Elsevier, 2018, 96, pp.36-55. ⟨10.1016/j.ijar.2018.03.005⟩, Lecture Notes in Artificial Intelligence, International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making (IUKM 2016), International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making (IUKM 2016), Nov 2016, Da Nang, Vietnam. pp.163-174, ⟨10.1007/978-3-319-49046-5_14⟩
Accession number :: edsair.doi.dedup.....ec8402eb07eb992de101516a8c53a19b
Full Text :: https://doi.org/10.1007/978-3-319-49046-5_14