Author: "Jan N. van Rijn" / Topic: artificial intelligence - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Jan N. van Rijn"' showing total 14 results

Start Over Author "Jan N. van Rijn" Topic artificial intelligence

14 results on '"Jan N. van Rijn"'

1. Stateless neural meta-learning using second-order gradients

Author: Mike Huisman, Aske Plaat, and Jan N. van Rijn
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Statistics - Machine Learning, Artificial Intelligence, Machine Learning (stat.ML), Software, Machine Learning (cs.LG)
Abstract: Meta-learning can be used to learn a good prior that facilitates quick learning; two popular approaches are MAML and the meta-learner LSTM. These two methods represent important and different approaches in meta-learning. In this work, we study the two and formally show that the meta-learner LSTM subsumes MAML, although MAML, which is in this sense less general, outperforms the other. We suggest the reason for this surprising performance gap is related to second-order gradients. We construct a new algorithm (named TURTLE) to gain more insight into the importance of second-order gradients. TURTLE is simpler than the meta-learner LSTM yet more expressive than MAML and outperforms both techniques at few-shot sine wave regression and 50% of the tested image classification settings (without any additional hyperparameter tuning) and is competitive otherwise, at a computational cost that is comparable to second-order MAML. We find that second-order gradients also significantly increase the accuracy of the meta-learner LSTM. When MAML was introduced, one of its remarkable features was the use of second-order gradients. Subsequent work focused on cheaper first-order approximations. On the basis of our findings, we argue for more attention for second-order gradients.
Published: 2022

2. Fast and Informative Model Selection using Learning Curve Cross-Validation

Author: Felix Mohr and Jan N. van Rijn
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computational Theory and Mathematics, Artificial Intelligence, Applied Mathematics, Computer Vision and Pattern Recognition, Software, Machine Learning (cs.LG)
Abstract: Common cross-validation (CV) methods like k-fold cross-validation or Monte-Carlo cross-validation estimate the predictive performance of a learner by repeatedly training it on a large portion of the given data and testing on the remaining data. These techniques have two major drawbacks. First, they can be unnecessarily slow on large datasets. Second, beyond an estimation of the final performance, they give almost no insights into the learning process of the validated algorithm. In this paper, we present a new approach for validation based on learning curves (LCCV). Instead of creating train-test splits with a large portion of training data, LCCV iteratively increases the number of instances used for training. In the context of model selection, it discards models that are very unlikely to become competitive. We run a large scale experiment on the 67 datasets from the AutoML benchmark and empirically show that in over 90% of the cases using LCCV leads to similar performance (at most 1.5% difference) as using 5/10-fold CV. However, it yields substantial runtime reductions of over 20% on average. Additionally, it provides important insights, which for example allow assessing the benefits of acquiring more data. These results are orthogonal to other advances in the field of AutoML.
Published: 2021

3. Artificial Intelligence and Machine Learning

Author: Mitra Baratchi, Lu Cao, Frank W. Takes, Walter A. Kosters, Jefrey Lijffijt, and Jan N. van Rijn
Subjects: Coronavirus disease 2019 (COVID-19), business.industry, Computer science, Volume (computing), Artificial intelligence, business, Machine learning, computer.software_genre, computer, Game theory, Selection (genetic algorithm)
Abstract: This book contains a selection of the best papers of the 32nd Benelux Conference on Artificial Intelligence, BNAIC/Benelearn 2020, held in Leiden, The Netherlands, in November 2020. Due to the COVID-19 pandemic the conference was held online. The 12 papers presented in this volume were carefully reviewed and selected from 41 regular submissions. They address various aspects of artificial intelligence such as natural language processing, agent technology, game theory, problem solving, machine learning, human-agent interaction, AI and education, and data analysis.
Published: 2021

4. Automatic Human-Like Detection of Code Smells

Author: Jan N. van Rijn, Chitsutha Soomlek, and Marcello M. Bonsangue
Subjects: Java, business.industry, Computer science, Code smell, Machine learning, computer.software_genre, Software quality, Software metric, Set (abstract data type), Metric (mathematics), Classifier (linguistics), Artificial intelligence, business, Heuristics, computer, computer.programming_language
Abstract: Many code smell detection techniques and tools have been proposed, mainly aiming to eliminate design flaws and improve software quality. Most of them are based on heuristics which rely on a set of software metrics and corresponding threshold values. Those techniques and tools suffer from subjectivity issues, discordant results among the tools, and the reliability of the thresholds. To mitigate these problems, we used machine learning to automate developers’ perception in code smells detection. Different from other existing machine learning used in code smell detection we trained our models with an extensive dataset based on more than 3000 professional reviews on 518 open source projects. We conclude by an empirical evaluation of the performance of the machine learning approach against PMD, a widely used metric-based code smell detection tool for Java. The experimental results show that the machine learning approach outperforms the PMD classifier in all evaluations.
Published: 2021

5. Eating Sound Dataset for 20 Food Types and Sound Classification Using Convolutional Neural Networks

Author: Jan N. van Rijn, Marcello A. Gómez Maureira, and Jeannette Shijie Ma
Subjects: Protocol (science), Artificial neural network, business.industry, Computer science, Heuristic (computer science), Sound classification, Machine learning, computer.software_genre, Convolutional neural network, Task (project management), Data-driven, Binary classification, Artificial intelligence, business, computer
Abstract: Food identification technology potentially benefits both food and media industries, and can enrich human-computer interaction. We assembled a food classification dataset consisting of 11,141 clips, based on YouTube videos of 20 food types. This dataset is freely available on Kaggle. We suggest the grouped holdout evaluation protocol as evaluation method to assess model performance. As a first approach, we applied Convolutional Neural Networks on this dataset. When applying an evaluation protocol based on grouped holdout, the model obtained an accuracy of 18.5%, whereas when applying an evaluation protocol based on uniform holdout, the model obtained an accuracy of 37.58%. When approaching this as a binary classification task, the model performed well for most pairs. In both settings, the method clearly outperformed reasonable baselines. We found that besides texture properties, eating action differences are important consideration for data driven eating sound researches. Protocols based on biting sound are limited to textural classification and less heuristic while assembling food differences.
Published: 2020

6. A Survey of Deep Meta-Learning

Author: Mike Huisman, Jan N. van Rijn, and Aske Plaat
Subjects: FOS: Computer and information sciences, Linguistics and Language, Computer Science - Machine Learning, Meta learning (computer science), Computer science, Computer Science - Artificial Intelligence, Few-shot learning, Machine Learning (stat.ML), 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, Language and Linguistics, Field (computer science), Bridge (nautical), Machine Learning (cs.LG), Reduction (complexity), Meta-learning, Artificial Intelligence, Statistics - Machine Learning, 0202 electrical engineering, electronic engineering, information engineering, 0105 earth and related environmental sciences, Deep learning, Learning to learn, Data science, Transfer learning, Artificial Intelligence (cs.AI), Work (electrical), Key (cryptography), Deep neural networks, 020201 artificial intelligence & image processing
Abstract: Deep neural networks can achieve great successes when presented with large data sets and sufficient computational resources. However, their ability to learn new concepts quickly is limited. Meta-learning is one approach to address this issue, by enabling the network to learn how to learn. The field of Deep Meta-Learning advances at great speed, but lacks a unified, in-depth overview of current techniques. With this work, we aim to bridge this gap. After providing the reader with a theoretical foundation, we investigate and summarize key methods, which are categorized into i)~metric-, ii)~model-, and iii)~optimization-based techniques. In addition, we identify the main open challenges, such as performance evaluations on heterogeneous benchmarks, and reduction of the computational costs of meta-learning., Published in the AI Review (AIRE) Journal (2021)
Published: 2020

7. Multi-task learning with a natural metric for quantitative structure activity relationship learning

Author: Jérémy Besnard, Joaquin Vanschoren, Crina Grosan, Jan N. van Rijn, Ross D. King, Noureddin Sadawi, Ivan Olier, Larisa N. Soldatova, G. Richard J. Bickerton, Data Mining, Soldatova, Larisa [0000-0001-6489-3029], and Apollo - University of Cambridge Repository
Subjects: Quantitative structure–activity relationship, Computer science, media_common.quotation_subject, education, multi-task learning, Multi-task learning, Sequence-based similarity, quantitative structure activity relationship, 02 engineering and technology, Library and Information Sciences, Machine learning, computer.software_genre, sequence-based similarity, lcsh:Chemistry, 03 medical and health sciences, 0202 electrical engineering, electronic engineering, information engineering, Feature (machine learning), Physical and Theoretical Chemistry, QA, Function (engineering), 030304 developmental biology, media_common, 0303 health sciences, lcsh:T58.5-58.64, lcsh:Information technology, business.industry, chEMBL, Computer Graphics and Computer-Aided Design, Computer Science Applications, Random forest, Drug activity, lcsh:QD1-999, Quantitative structure activity relationship, Metric (mathematics), 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, random forest, psychological phenomena and processes, Research Article
Abstract: © The Author(s) 2019. The goal of quantitative structure activity relationship (QSAR) learning is to learn a function that, given the structure of a small molecule (a potential drug), outputs the predicted activity of the compound. We employed multi-task learning (MTL) to exploit commonalities in drug targets and assays. We used datasets containing curated records about the activity of specific compounds on drug targets provided by ChEMBL. Totally, 1091 assays have been analysed. As a baseline, a single task learning approach that trains random forest to predict drug activity for each drug target individually was considered. We then carried out feature-based and instance-based MTL to predict drug activities. We introduced a natural metric of evolutionary distance between drug targets as a measure of tasks relatedness. Instance-based MTL significantly outperformed both, feature-based MTL and the base learner, on 741 drug targets out of 1091. Feature-based MTL won on 179 occasions and the base learner performed best on 171 drug targets. We conclude that MTL QSAR is improved by incorporating the evolutionary distance between targets. These results indicate that QSAR learning can be performed effectively, even if little data is available for specific drug targets, by leveraging what is known about similar drug targets. This research was funded by the Engineering and Physical Sciences Research Council (EPSRC) grant EP/K030469/1. NS would like to thank the EU PhenoM-eNal project (Horizon 2020, 654241)
Published: 2020
Full Text: View/download PDF

8. Speeding up algorithm selection using average ranking and active testing by introducing runtime

Author: Jan N. van Rijn, Joaquin Vanschoren, Salisu Mamman Abdulrahman, Pavel Brazdil, and Data Mining
Subjects: Meta learning (computer science), Computer science, 02 engineering and technology, Interval (mathematics), Machine learning, computer.software_genre, Loss curves, Ranking of algorithms, Active testing, Algorithm Selection, Set (abstract data type), Algorithm selection, Software, Meta-learning, Artificial Intelligence, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Measure (data warehouse), business.industry, Function (mathematics), Ranking, Mean interval loss, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Average ranking
Abstract: Algorithm selection methods can be speeded-up substantially by incorporating multi-objective measures that give preference to algorithms that are both promising and fast to evaluate. In this paper, we introduce such a measure, A3R, and incorporate it into two algorithm selection techniques: average ranking and active testing. Average ranking combines algorithm rankings observed on prior datasets to identify the best algorithms for a new dataset. The aim of the second method is to iteratively select algorithms to be tested on the new dataset, learning from each new evaluation to intelligently select the next best candidate. We show how both methods can be upgraded to incorporate a multi-objective measure A3R that combines accuracy and runtime. It is necessary to establish the correct balance between accuracy and runtime, as otherwise time will be wasted by conducting less informative tests. The correct balance can be set by an appropriate parameter setting within function A3R that trades off accuracy and runtime. Our results demonstrate that the upgraded versions of Average Ranking and Active Testing lead to much better mean interval loss values than their accuracy-based counterparts.
Published: 2017

9. The Algorithm Selection Competitions 2015 and 2017

Author: Marius Lindauer, Jan N. van Rijn, and Lars Kotthoff
Subjects: FOS: Computer and information sciences, Linguistics and Language, Meta learning (computer science), Computer science, business.industry, Competition Analysis, Computer Science - Artificial Intelligence, Algorithm Selection, 02 engineering and technology, Machine learning, computer.software_genre, Language and Linguistics, Artificial Intelligence (cs.AI), Artificial Intelligence, 020204 information systems, Complementarity (molecular biology), Meta-Learning, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, State (computer science), business, computer
Abstract: The algorithm selection problem is to choose the most suitable algorithm for solving a given problem instance. It leverages the complementarity between different approaches that is present in many areas of AI. We report on the state of the art in algorithm selection, as defined by the Algorithm Selection competitions in 2015 and 2017. The results of these competitions show how the state of the art improved over the years. We show that although performance in some cases is very good, there is still room for improvement in other cases. Finally, we provide insights into why some scenarios are hard, and pose challenges to the community on how to advance the current state of the art.
Published: 2018

10. The online performance estimation framework: heterogeneous ensemble learning for data streams

Author: Joaquin Vanschoren, Jan N. van Rijn, Geoffrey Holmes, Bernhard Pfahringer, and Data Mining
Subjects: Computer science, Performance estimation, media_common.quotation_subject, 02 engineering and technology, Machine learning, computer.software_genre, Ensembles of classifiers, Meta-learning, Artificial Intelligence, 020204 information systems, Voting, 0202 electrical engineering, electronic engineering, information engineering, Ensembles, media_common, Training set, business.industry, Data stream mining, Dynamic data, Ensemble learning, ComputingMethodologies_PATTERNRECOGNITION, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Classifier (UML), Software, Data streams
Abstract: Ensembles of classifiers are among the best performing classifiers available in many data mining applications, including the mining of data streams. Rather than training one classifier, multiple classifiers are trained, and their predictions are combined according to a given voting schedule. An important prerequisite for ensembles to be successful is that the individual models are diverse. One way to vastly increase the diversity among the models is to build an heterogeneous ensemble, comprised of fundamentally different model types. However, most ensembles developed specifically for the dynamic data stream setting rely on only one type of base-level classifier, most often Hoeffding Trees. We study the use of heterogeneous ensembles for data streams. We introduce the Online Performance Estimation framework, which dynamically weights the votes of individual classifiers in an ensemble. Using an internal evaluation on recent training data, it measures how well ensemble members performed on this and dynamically updates their weights. Experiments over a wide range of data streams show performance that is competitive with state of the art ensemble techniques, including Online Bagging and Leveraging Bagging, while being significantly faster. All experimental results from this work are easily reproducible and publicly available online.
Published: 2018

11. Don’t Rule Out Simple Models Prematurely: A Large Scale Benchmark Comparing Linear and Non-linear Classifiers in OpenML

Author: Benjamin Strang, Peter van der Putten, Frank Hutter, and Jan N. van Rijn
Subjects: Meta learning (computer science), Artificial neural network, Scale (ratio), business.industry, Computer science, 020209 energy, Decision tree, 02 engineering and technology, Machine learning, computer.software_genre, Support vector machine, Task (computing), Nonlinear system, 0202 electrical engineering, electronic engineering, information engineering, Benchmark (computing), Artificial intelligence, business, computer
Abstract: A basic step for each data-mining or machine learning task is to determine which model to choose based on the problem and the data at hand. In this paper we investigate when non-linear classifiers outperform linear classifiers by means of a large scale experiment. We benchmark linear and non-linear versions of three types of classifiers (support vector machines; neural networks; and decision trees), and analyze the results to determine on what type of datasets the non-linear version performs better. To the best of our knowledge, this work is the first principled and large scale attempt to support the common assumption that non-linear classifiers excel only when large amounts of data are available.
Published: 2018

12. Hyperparameter Importance Across Datasets

Author: Jan N. van Rijn and Frank Hutter
Subjects: FOS: Computer and information sciences, Meta learning (computer science), Computer science, Machine Learning (stat.ML), 02 engineering and technology, 010501 environmental sciences, Machine learning, computer.software_genre, 01 natural sciences, Machine Learning (cs.LG), Statistics - Machine Learning, Prior probability, 0202 electrical engineering, electronic engineering, information engineering, AdaBoost, 0105 earth and related environmental sciences, Hyperparameter, business.industry, Random forest, Support vector machine, Computer Science - Learning, Hyperparameter optimization, 020201 artificial intelligence & image processing, Algorithm design, Artificial intelligence, business, computer
Abstract: With the advent of automated machine learning, automated hyperparameter optimization methods are by now routinely used in data mining. However, this progress is not yet matched by equal progress on automatic analyses that yield information beyond performance-optimizing hyperparameter settings. In this work, we aim to answer the following two questions: Given an algorithm, what are generally its most important hyperparameters, and what are typically good values for these? We present methodology and a framework to answer these questions based on meta-learning across many datasets. We apply this methodology using the experimental meta-data available on OpenML to determine the most important hyperparameters of support vector machines, random forests and Adaboost, and to infer priors for all their hyperparameters. The results, obtained fully automatically, provide a quantitative basis to focus efforts in both manual algorithm design and in automated hyperparameter optimization. The conducted experiments confirm that the hyperparameters selected by the proposed method are indeed the most important ones and that the obtained priors also lead to statistically significant improvements in hyperparameter optimization., Comment: \c{opyright} 2018. Copyright is held by the owner/author(s). Publication rights licensed to ACM. This is the author's version of the work. It is posted here for your personal use, not for redistribution. The definitive Version of Record was published in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
Published: 2017
Full Text: View/download PDF

13. Does Feature Selection Improve Classification? A Large Scale Experiment in OpenML

Author: Peter van der Putten, Jan N. van Rijn, and Martijn J. Post
Subjects: Scale (ratio), Meta learning (computer science), Computer science, business.industry, Pattern recognition, Feature selection, 02 engineering and technology, Machine learning, computer.software_genre, 01 natural sciences, Task (project management), 010104 statistics & probability, Statistical classification, Factor (programming language), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, 0101 mathematics, business, computer, computer.programming_language, Complement (set theory)
Abstract: It is often claimed that data pre-processing is an important factor contributing towards the performance of classification algorithms. In this paper we investigate feature selection, a common data pre-processing technique. We conduct a large scale experiment and present results on what algorithms and data sets benefit from this technique. Using meta-learning we can find out for which combinations this is the case. To complement a large set of meta-features, we introduce the Feature Selection Landmarkers, which prove useful for this task. All our experimental results are made publicly available on OpenML.
Published: 2016

14. OpenML: networked science in machine learning

Author: Luís Torgo, Bernd Bischl, Joaquin Vanschoren, and Jan N. van Rijn
Subjects: Structure (mathematical logic), FOS: Computer and information sciences, Computer science, business.industry, Geography, Planning and Development, 02 engineering and technology, Machine learning, computer.software_genre, Machine Learning (cs.LG), Computer Science - Learning, Computer Science - Computers and Society, Work (electrical), 020204 information systems, Computers and Society (cs.CY), 0202 electrical engineering, electronic engineering, information engineering, Citizen science, General Earth and Planetary Sciences, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Water Science and Technology
Abstract: Many sciences have made significant breakthroughs by adopting online tools that help organize, structure and mine information that is too detailed to be printed in journals. In this paper, we introduce OpenML, a place for machine learning researchers to share and organize data in fine detail, so that they can work more effectively, be more visible, and collaborate with others to tackle harder problems. We discuss how OpenML relates to other examples of networked science and what benefits it brings for machine learning research, individual scientists, as well as students and practitioners., Comment: 12 pages, 10 figures
Published: 2014
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

14 results on '"Jan N. van Rijn"'

1. Stateless neural meta-learning using second-order gradients

2. Fast and Informative Model Selection using Learning Curve Cross-Validation

3. Artificial Intelligence and Machine Learning

4. Automatic Human-Like Detection of Code Smells

5. Eating Sound Dataset for 20 Food Types and Sound Classification Using Convolutional Neural Networks

6. A Survey of Deep Meta-Learning

7. Multi-task learning with a natural metric for quantitative structure activity relationship learning

8. Speeding up algorithm selection using average ranking and active testing by introducing runtime

9. The Algorithm Selection Competitions 2015 and 2017

10. The online performance estimation framework: heterogeneous ensemble learning for data streams

11. Don’t Rule Out Simple Models Prematurely: A Large Scale Benchmark Comparing Linear and Non-linear Classifiers in OpenML

12. Hyperparameter Importance Across Datasets

13. Does Feature Selection Improve Classification? A Large Scale Experiment in OpenML

14. OpenML: networked science in machine learning

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Category

Publication Type

Database

Publisher

14 results on '"Jan N. van Rijn"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources