Back to Search Start Over

AlgoLabel: A Large Dataset for Multi-Label Classification of Algorithmic Challenges.

Authors :
Iacob, Radu Cristian Alexandru
Monea, Vlad Cristian
Rădulescu, Dan
Ceapă, Andrei-Florin
Rebedea, Traian
Trăușan-Matu, Ștefan
Source :
Mathematics (2227-7390). Nov2020, Vol. 8 Issue 11, p1995-1995. 1p.
Publication Year :
2020

Abstract

While semantic parsing has been an important problem in natural language processing for decades, recent years have seen a wide interest in automatic generation of code from text. We propose an alternative problem to code generation: labelling the algorithmic solution for programming challenges. While this may seem an easier task, we highlight that current deep learning techniques are still far from offering a reliable solution. The contributions of the paper are twofold. First, we propose a large multi-modal dataset of text and code pairs consisting of algorithmic challenges and their solutions, called AlgoLabel. Second, we show that vanilla deep learning solutions need to be greatly improved to solve this task and we propose a dual text-code neural model for detecting the algorithmic solution type for a programming challenge. While the proposed text-code model increases the performance of using the text or code alone, the improvement is rather small highlighting that we require better methods to combine text and code features. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
22277390
Volume :
8
Issue :
11
Database :
Academic Search Index
Journal :
Mathematics (2227-7390)
Publication Type :
Academic Journal
Accession number :
147273952
Full Text :
https://doi.org/10.3390/math8111995