Back to Search Start Over

RelevAI-Reviewer: A Benchmark on AI Reviewers for Survey Paper Relevance

Authors :
Couto, Paulo Henrique
Ho, Quang Phuoc
Kumari, Nageeta
Rachmat, Benedictus Kent
Khuong, Thanh Gia Hieu
Ullah, Ihsan
Sun-Hosoya, Lisheng
Source :
Conf{\'e}rence sur l'Apprentissage Automatique 2024, Jul 2024, Lille, France
Publication Year :
2024

Abstract

Recent advancements in Artificial Intelligence (AI), particularly the widespread adoption of Large Language Models (LLMs), have significantly enhanced text analysis capabilities. This technological evolution offers considerable promise for automating the review of scientific papers, a task traditionally managed through peer review by fellow researchers. Despite its critical role in maintaining research quality, the conventional peer-review process is often slow and subject to biases, potentially impeding the swift propagation of scientific knowledge. In this paper, we propose RelevAI-Reviewer, an automatic system that conceptualizes the task of survey paper review as a classification problem, aimed at assessing the relevance of a paper in relation to a specified prompt, analogous to a "call for papers". To address this, we introduce a novel dataset comprised of 25,164 instances. Each instance contains one prompt and four candidate papers, each varying in relevance to the prompt. The objective is to develop a machine learning (ML) model capable of determining the relevance of each paper and identifying the most pertinent one. We explore various baseline approaches, including traditional ML classifiers like Support Vector Machine (SVM) and advanced language models such as BERT. Preliminary findings indicate that the BERT-based end-to-end classifier surpasses other conventional ML methods in performance. We present this problem as a public challenge to foster engagement and interest in this area of research.

Details

Database :
arXiv
Journal :
Conf{\'e}rence sur l'Apprentissage Automatique 2024, Jul 2024, Lille, France
Publication Type :
Report
Accession number :
edsarx.2406.10294
Document Type :
Working Paper