Back to Search Start Over

Comparative Analysis of Generic and Fine-Tuned Large Language Models for Conversational Agent Systems.

Authors :
Villa, Laura
Carneros-Prado, David
Dobrescu, Cosmin C.
Sánchez-Miguel, Adrián
Cubero, Guillermo
Hervás, Ramón
Source :
Robotics; May2024, Vol. 13 Issue 5, p68, 20p
Publication Year :
2024

Abstract

In the rapidly evolving domain of conversational agents, the integration of Large Language Models (LLMs) into Chatbot Development Platforms (CDPs) is a significant innovation. This study compares the efficacy of employing generic and fine-tuned GPT-3.5-turbo models for designing dialog flows, focusing on the intent and entity recognition crucial for dynamic conversational interactions. Two distinct approaches are introduced: a generic GPT-based system (G-GPT) leveraging the pre-trained model with complex prompts for intent and entity detection, and a fine-tuned GPT-based system (FT-GPT) employing customized models for enhanced specificity and efficiency. The evaluation encompassed the systems' ability to accurately classify intents and recognize named entities, contrasting their adaptability, operational efficiency, and customization capabilities. The results revealed that, while the G-GPT system offers ease of deployment and versatility across various contexts, the FT-GPT system demonstrates superior precision, efficiency, and customization, although it requires initial training and dataset preparation. This research highlights the versatility of LLMs in enriching conversational features for talking assistants, from social robots to interactive chatbots. By tailoring these advanced models, the fluidity and responsiveness of conversational agents can be enhanced, making them more adaptable and effective in a variety of settings, from customer service to interactive learning environments. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
22186581
Volume :
13
Issue :
5
Database :
Complementary Index
Journal :
Robotics
Publication Type :
Academic Journal
Accession number :
177488496
Full Text :
https://doi.org/10.3390/robotics13050068