Descriptor: "GENERATIVE pre-trained transformers" / Topic: chatbots - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"GENERATIVE pre-trained transformers"' showing total 109 results

Start Over Descriptor "GENERATIVE pre-trained transformers" Topic chatbots

109 results on '"GENERATIVE pre-trained transformers"'

1. ChatGPT compared to national guidelines for management of ovarian cancer: Did ChatGPT get it right? – A Memorial Sloan Kettering Cancer Center Team Ovary study.

Author: Finch, Lindsey, Broach, Vance, Feinberg, Jacqueline, Al-Niaimi, Ahmed, Abu-Rustum, Nadeem R., Zhou, Qin, Iasonos, Alexia, and Chi, Dennis S.
Subjects: *LANGUAGE models, *CHATGPT, *CHATBOTS, *ARTIFICIAL intelligence, *GENERATIVE pre-trained transformers
Abstract: We evaluated the performance of a chatbot compared to the National Comprehensive Cancer Network (NCCN) Guidelines for the management of ovarian cancer. Using NCCN Guidelines, we generated 10 questions and answers regarding management of ovarian cancer at a single point in time. Questions were thematically divided into risk factors, surgical management, medical management, and surveillance. We asked ChatGPT (GPT-4) to provide responses without prompting (unprompted GPT) and with prompt engineering (prompted GPT). Responses were blinded and evaluated for accuracy and completeness by 5 gynecologic oncologists. A score of 0 was defined as inaccurate, 1 as accurate and incomplete, and 2 as accurate and complete. Evaluations were compared among NCCN, unprompted GPT, and prompted GPT answers. Overall, 48% of responses from NCCN, 64% from unprompted GPT, and 66% from prompted GPT were accurate and complete. The percentage of accurate but incomplete responses was higher for NCCN vs GPT-4. The percentage of accurate and complete scores for questions regarding risk factors, surgical management, and surveillance was higher for GPT-4 vs NCCN; however, for questions regarding medical management, the percentage was lower for GPT-4 vs NCCN. Overall, 14% of responses from unprompted GPT, 12% from prompted GPT, and 10% from NCCN were inaccurate. GPT-4 provided accurate and complete responses at a single point in time to a limited set of questions regarding ovarian cancer, with best performance in areas of risk factors, surgical management, and surveillance. Occasional inaccuracies, however, should limit unsupervised use of chatbots at this time. • There is interest in expanding the use of chatbot technology in medicine. • A high percentage of responses from ChatGPT to questions about ovarian cancer were graded as accurate and complete. • ChatGPT had a higher percentage of inaccurate responses compared to the National Comprehensive Cancer Network Guidelines. • ChatGPT received the highest scores in response to questions regarding risk factors, surgical management, and surveillance. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

2. Using AI to write scholarly publications.

Author: Hosseini, Mohammad, Rasmussen, Lisa M., and Resnik, David B.
Subjects: GENERATIVE pre-trained transformers, LANGUAGE models, RESEARCH integrity, ARTIFICIAL intelligence, MEDICAL periodicals, CHATBOTS, NATURAL language processing
Abstract: Artificial intelligence (AI) natural language processing (NLP) systems have the potential to be widely used in scientific and scholarly publications. These systems convert unstructured text into structured text suitable for computation, allowing computers to interact with human language. While NLP systems can generate coherent and informative text, there are concerns about accuracy, bias, relevance, and reasoning. They may produce erroneous or misleading information and reflect biases in the data they are trained on. Therefore, any text generated by an NLP system should be checked by a domain expert for accuracy, bias, relevance, and reasoning. Researchers should also prioritize accountability, transparency, and data integrity when using NLP systems in research. The article calls for the adoption of policies on the use of AI in research and highlights ongoing ethical considerations in this field. [Extracted from the article]
Published: 2024
Full Text: View/download PDF

3. Exploring the Effectiveness of Advanced Chatbots in Educational Settings: A Mixed-Methods Study in Statistics.

Author: Navas, Gustavo, Navas-Reascos, Gustavo, Navas-Reascos, Gabriel E., and Proaño-Orellana, Julio
Subjects: GENERATIVE pre-trained transformers, NATURAL language processing, CHATGPT, ENGINEERING students, GROUNDED theory
Abstract: The Generative Pre-trained Transformer (GPT) is a highly advanced natural language processing model. This model can generate conversation-style responses to user input. The rapid rise of GPT has transformed academic domains, with studies exploring the potential of chatbots in education. This research investigates the effectiveness of ChatGPT 3.5, ChatGPT 4.0 by OpenAI, and Chatbot Bing by Microsoft in solving statistical exam-type problems in the educational setting. In addition to quantifying the errors made by these chatbots, this study seeks to understand the causes of these errors to provide recommendations. A mixed-methods approach was employed to achieve this goal, including quantitative and qualitative analyses (Grounded Theory with semi-structured interviews). The quantitative stage involves statistical problem-solving exercises for undergraduate engineering students, revealing error rates based on the reason for the error, statistical fields, sub-statistics fields, and exercise types. The quantitative analysis provided crucial information necessary to proceed with the qualitative study. The qualitative stage employs semi-structured interviews with selected chatbots; this includes confrontation between them that generates agreement, disagreement, and differing viewpoints. On some occasions, chatbots tend to maintain rigid positions, lacking the ability to adapt or acknowledge errors. This inflexibility may affect their effectiveness. The findings contribute to understanding the integration of AI tools in education, offering insights for future implementations and emphasizing the need for critical evaluation and responsible use. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

4. Revolutionizing teledermatology: Exploring the integration of artificial intelligence, including Generative Pre-trained Transformer chatbots for artificial intelligence-driven anamnesis, diagnosis, and treatment plans.

Author: Shapiro, Jonathan and Lyakhovitsky, Anna
Subjects: *GENERATIVE pre-trained transformers, *MEDICAL personnel, *CHATBOTS, *DATA privacy, *ARTIFICIAL intelligence
Abstract: The integration of teledermatology and artificial intelligence (AI) marks a significant advancement in dermatologic care. This study examines the synergistic interplay between these two domains, highlighting their collective impact on enhancing the accuracy, accessibility, and efficiency of teledermatologic services. Teledermatology expands dermatologic care to remote and underserved areas, and AI technologies show considerable potential in analyzing dermatologic images and performing various tasks involved in teledermatology consultations. Such integration facilitates rapid, precise diagnoses, personalized treatment plans, and data-driven insights. Our explorative study involved designing a GPT-based chatbot named "Dr. DermBot" and exploring its performance in a teledermatologic consultation process. The design phase focused on the chatbot's ability to conduct consultations autonomously. The subsequent testing phase assessed its performance against the backdrop of current teledermatologic practices, exploring the potential of AI and chatbots to simulate and potentially enhance teledermatologic health care. Our study demonstrates the promising future of combining teledermatology with AI. It also brings to light ethical and legal concerns, including the protection of patient data privacy and adherence to regulatory standards. The union of teledermatology and AI not only aims to enhance the precision of teledermatologic diagnoses but also broadens the accessibility of dermatologic services to previously underserved populations, benefiting patients, health care providers, and the overall health care system. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

5. Teaming Up with an AI: Exploring Human–AI Collaboration in a Writing Scenario with ChatGPT.

Author: Luther, Teresa, Kimmerle, Joachim, and Cress, Ulrike
Subjects: *GENERATIVE pre-trained transformers, *LANGUAGE models, *CHATGPT, *ARTIFICIAL intelligence, *PROHIBITION of alcohol, *CHATBOTS
Abstract: Recent advancements in artificial intelligence (AI) technologies, particularly in generative pre-trained transformer large language models, have significantly enhanced the capabilities of text-generative AI tools—a development that opens new avenues for human–AI collaboration across various domains. However, the dynamics of human interaction with AI-based chatbots, such as ChatGPT, remain largely unexplored. We observed and analyzed how people interact with ChatGPT in a collaborative writing setting to address this research gap. A total of 135 participants took part in this exploratory lab study, which consisted of engaging with ChatGPT to compose a text discussing the prohibition of alcohol in public in relation to a given statement on risky alcohol consumption. During the writing task, all screen activity was logged. In addition to the writing task, further insights on user behavior and experience were gained by applying questionnaires and conducting an additional short interview with a randomly selected subset of 18 participants. Our results reveal high satisfaction with ChatGPT regarding quality aspects, mainly cognitive rather than affect-based trust in ChatGPT's responses, and higher ratings on perceived competence than on warmth. Compared to other types of prompts, mostly content-related prompts for data, facts, and information were sent to ChatGPT. Mixed-method analysis showed that affinity for technology integration and current use of ChatGPT were positively associated with the frequency of complete text requests. Moreover, prompts for complete texts were associated with more copy–paste behavior. These first insights into co-writing with ChatGPT can inform future research on how successful human–AI collaborative writing can be designed. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

6. Optimizing biomedical information retrieval with a keyword frequency-driven prompt enhancement strategy.

Author: Aftab, Wasim, Apostolou, Zivkos, Bouazoune, Karim, and Straub, Tobias
Subjects: *LANGUAGE models, *NATURAL language processing, *GENERATIVE pre-trained transformers, *NATURAL languages, *INFORMATION retrieval, *CHATBOTS
Abstract: Background: Mining the vast pool of biomedical literature to extract accurate responses and relevant references is challenging due to the domain's interdisciplinary nature, specialized jargon, and continuous evolution. Early natural language processing (NLP) approaches often led to incorrect answers as they failed to comprehend the nuances of natural language. However, transformer models have significantly advanced the field by enabling the creation of large language models (LLMs), enhancing question-answering (QA) tasks. Despite these advances, current LLM-based solutions for specialized domains like biology and biomedicine still struggle to generate up-to-date responses while avoiding "hallucination" or generating plausible but factually incorrect responses. Results: Our work focuses on enhancing prompts using a retrieval-augmented architecture to guide LLMs in generating meaningful responses for biomedical QA tasks. We evaluated two approaches: one relying on text embedding and vector similarity in a high-dimensional space, and our proposed method, which uses explicit signals in user queries to extract meaningful contexts. For robust evaluation, we tested these methods on 50 specific and challenging questions from diverse biomedical topics, comparing their performance against a baseline model, BM25. Retrieval performance of our method was significantly better than others, achieving a median Precision@10 of 0.95, which indicates the fraction of the top 10 retrieved chunks that are relevant. We used GPT-4, OpenAI's most advanced LLM to maximize the answer quality and manually accessed LLM-generated responses. Our method achieved a median answer quality score of 2.5, surpassing both the baseline model and the text embedding-based approach. We developed a QA bot, WeiseEule (https://github.com/wasimaftab/WeiseEule-LocalHost), which utilizes these methods for comparative analysis and also offers advanced features for review writing and identifying relevant articles for citation. Conclusions: Our findings highlight the importance of prompt enhancement methods that utilize explicit signals in user queries over traditional text embedding-based approaches to improve LLM-generated responses for specialized queries in specialized domains such as biology and biomedicine. By providing users complete control over the information fed into the LLM, our approach addresses some of the major drawbacks of existing web-based chatbots and LLM-based QA systems, including hallucinations and the generation of irrelevant or outdated responses. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

7. Examining the Performance of ChatGPT 3.5 and Microsoft Copilot in Otolaryngology: A Comparative Study with Otolaryngologists' Evaluation.

Author: Mayo-Yáñez, Miguel, Lechien, Jerome R., Maria-Saibene, Alberto, Vaira, Luigi A., Maniaci, Antonino, and Chiesa-Estomba, Carlos M.
Subjects: *CHATGPT, *CHATBOTS, *GENERATIVE pre-trained transformers, *INTERNET access, *OTOLARYNGOLOGISTS
Abstract: To evaluate the response capabilities, in a public healthcare system otolaryngology job competition examination, of ChatGPT 3.5 and an internet-connected GPT-4 engine (Microsoft Copilot) with the real scores of otolaryngology specialists as the control group. In September 2023, 135 questions divided into theoretical and practical parts were input into ChatGPT 3.5 and an internet-connected GPT-4. The accuracy of AI responses was compared with the official results from otolaryngologists who took the exam, and statistical analysis was conducted using Stata 14.2. Copilot (GPT-4) outperformed ChatGPT 3.5. Copilot achieved a score of 88.5 points, while ChatGPT scored 60 points. Both AIs had discrepancies in their incorrect answers. Despite ChatGPT's proficiency, Copilot displayed superior performance, ranking as the second-best score among the 108 otolaryngologists who took the exam, while ChatGPT was placed 83rd. A chat powered by GPT-4 with internet access (Copilot) demonstrates superior performance in responding to multiple-choice medical questions compared to ChatGPT 3.5. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

8. A Chemical Engineer’s Introduction to ChatGPT.

Author: Dee, Sean J. and Faraji, Sahand
Subjects: CHATGPT, CHEMICAL engineering, ARTIFICIAL neural networks, CHEMICAL engineers, GENERATIVE pre-trained transformers, CHATBOTS, SHARED workspaces
Abstract: The article presents the discussion on use of AI-based chatbots like OpenAI's ChatGPT in the chemical engineering industry, highlighting both the potential benefits and the challenges of deploying such technologies. Topics include recent updates to ChatGPT, differences between paid and free versions; and case studies demonstrating its application in process safety.
Published: 2024

9. How can ChatGPT assist hospitality and tourism education?

Author: Zhang, Ke, Li, Xiaonan, Ye, Huiyue, Lin, Katsy J., Chen, Sirong, and Law, Rob
Subjects: *GENERATIVE artificial intelligence, *GENERATIVE pre-trained transformers, *CHATGPT, *CHATBOTS, *ARTIFICIAL intelligence
Abstract: Artificial intelligence (AI) poses new challenges and opportunities for hospitality and tourism (H&T) education. This study explores how ChatGPT, a chatbot powered by a generative pre-trained transformer model, can affect H&T curriculum design and delivery. Using secondary data from academic articles, news reports, and official documents, the paper applies curriculum theory and stakeholder theory to examine the competencies, expectations, and interests of H&T students, educators, and industry practitioners regarding ChatGPT. This study reveals that ChatGPT can enhance learning outcomes and experiences, improve educational equity and efficiency, and raise ethical, technical, and pedagogical issues for H&T education. This research note also offers insights and recommendations for H&T educators to integrate ChatGPT into their teaching practices and prepare students for research work and careers that incorporate generative AI. Moreover, the study contributes to the literature on H&T education by examining the role and impact of ChatGPT as an innovative and emerging technology. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

10. A chatbot based question and answer system for the auxiliary diagnosis of chronic diseases based on large language model.

Author: Zhang, Sainan and Song, Jisung
Subjects: *LANGUAGE models, *ARTIFICIAL intelligence, *QUESTION answering systems, *GENERATIVE pre-trained transformers, *DEEP learning, *CHATBOTS
Abstract: In recent years, artificial intelligence has made remarkable strides, improving various aspects of our daily lives. One notable application is in intelligent chatbots that use deep learning models. These systems have shown tremendous promise in the medical sector, enhancing healthcare quality, treatment efficiency, and cost-effectiveness. However, their role in aiding disease diagnosis, particularly chronic conditions, remains underexplored. Addressing this issue, this study employs large language models from the GPT series, in conjunction with deep learning techniques, to design and develop a diagnostic system targeted at chronic diseases. Specifically, performed transfer learning and fine-tuning on the GPT-2 model, enabling it to assist in accurately diagnosing 24 common chronic diseases. To provide a user-friendly interface and seamless interactive experience, we further developed a dialog-based interface, naming it Chat Ella. This system can make precise predictions for chronic diseases based on the symptoms described by users. Experimental results indicate that our model achieved an accuracy rate of 97.50% on the validation set, and an area under the curve (AUC) value reaching 99.91%. Moreover, conducted user satisfaction tests, which revealed that 68.7% of participants approved of Chat Ella, while 45.3% of participants found the system made daily medical consultations more convenient. It can rapidly and accurately assess a patient's condition based on the symptoms described and provide timely feedback, making it of significant value in the design of medical auxiliary products for household use. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

11. Comparing patient education tools for chronic pain medications: Artificial intelligence chatbot versus traditional patient information leaflets.

Author: Gondode, Prakash, Duggal, Sakshi, Garg, Neha, Sethupathy, Surrender, Asai, Omshubham, and Lohakare, Pooja
Subjects: *GENERATIVE pre-trained transformers, *ARTIFICIAL intelligence, *CHATBOTS, *DRUG information materials, *CHATGPT
Abstract: Background and Aims: Artificial intelligence (AI) chatbots like Conversational Generative Pre-trained Transformer (ChatGPT) have recently created much buzz, especially regarding patient education. Such informed patients understand and adhere to the management and get involved in shared decision making. The accuracy and understandability of the generated educational material are prime concerns. Thus, we compared ChatGPT with traditional patient information leaflets (PILs) about chronic pain medications. Methods: Patients' frequently asked questions were generated from PILs available on the official websites of the British Pain Society (BPS) and the Faculty of Pain Medicine. Eight blinded annexures were prepared for evaluation, consisting of traditional PILs from the BPS and AI-generated patient information materials structured similar to PILs by ChatGPT. The authors performed a comparative analysis to assess materials' readability, emotional tone, accuracy, actionability, and understandability. Readability was measured using Flesch Reading Ease (FRE), Gunning Fog Index (GFI), and Flesch-Kincaid Grade Level (FKGL). Sentiment analysis determined emotional tone. An expert panel evaluated accuracy and completeness. Actionability and understandability were assessed with the Patient Education Materials Assessment Tool. Results: Traditional PILs generally exhibited higher readability (P values < 0.05), with [mean (standard deviation)] FRE [62.25 (1.6) versus 48 (3.7)], GFI [11.85 (0.9) versus 13.65 (0.7)], and FKGL [8.33 (0.5) versus 10.23 (0.5)] but varied emotional tones, often negative, compared to more positive sentiments in ChatGPT-generated texts. Accuracy and completeness did not significantly differ between the two. Actionability and understandability scores were comparable. Conclusion: While AI chatbots offer efficient information delivery, ensuring accuracy and readability, patient-centeredness remains crucial. It is imperative to balance innovation with evidence-based practice. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

12. LES PERFORMANCES DES GRANDS MODÈLES DE LA FAMILLE DES TRANSFORMATEURS GÉNÉRATIFS PRÉENTRAÎNÉS EN SOUTIEN AUX SERVICES DE RÉFÉRENCE VIRTUELLE AUTOMATISÉS EN BIBLIOTHÈQUE.

Author: LÉGER-ROUSSEAU, Mireille
Subjects: *GENERATIVE pre-trained transformers, *LANGUAGE models, *LIBRARY reference services, *CHATBOTS, *CHATGPT
Abstract: This article reviews the existing literature on the use of conversational AI robots based on GPT language models in libraries, focusing on their viability, usefulness and performance in answering reference questions. The main objectives are to synthesize current research, evaluate the performance of GPT models and reflect on librarians' positioning regarding the integration of these technologies into library reference services. The methodology includes the analysis of 11 varied studies, with a search in various databases and a rigorous selection of relevant studies. The results reveal the promising performance of GPT models, but also highlight limitations such as their inability to understand nuances and complex questions. In addition, challenges persist in terms of intellectual property, confidentiality and reliability of responses. The studies include performance evaluations, narrative reviews, a systematic review and a case study on the implementation of a ChatGPT-based chatbot in a university library. These studies offer interesting insights into the potential benefits and challenges of successfully integrating ChatGPT into library services. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

13. A Comparative Sentiment Analysis of Greek Clinical Conversations Using BERT, RoBERTa, GPT-2, and XLNet.

Author: Chatzimina, Maria Evangelia, Papadaki, Helen A., Pontikoglou, Charalampos, and Tsiknakis, Manolis
Subjects: *SENTIMENT analysis, *GENERATIVE pre-trained transformers, *NATURAL language processing, *COMMUNICATION models, *HEMATOLOGIC malignancies, *CHATBOTS
Abstract: In addressing the critical role of emotional context in patient–clinician conversations, this study conducted a comprehensive sentiment analysis using BERT, RoBERTa, GPT-2, and XLNet. Our dataset includes 185 h of Greek conversations focused on hematologic malignancies. The methodology involved data collection, data annotation, model training, and performance evaluation using metrics such as accuracy, precision, recall, F1-score, and specificity. BERT outperformed the other methods across all sentiment categories, demonstrating its effectiveness in capturing the emotional context in clinical interactions. RoBERTa showed a strong performance, particularly in identifying neutral sentiments. GPT-2 showed promising results in neutral sentiments but exhibited a lower precision and recall for negatives. XLNet showed a moderate performance, with variations across categories. Overall, our findings highlight the complexities of sentiment analysis in clinical contexts, especially in underrepresented languages like Greek. These insights highlight the potential of advanced deep-learning models in enhancing communication and patient care in healthcare settings. The integration of sentiment analysis in healthcare could provide insights into the emotional states of patients, resulting in more effective and empathetic patient support. Our study aims to address the gap and limitations of sentiment analysis in a Greek clinical context, an area where resources are scarce and its application remains underexplored. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

14. The Personification of ChatGPT (GPT-4)—Understanding Its Personality and Adaptability.

Author: Stöckli, Leandro, Joho, Luca, Lehner, Felix, and Hanne, Thomas
Subjects: *ADAPTABILITY (Personality), *CHATGPT, *GENERATIVE pre-trained transformers, *PERSONALITY tests, *ARTIFICIAL intelligence, *CHATBOTS
Abstract: Thanks to the publication of ChatGPT, Artificial Intelligence is now basically accessible and usable to all internet users. The technology behind it can be used in many chatbots, whereby the chatbots should be trained for the respective area of application. Depending on the application, the chatbot should react differently and thus, for example, also take on and embody personality traits to be able to help and answer people better and more personally. This raises the question of whether ChatGPT-4 is able to embody personality traits. Our study investigated whether ChatGPT-4's personality can be analyzed using personality tests for humans. To test possible approaches to measuring the personality traits of ChatGPT-4, experiments were conducted with two of the most well-known personality tests: the Big Five and Myers–Briggs. The experiments also examine whether and how personality can be changed by user input and what influence this has on the results of the personality tests. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

15. ChatGPT‐4 performance in rhinology: A clinical case series.

Author: Radulesco, Thomas, Saibene, Alberto Maria, Michel, Justin, Vaira, Luigi Angelo, and Lechien, Jérôme R.
Subjects: *CHATGPT, *GENERATIVE pre-trained transformers, *NOSE, *CHATBOTS
Abstract: Key points: Chatbot Generative Pre‐trained Transformer (ChatGPT)‐4 indicated more than twice additional examinations than practitioners in the management of clinical cases in rhinology.The consistency between ChatGPT‐4 and practitioner in the indication of additional examinations may significantly vary from one examination to another.The ChatGPT‐4 proposed a plausible and correct primary diagnosis in 62.5% cases, while pertinent and necessary additional examinations and therapeutic regimen were indicated in 7.5%–30.0% and 7.5%–32.5% of cases, respectively.The stability of ChatGPT‐4 responses is moderate‐to‐high. The performance of ChatGPT‐4 was not influenced by the human‐reported level of difficulty of clinical cases. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

16. Prompt Sapper: A LLM-Empowered Production Tool for Building AI Chains.

Author: Cheng, Yu, Chen, Jieshan, Huang, Qing, Xing, Zhenchang, Xu, Xiwei, and Lu, Qinghua
Subjects: LANGUAGE models, SOFTWARE engineering, CHATBOTS, ARTIFICIAL intelligence, CHATGPT, GENERATIVE pre-trained transformers, REQUIREMENTS engineering
Abstract: The emergence of foundation models, such as large language models (LLMs) GPT-4 and text-to-image models DALL-E, has opened up numerous possibilities across various domains. People can now use natural language (i.e., prompts) to communicate with AI to perform tasks. While people can use foundation models through chatbots (e.g., ChatGPT), chat, regardless of the capabilities of the underlying models, is not a production tool for building reusable AI services. APIs like LangChain allow for LLM-based application development but require substantial programming knowledge, thus posing a barrier. To mitigate this, we systematically review, summarise, refine and extend the concept of AI chain by incorporating the best principles and practices that have been accumulated in software engineering for decades into AI chain engineering, to systematize AI chain engineering methodology. We also develop a no-code integrated development environment, Prompt Sapper , which embodies these AI chain engineering principles and patterns naturally in the process of building AI chains, thereby improving the performance and quality of AI chains. With Prompt Sapper, AI chain engineers can compose prompt-based AI services on top of foundation models through chat-based requirement analysis and visual programming. Our user study evaluated and demonstrated the efficiency and correctness of Prompt Sapper. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

17. Why You MUST Use AI.

Author: Irvine, Robert
Subjects: COMPUTER passwords, ARTIFICIAL intelligence, CHATBOTS, GENERATIVE pre-trained transformers, COMPUTER vision, GEMINI (Chatbot), DIGITAL music
Published: 2024

18. A retrieval-augmented chatbot based on GPT-4 provides appropriate differential diagnosis in gastrointestinal radiology: a proof of concept study.

Author: Rau, Stephan, Rau, Alexander, Nattenmüller, Johanna, Fink, Anna, Bamberg, Fabian, Reisert, Marco, and Russe, Maximilian F.
Subjects: CHATBOTS, GENERATIVE pre-trained transformers, DECISION support systems, DIFFERENTIAL diagnosis, ARTIFICIAL intelligence
Abstract: Background: We investigated the potential of an imaging-aware GPT-4-based chatbot in providing diagnoses based on imaging descriptions of abdominal pathologies. Methods: Utilizing zero-shot learning via the LlamaIndex framework, GPT-4 was enhanced using the 96 documents from the Radiographics Top 10 Reading List on gastrointestinal imaging, creating a gastrointestinal imaging-aware chatbot (GIA-CB). To assess its diagnostic capability, 50 cases on a variety of abdominal pathologies were created, comprising radiological findings in fluoroscopy, MRI, and CT. We compared the GIA-CB to the generic GPT-4 chatbot (g-CB) in providing the primary and 2 additional differential diagnoses, using interpretations from senior-level radiologists as ground truth. The trustworthiness of the GIA-CB was evaluated by investigating the source documents as provided by the knowledge-retrieval mechanism. Mann–Whitney U test was employed. Results: The GIA-CB demonstrated a high capability to identify the most appropriate differential diagnosis in 39/50 cases (78%), significantly surpassing the g-CB in 27/50 cases (54%) (p = 0.006). Notably, the GIA-CB offered the primary differential in the top 3 differential diagnoses in 45/50 cases (90%) versus g-CB with 37/50 cases (74%) (p = 0.022) and always with appropriate explanations. The median response time was 29.8 s for GIA-CB and 15.7 s for g-CB, and the mean cost per case was $0.15 and $0.02, respectively. Conclusions: The GIA-CB not only provided an accurate diagnosis for gastrointestinal pathologies, but also direct access to source documents, providing insight into the decision-making process, a step towards trustworthy and explainable AI. Integrating context-specific data into AI models can support evidence-based clinical decision-making. Relevance statement: A context-aware GPT-4 chatbot demonstrates high accuracy in providing differential diagnoses based on imaging descriptions, surpassing the generic GPT-4. It provided formulated rationale and source excerpts supporting the diagnoses, thus enhancing trustworthy decision-support. Key points: • Knowledge retrieval enhances differential diagnoses in a gastrointestinal imaging-aware chatbot (GIA-CB). • GIA-CB outperformed the generic counterpart, providing formulated rationale and source excerpts. • GIA-CB has the potential to pave the way for AI-assisted decision support systems. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

19. Comparing and assessing four AI chatbots' competence in economics.

Author: Hultberg, Patrik T., Santandreu Calonge, David, Kamalov, Firuz, and Smail, Linda
Subjects: *CHATBOTS, *ARTIFICIAL intelligence, *BLOOM'S taxonomy, *COGNITIVE learning, *WORD frequency, *GENERATIVE pre-trained transformers
Abstract: Artificial Intelligence (AI) chatbots have emerged as powerful tools in modern academic endeavors, presenting both opportunities and challenges in the learning landscape. They can provide content information and analysis across most academic disciplines, but significant differences exist in terms of response accuracy for conclusions and explanations, as well as word counts. This study explores four distinct AI chatbots, GPT-3.5, GPT-4, Bard, and LLaMA 2, for accuracy of conclusions and quality of explanations in the context of university-level economics. Leveraging Bloom's taxonomy of cognitive learning complexity as a guiding framework, the study confronts the four AI chatbots with a standard test for university-level understanding of economics, as well as more advanced economics problems. The null hypothesis that all AI chatbots perform equally well on prompts that explore understanding of economics is rejected. The results are that significant differences are observed across the four AI chatbots, and these differences are exacerbated as the complexity of the economics-related prompts increased. These findings are relevant to both students and educators; students can choose the most appropriate chatbots to better understand economics concepts and thought processes, while educators can design their instruction and assessment while recognizing the support and resources students have access to through AI chatbot platforms. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

20. A decalogue for personalized travel health assistance with AI-driven chatbots.

Author: Baglivo, Francesco, Angelis, Luigi De, Cruschelli, Gianluca, and Rizzo, Caterina
Subjects: *TRAVEL hygiene, *CHATBOTS, *GENERATIVE artificial intelligence, *LANGUAGE models, *GENERATIVE pre-trained transformers
Abstract: The article explores the potential use of AI-driven chatbots in personalized travel health assistance. It introduces Custom GPTs, which allow users to create specialized AI models for specific purposes. The article proposes a set of desirable features for travel medicine chatbots, including a comprehensive knowledge base, up-to-date information, respect for professional medical advice, and data privacy. It also emphasizes the importance of user-friendly interfaces, multilingual support, geolocation services, and integration with electronic health records. The article provides an example of a custom GPT called the Italian Travel Medicine Advisor (ITMA) and discusses its adherence to the proposed features. The authors suggest that AI chatbots can be a valuable tool in travel medicine, providing initial engagement and information to travelers before seeking in-person medical advice. However, further research is needed to evaluate the impact and accuracy of AI-driven chatbots in travel medicine. [Extracted from the article]
Published: 2024
Full Text: View/download PDF

21. From task structures to world models: what do LLMs know?

Author: Yildirim, Ilker and Paul, L.A.
Subjects: *LANGUAGE models, *CHATBOTS, *GENERATIVE pre-trained transformers, *COGNITIVE science, *ARTIFICIAL intelligence
Abstract: OpenAI's GPT-4 and similar large language models (LLMs) show impressive conversational capabilities. This Opinion asks: in what sense does a LLM have knowledge? The answer to this question extends beyond the capabilities of a particular AI chatbot and challenges our assumptions about the nature of knowledge and intelligence. We answer by granting LLMs 'instrumental knowledge': knowledge defined by a certain set of abilities. How is such knowledge related to the more ordinary, 'worldly' knowledge exhibited by humans? To address this, we turn to a core concept in cognitive science, world models, and explore the degree to which instrumental knowledge might incorporate such structured representations. We discuss how LLMs could recover degrees of worldly knowledge and suggest that such recovery will be governed by an implicit, resource-rational tradeoff between world models and task demands. In what sense does a large language model (LLM) have knowledge? We answer by granting LLMs 'instrumental knowledge': knowledge gained by using next-word generation as an instrument. We then ask how instrumental knowledge is related to the ordinary, 'worldly knowledge' exhibited by humans, and explore this question in terms of the degree to which instrumental knowledge can be said to incorporate the structured world models of cognitive science. We discuss ways LLMs could recover degrees of worldly knowledge and suggest that such recovery will be governed by an implicit, resource-rational tradeoff between world models and tasks. Our answer to this question extends beyond the capabilities of a particular AI system and challenges assumptions about the nature of knowledge and intelligence. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

22. ChatGPT-4 accuracy for patient education in laryngopharyngeal reflux.

Author: Lechien, Jerome R., Carroll, Thomas L., Huston, Molly N., and Naunheim, Matthew R.
Subjects: *CHATGPT, *GENERATIVE pre-trained transformers, *PATIENT education, *LANGUAGE models, *CHATBOTS, *HYPOPHARYNGEAL cancer
Abstract: Introduction: Chatbot Generative Pre-trained Transformer (ChatGPT) is an artificial intelligence-powered language model chatbot able to help otolaryngologists in practice and research. The ability of ChatGPT in generating patient-centered information related to laryngopharyngeal reflux disease (LPRD) was evaluated. Methods: Twenty-five questions dedicated to definition, clinical presentation, diagnosis, and treatment of LPRD were developed from the Dubai definition and management of LPRD consensus and recent reviews. Questions about the four aforementioned categories were entered into ChatGPT-4. Four board-certified laryngologists evaluated the accuracy of ChatGPT-4 with a 5-point Likert scale. Interrater reliability was evaluated. Results: The mean scores (SD) of ChatGPT-4 answers for definition, clinical presentation, additional examination, and treatments were 4.13 (0.52), 4.50 (0.72), 3.75 (0.61), and 4.18 (0.47), respectively. Experts reported high interrater reliability for sub-scores (ICC = 0.973). The lowest performances of ChatGPT-4 were on answers about the most prevalent LPR signs, the most reliable objective tool for the diagnosis (hypopharyngeal-esophageal multichannel intraluminal impedance-pH monitoring (HEMII-pH)), and the criteria for the diagnosis of LPR using HEMII-pH. Conclusion: ChatGPT-4 may provide adequate information on the definition of LPR, differences compared to GERD (gastroesophageal reflux disease), and clinical presentation. Information provided upon extra-laryngeal manifestations and HEMII-pH may need further optimization. Regarding the recent trends identifying increasing patient use of internet sources for self-education, the findings of the present study may help draw attention to ChatGPT-4's accuracy on the topic of LPR. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

23. Who is making the decisions? How retail managers can use the power of ChatGPT.

Author: Kumar, Anuj, Gupta, Nimit, and Bapat, Gautam
Subjects: CHATGPT, GENERATIVE pre-trained transformers, DECISION making, ELECTRIC power consumption, STRESS (Linguistics), CUSTOMER experience, VIRTUAL reality, LANGUAGE & languages
Abstract: Purpose: This paper aims to explore ChatGPT's (generative pre-trained transformers) potential as a tool for retailers to improve customer experience and boost sales. While it provides benefits like personalized recommendations and 24/7 assistance, there are limitations, like difficulty in understanding unconventional language. The paper stresses careful integration to overcome these limitations and create a better customer experience. Additionally, it discusses the potential for further development and integration of ChatGPT in retail, such as generating product descriptions and virtual try-on experiences. Finally, the paper encourages retailers to embrace ChatGPT to meet their customer needs. Design/methodology/approach: Case-based methodology involves using specific cases or examples to explore a broader issue or phenomenon. Researchers have analysed real-world cases to identify patterns, themes and insights that can be applied to other contexts or situations. This was useful for understanding complex and multifaceted issues as it allowed us to delve deeper into specific examples and explore the nuances of the situation. Findings: While ChatGPT is a powerful tool for retailers, limitations such as difficulty in understanding non-standard accents and unconventional language can arise, causing customer frustration. Retail managers must integrate ChatGPT in a way that enhances customer experience. In the future, ChatGPT has the potential to generate product descriptions, provide virtual try-on experiences and integrate with augmented or virtual reality technology to offer more immersive experiences. Careful consideration and integration can help retailers overcome these limitations and offer personalized recommendations, round-the-clock assistance and an engaging customer experience that improves sales. Originality/value: The case topic is very much in a novel stage of research and writing. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

24. Assessing ChatGPT 4.0's test performance and clinical diagnostic accuracy on USMLE STEP 2 CK and clinical case reports.

Author: Shieh, Allen, Tran, Brandon, He, Gene, Kumar, Mudit, Freed, Jason A., and Majety, Priyanka
Subjects: *CHATGPT, *CHATBOTS, *GENERATIVE pre-trained transformers, *LANGUAGE models, *ARTIFICIAL intelligence
Abstract: While there is data assessing the test performance of artificial intelligence (AI) chatbots, including the Generative Pre-trained Transformer 4.0 (GPT 4) chatbot (ChatGPT 4.0), there is scarce data on its diagnostic accuracy of clinical cases. We assessed the large language model (LLM), ChatGPT 4.0, on its ability to answer questions from the United States Medical Licensing Exam (USMLE) Step 2, as well as its ability to generate a differential diagnosis based on corresponding clinical vignettes from published case reports. A total of 109 Step 2 Clinical Knowledge (CK) practice questions were inputted into both ChatGPT 3.5 and ChatGPT 4.0, asking ChatGPT to pick the correct answer. Compared to its previous version, ChatGPT 3.5, we found improved accuracy of ChatGPT 4.0 when answering these questions, from 47.7 to 87.2% (p = 0.035) respectively. Utilizing the topics tested on Step 2 CK questions, we additionally found 63 corresponding published case report vignettes and asked ChatGPT 4.0 to come up with its top three differential diagnosis. ChatGPT 4.0 accurately created a shortlist of differential diagnoses in 74.6% of the 63 case reports (74.6%). We analyzed ChatGPT 4.0's confidence in its diagnosis by asking it to rank its top three differentials from most to least likely. Out of the 47 correct diagnoses, 33 were the first (70.2%) on the differential diagnosis list, 11 were second (23.4%), and three were third (6.4%). Our study shows the continued iterative improvement in ChatGPT's ability to answer standardized USMLE questions accurately and provides insights into ChatGPT's clinical diagnostic accuracy. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

25. Performance Assessment of ChatGPT versus Bard in Detecting Alzheimer's Dementia.

Author: B.T, Balamurali and Chen, Jer-Ming
Subjects: *ALZHEIMER'S disease, *CHATGPT, *LANGUAGE models, *CHATBOTS, *GENERATIVE pre-trained transformers
Abstract: Large language models (LLMs) find increasing applications in many fields. Here, three LLM chatbots (ChatGPT-3.5, ChatGPT-4, and Bard) are assessed in their current form, as publicly available, for their ability to recognize Alzheimer's dementia (AD) and Cognitively Normal (CN) individuals using textual input derived from spontaneous speech recordings. A zero-shot learning approach is used at two levels of independent queries, with the second query (chain-of-thought prompting) eliciting more detailed information than the first. Each LLM chatbot's performance is evaluated on the prediction generated in terms of accuracy, sensitivity, specificity, precision, and F1 score. LLM chatbots generated a three-class outcome ("AD", "CN", or "Unsure"). When positively identifying AD, Bard produced the highest true-positives (89% recall) and highest F1 score (71%), but tended to misidentify CN as AD, with high confidence (low "Unsure" rates); for positively identifying CN, GPT-4 resulted in the highest true-negatives at 56% and highest F1 score (62%), adopting a diplomatic stance (moderate "Unsure" rates). Overall, the three LLM chatbots can identify AD vs. CN, surpassing chance-levels, but do not currently satisfy the requirements for clinical application. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

26. Embracing the future—is artificial intelligence already better? A comparative study of artificial intelligence performance in diagnostic accuracy and decision‐making.

Author: Fonseca, Ângelo, Ferreira, Axel, Ribeiro, Luís, Moreira, Sandra, and Duque, Cristina
Subjects: *ARTIFICIAL intelligence, *LANGUAGE models, *CHATBOTS, *CHATGPT, *GENERATIVE pre-trained transformers, *ADRENAL insufficiency
Abstract: Background and purpose: The integration of artificial intelligence (AI) in healthcare has the potential to revolutionize patient care and clinical decision‐making. This study aimed to explore the reliability of large language models in neurology by comparing the performance of an AI chatbot with neurologists in diagnostic accuracy and decision‐making. Methods: A cross‐sectional observational study was conducted. A pool of clinical cases from the American Academy of Neurology's Question of the Day application was used as the basis for the study. The AI chatbot used was ChatGPT, based on GPT‐3.5. The results were then compared to neurology peers who also answered the questions—a mean of 1500 neurologists/neurology residents. Results: The study included 188 questions across 22 different categories. The AI chatbot demonstrated a mean success rate of 71.3% in providing correct answers, with varying levels of proficiency across different neurology categories. Compared to neurology peers, the AI chatbot performed at a similar level, with a mean success rate of 69.2% amongst peers. Additionally, the AI chatbot achieved a correct diagnosis in 85.0% of cases and it provided an adequate justification for its correct responses in 96.1%. Conclusions: The study highlights the potential of AI, particularly large language models, in assisting with clinical reasoning and decision‐making in neurology and emphasizes the importance of AI as a complementary tool to human expertise. Future advancements and refinements are needed to enhance the AI chatbot's performance and broaden its application across various medical specialties. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

27. Accuracy of ChatGPT-3.5 and -4 in providing scientific references in otolaryngology–head and neck surgery.

Author: Lechien, Jerome R., Briganti, Giovanni, and Vaira, Luigi A.
Subjects: *CHATGPT, *GENERATIVE pre-trained transformers, *CHATBOTS, *LANGUAGE models, *ARTIFICIAL intelligence
Abstract: Introduction: Chatbot generative pre-trained transformer (ChatGPT) is a new artificial intelligence-powered language model of chatbot able to help otolaryngologists in practice and research. We investigated the accuracy of ChatGPT-3.5 and -4 in the referencing of manuscripts published in otolaryngology. Methods: ChatGPT-3.5 and ChatGPT-4 were interrogated for providing references of the top-30 most cited papers in otolaryngology in the past 40 years including clinical guidelines and key studies that changed the practice. The responses were regenerated three times to assess the accuracy and stability of ChatGPT. ChatGPT-3.5 and ChatGPT-4 were compared for accuracy of reference and potential mistakes. Results: The accuracy of ChatGPT-3.5 and ChatGPT-4.0 ranged from 47% to 60%, and 73% to 87%, respectively (p < 0.005). ChatGPT-3.5 provided 19 inaccurate references and invented 2 references throughout the regenerated questions. ChatGPT-4.0 provided 13 inaccurate references, while it proposed only one invented reference. The stability of responses throughout regenerated answers was mild (k = 0.238) and moderate (k = 0.408) for ChatGPT-3.5 and 4.0, respectively. Conclusions: ChatGPT-4.0 reported higher accuracy than the free-access version (3.5). False references were detected in both 3.5 and 4.0 versions. Practitioners need to be careful regarding the use of ChatGPT in the reach of some key reference when writing a report. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

28. Enhancing Child Safety in Online Gaming: The Development and Application of Protectbot, an AI-Powered Chatbot Framework.

Author: Faraz, Anum, Ahsan, Fardin, Mounsef, Jinane, Karamitsos, Ioannis, and Kanavos, Andreas
Subjects: *ARTIFICIAL intelligence, *CHATBOTS, *GENERATIVE pre-trained transformers, *INTERNET safety, *VIDEO games, *GAMES
Abstract: This study introduces Protectbot, an innovative chatbot framework designed to improve safety in children's online gaming environments. At its core, Protectbot incorporates DialoGPT, a conversational Artificial Intelligence (AI) model rooted in Generative Pre-trained Transformer 2 (GPT-2) technology, engineered to simulate human-like interactions within gaming chat rooms. The framework is distinguished by a robust text classification strategy, rigorously trained on the Publicly Available Natural 2012 (PAN12) dataset, aimed at identifying and mitigating potential sexual predatory behaviors through chat conversation analysis. By utilizing fastText for word embeddings to vectorize sentences, we have refined a support vector machine (SVM) classifier, achieving remarkable performance metrics, with recall, accuracy, and F-scores approaching 0.99. These metrics not only demonstrate the classifier's effectiveness, but also signify a significant advancement beyond existing methodologies in this field. The efficacy of our framework is additionally validated on a custom dataset, composed of 71 predatory chat logs from the Perverted Justice website, further establishing the reliability and robustness of our classifier. Protectbot represents a crucial innovation in enhancing child safety within online gaming communities, providing a proactive, AI-enhanced solution to detect and address predatory threats promptly. Our findings highlight the immense potential of AI-driven interventions to create safer digital spaces for young users. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

29. QUICK REVIEW OF PEDAGOGICAL EXPERIENCES USING GPT-3 IN EDUCATION.

Author: Manuel Prieto-Andreu, Joel and Labisa-Palmeira, Antonio
Subjects: GENERATIVE pre-trained transformers, LANGUAGE models, TEXT summarization, CHATBOTS, COMPUTER science, ARTIFICIAL intelligence
Abstract: GPT-3 is a neuronal language model that performs tasks such as classification, question-answering and text summarization. Although chatbots like BlenderBot-3 work well in a conversational sense, and GPT-3 can assist experts in evaluating questions, they are quantifiably worse than real teachers in several pedagogical dimensions. We present the first systematic literature review that analyzes the main contributions and uses of GPT-3 in the field of education. The protocols suggested in the PRISMA 2020 statement were followed for the drafting of the review. According to the results, 34 significant productions were identified through a systematic search in ISI Web of Science, SCOPUS and Google Scholar. GPT-3 has been considered in the academic, ethical and medical fields, in humanities and in computer science, in the formulation of questions and answers, and through cooperative educational dialogs. GPT-3 has been proven to have valuable applications in education, such as the automation of routine tasks, in making quick diagnoses of the students' weaknesses and in the automatic generation of questions, but it still faces challenges and limitations that require additional investigation. We discuss the educational possibilities and the limitations to the use of GPT-3. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

30. Sobre Bits y Colonialidades: Revitalización de categorías coloniales mediante inteligencia artificial en la representación de las minorías étnicas del Ecuador.

Author: Espín Carrión, Andrés
Subjects: CHATBOTS, GENERATIVE pre-trained transformers, ARTIFICIAL intelligence, MINORITIES, IMPERIALISM
Abstract: Copyright of Cuadernos del Centro de Estudios de Diseño y Comunicación is the property of Cuadernos del Centro de Estudios de Diseno y Comunicacion and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024

31. From Traditional Recommender Systems to GPT-Based Chatbots: A Survey of Recent Developments and Future Directions.

Author: Al-Hasan, Tamim Mahmud, Sayed, Aya Nabil, Bensaali, Faycal, Himeur, Yassine, Varlamis, Iraklis, and Dimitrakopoulos, George
Subjects: RECOMMENDER systems, CHATBOTS, NATURAL language processing, GENERATIVE pre-trained transformers, REINFORCEMENT learning, DEEP learning
Abstract: Recommender systems are a key technology for many applications, such as e-commerce, streaming media, and social media. Traditional recommender systems rely on collaborative filtering or content-based filtering to make recommendations. However, these approaches have limitations, such as the cold start and the data sparsity problem. This survey paper presents an in-depth analysis of the paradigm shift from conventional recommender systems to generative pre-trained-transformers-(GPT)-based chatbots. We highlight recent developments that leverage the power of GPT to create interactive and personalized conversational agents. By exploring natural language processing (NLP) and deep learning techniques, we investigate how GPT models can better understand user preferences and provide context-aware recommendations. The paper further evaluates the advantages and limitations of GPT-based recommender systems, comparing their performance with traditional methods. Additionally, we discuss potential future directions, including the role of reinforcement learning in refining the personalization aspect of these systems. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

32. The model student: GPT-4 performance on graduate biomedical science exams.

Author: Stribling, Daniel, Xia, Yuxing, Amer, Maha K., Graim, Kiley S., Mulligan, Connie J., and Renne, Rolf
Subjects: *GENERATIVE pre-trained transformers, *LANGUAGE models, *MEDICAL sciences, *CHATGPT, *CHATBOTS, *BRAILLE
Abstract: The GPT-4 large language model (LLM) and ChatGPT chatbot have emerged as accessible and capable tools for generating English-language text in a variety of formats. GPT-4 has previously performed well when applied to questions from multiple standardized examinations. However, further evaluation of trustworthiness and accuracy of GPT-4 responses across various knowledge domains is essential before its use as a reference resource. Here, we assess GPT-4 performance on nine graduate-level examinations in the biomedical sciences (seven blinded), finding that GPT-4 scores exceed the student average in seven of nine cases and exceed all student scores for four exams. GPT-4 performed very well on fill-in-the-blank, short-answer, and essay questions, and correctly answered several questions on figures sourced from published manuscripts. Conversely, GPT-4 performed poorly on questions with figures containing simulated data and those requiring a hand-drawn answer. Two GPT-4 answer-sets were flagged as plagiarism based on answer similarity and some model responses included detailed hallucinations. In addition to assessing GPT-4 performance, we discuss patterns and limitations in GPT-4 capabilities with the goal of informing design of future academic examinations in the chatbot era. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

33. Creativity and Innovation in Civic Spaces Supported by Cognitive Flexibility When Learning with AI Chatbots in Smart Cities.

Author: Chauncey, Sarah A. and McKenna, H. Patricia
Subjects: CHATBOTS, SMART cities, COGNITIVE flexibility, ARTIFICIAL intelligence, GENERATIVE pre-trained transformers, LANGUAGE models
Abstract: The purpose of this study is to advance conceptual understandings of the cognitive flexibility construct, in support of creativity and innovation in smart city civic spaces, employing the use of large language model artificial intelligence chatbots such as ChatGPT. Based on a review of the research and practice literature, this study formulates a conceptual framework for cognitive flexibility in support of creativity and innovation in AI environments, adaptable to smart cities. A research design is used that employs AI as a design material, in combination with a topical inquiry involving boundary setting and perspective taking, to co-pilot an exploration with ChatGPT-3.5/4. This study operationalizes the framework for applications to learning approaches, addressing flexibility and inclusivity in smart city spaces and regions. With the rapid evolving of chatbot technologies, ChatGPT-4 is used in the exploration of a speculative real-world urban example. This work is significant in that AI chatbots are explored for application in urban spaces involving creative ideation, iteration, engagement, and cognitive flexibility; future directions for exploration are identified pertaining to ethical and civil discourse in smart cities and learning cities, as well as the notion that AI chatbots and GPTs (generative pre-trained transformers) may become a zeitgeist for understanding and learning in smart cities. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

34. Artificial intelligence falls for scams.

Author: Stokel-Walker, Chris
Subjects: *LANGUAGE models, *ARTIFICIAL intelligence, *GENERATIVE pre-trained transformers, *SWINDLERS & swindling, *PRICES, *CHATBOTS
Abstract: A new battery design using water and clay has been developed, offering sustainable power in extreme environments like Mars. The battery, made with graphene electrodes and water-filled clay layers, can store energy efficiently and has potential for widespread use. Additionally, large language models used in chatbots are susceptible to being scammed, with different AI models showing varying levels of susceptibility to persuasive tactics in scam scenarios. Researchers emphasize the need for human oversight in AI decision-making processes due to these findings. [Extracted from the article]
Published: 2024
Full Text: View/download PDF

35. Calling in at Dr. GPT.

Author: Engel, Christoph
Subjects: LANGUAGE models, PSYCHOLOGICAL vulnerability, GENERATIVE pre-trained transformers, INSTITUTIONAL economics, CHATBOTS
Abstract: The article titled "Calling in at Dr. GPT" discusses the potential impact of language models, specifically social bots, on mental health. The authors examine whether these bots can provide a safe haven for individuals with mental illness by offering a means of communication free from physical contact. The study finds that while superusers of the platform SimSimi report higher levels of perceived depression, the causality between excessive use of social bots and mental health is unclear. The authors suggest further research and experimentation to determine whether social bots can effectively address the needs of individuals with mental health issues. [Extracted from the article]
Published: 2024
Full Text: View/download PDF

36. COPILOT PRO & COPILOT FOR MICROSOFT 365.

Author: Betteridge, Ian, Danton, Tim, and Timberley, Adam
Subjects: LANGUAGE models, ARTIFICIAL intelligence, GENERATIVE artificial intelligence, CHATBOTS, SHARED workspaces, GENERATIVE pre-trained transformers, ASPECT ratio (Images)
Abstract: This document provides an overview of Microsoft's Copilot software and its different versions. Copilot is a powerful tool that can be accessed for free via the web. The article discusses Copilot Pro, which offers additional features for a monthly fee, and Copilot for Microsoft 365, which is aimed at businesses. It also mentions limitations and potential risks associated with using Copilot, such as incorrect statements and the possibility of confidential files being surfaced. The article covers the use of Copilot in various Microsoft Office applications, such as PowerPoint, Excel, Outlook, and OneNote, highlighting its effectiveness in each. Additionally, it mentions alternative AI models and pricing options for Copilot. The article also discusses the use of Copilot in Microsoft Teams meetings, emphasizing its ability to summarize discussions and provide meeting recaps, but advising against relying solely on the tool and the importance of human oversight. [Extracted from the article]
Published: 2024

37. ChatGPT-Based Tools that can Boost Research Productivity.

Author: Bhardwaj, Aditya
Subjects: GENERATIVE pre-trained transformers, CHATBOTS, CHATGPT
Abstract: The article introduces ChatGPT-based tools, revolutionizing research with their natural language understanding and text generation capabilities. Topics include research-oriented Artificial IntelligenceI assistants like ResearchGPT, Academic Research Reviewer, and Literature Review GPT, facilitating tasks such as literature search, academic paper reviewing, and literature review generation, thus enhancing research productivity.
Published: 2024

38. Exploring Diagnostic Precision and Triage Proficiency: A Comparative Study of GPT-4 and Bard in Addressing Common Ophthalmic Complaints.

Author: Zandi, Roya, Fahey, Joseph D., Drakopoulos, Michael, Bryan, John M., Dong, Siyuan, Bryar, Paul J., Bidwell, Ann E., Bowen, R. Chris, Lavine, Jeremy A., and Mirza, Rukhsana G.
Subjects: *GENERATIVE pre-trained transformers, *CHATBOTS, *LANGUAGE models, *MEDICAL triage, *CHATGPT, *PATIENT satisfaction
Abstract: In the modern era, patients often resort to the internet for answers to their health-related concerns, and clinics face challenges to providing timely response to patient concerns. This has led to a need to investigate the capabilities of AI chatbots for ophthalmic diagnosis and triage. In this in silico study, 80 simulated patient complaints in ophthalmology with varying urgency levels and clinical descriptors were entered into both ChatGPT and Bard in a systematic 3-step submission process asking chatbots to triage, diagnose, and evaluate urgency. Three ophthalmologists graded chatbot responses. Chatbots were significantly better at ophthalmic triage than diagnosis (90.0% appropriate triage vs. 48.8% correct leading diagnosis; p < 0.001), and GPT-4 performed better than Bard for appropriate triage recommendations (96.3% vs. 83.8%; p = 0.008), grader satisfaction for patient use (81.3% vs. 55.0%; p < 0.001), and lower potential harm rates (6.3% vs. 20.0%; p = 0.010). More descriptors improved the accuracy of diagnosis for both GPT-4 and Bard. These results indicate that chatbots may not need to recognize the correct diagnosis to provide appropriate ophthalmic triage, and there is a potential utility of these tools in aiding patients or triage staff; however, they are not a replacement for professional ophthalmic evaluation or advice. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

39. Classification of Conversational Sentences Using an Ensemble Pre-Trained Language Model with the Fine-Tuned Parameter.

Author: Sujatha, R. and Nimala, K.
Subjects: LANGUAGE models, GENERATIVE pre-trained transformers, TRANSFORMER models, NATURAL language processing, SEMANTICS, GENERAL semantics, CHATBOTS
Abstract: Sentence classification is the process of categorizing a sentence based on the context of the sentence. Sentence categorization requires more semantic highlights than other tasks, such as dependence parsing, which requires more syntactic elements. Most existing strategies focus on the general semantics of a conversation without involving the context of the sentence, recognizing the progress and comparing impacts. An ensemble pre-trained language model was taken up here to classify the conversation sentences from the conversation corpus. The conversational sentences are classified into four categories: information, question, directive, and commission. These classification label sequences are for analyzing the conversation progress and predicting the pecking order of the conversation. Ensemble of Bidirectional Encoder for Representation of Transformer (BERT), Robustly Optimized BERT pretraining Approach (RoBERTa), Generative Pre-Trained Transformer (GPT), DistilBERT and Generalized Autoregressive Pretraining for Language Understanding (XLNet) models are trained on conversation corpus with hyperparameters. Hyperparameter tuning approach is carried out for better performance on sentence classification. This Ensemble of Pre-trained LanguageModels with a Hyperparameter Tuning (EPLM-HT) system is trained on an annotated conversation dataset. The proposed approach outperformed compared to the base BERT, GPT, DistilBERT and XLNet transformer models. The proposed ensemble model with the fine-tuned parameters achieved an F1_score of 0.88. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

40. Ten simple rules for using large language models in science, version 1.0.

Author: Smith, Gabriel Reuben, Bello, Carolina, Bialic-Murphy, Lalasia, Clark, Emily, Delavaux, Camille S., Fournier de Lauriere, Camille, van den Hoogen, Johan, Lauber, Thomas, Ma, Haozhi, Maynard, Daniel S., Mirman, Matthew, Mo, Lidong, Rebindaine, Dominic, Reek, Josephine Elena, Werden, Leland K., Wu, Zhaofei, Yang, Gayoung, Zhao, Qingzhou, Zohner, Constantin M., and Crowther, Thomas W.
Subjects: *LANGUAGE models, *GENERATIVE artificial intelligence, *SCIENTIFIC language, *GENERATIVE pre-trained transformers, *LINGUISTICS, *CHATBOTS, *TUNDRAS
Abstract: The article explores the use of large language models (LLMs) in scientific research, highlighting their potential benefits and limitations. It provides examples of how LLMs can assist in various scientific tasks, such as summarizing documents and improving writing. However, it also acknowledges the need for fact-checking and the potential biases in training data. The authors emphasize the importance of adhering to ethical guidelines and journal policies when using LLMs. While LLMs can be valuable tools, caution should be exercised in their use. [Extracted from the article]
Published: 2024
Full Text: View/download PDF

41. An In-Depth Review of ChatGPT's Pros and Cons for Learning and Teaching in Education.

Author: Samala, Agariadne Dwinggo, Xiaoming Zhai, Kumiko Aoki, Bojic, Ljubisa, and Zikic, Simona
Subjects: CHATGPT, LANGUAGE models, GENERATIVE pre-trained transformers, CHATBOTS, GENERATIVE artificial intelligence
Abstract: As technology progresses, there has been an increasing interest in using Chatbot GPT (Generative Pre-trained Transformer) in education. Chatbot GPT, or ChatGPT, gained one million users within the first week of launching in November 2022 and had amassed over 100 million active users by February 2023. This type of artificial intelligence uses natural language processing to convert it into a user. This paper presents a comprehensive analysis and review of 34 articles published on ChatGPT and its potential impact on education by utilizing the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) methodology. This review analyzed various studies and articles to examine the strengths and limitations of GPT language models in education from 2018 to the present. The advantages of ChatGPT include its capacity to provide personalized and adaptive learning, instant feedback, and improved accessibility. However, there are potential drawbacks, such as the lack of emotional intelligence, the risk of overreliance on technology, and privacy concerns. This review suggests that ChatGPT has significant promise for education yet reinforces the necessity for further research and careful consideration of possible risks and limitations. Specifically, it pointed out potential invisible manipulations by instructing ChatGPT to answer educationrelated topics. The paper concludes by discussing the implications of ChatGPT for the future of education and emphasizing the need for further research in this field. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

42. ChatGPT for Professional English Course Development.

Author: Kostikova, Ilona, Holubnycha, Liudmyla, Besarab, Tetiana, Moshynska, Olena, Moroz, Tetiana, and Shamaieva, Iuliia
Subjects: CHATGPT, CHATBOTS, ELECTRONIC textbooks, GENERATIVE pre-trained transformers, LANGUAGE models, ENGLISH language, ARTIFICIAL intelligence
Abstract: Digitalization of all life spheres is the reality of modern world development. The global digitization creates a powerful information environment. Its navigation requires serious tools for structuring, systematizing and processing information. Digital processes, as the most progressive, are in constant development. Artificial Intelligence (AI) is gaining popularity. The latest product that has created a lot of discussion is chatGPT (Generative Pre-trained Transformer) from OpenAI, which is an artificial intelligence chatbot that demonstrates the ability of digital devices to perform the tasks inherent to intelligent beings. The paper shows some issues of using ChatGPT for making a new course of Law English, a curriculum, a syllabus at the tertiary level removing concerns related to its application and utilization. The purpose of the manuscript is to describe a real case of working out a professional English course for university students applying ChatGPT. The methods of analysis, synthesis, case study, expert assessment were used. The results are as follows: a new course for Law English training, a curriculum, a syllabus and a textbook using content created by ChatGPT were readied. The conclusion is that, nowadays, everything can be taught by a teacher partly with an AI. ChatGPT can be used for a wide variety of educational purposes, including providing information, generating necessary texts, tasks, tests, and answering many questions. Most of the mentioned ChatGPT services are important educational elements in language teaching and learning, they can be used for professional English course development. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

43. Data Extraction from Unstructured Medical Records with the GPT-4 Chatbot.

Author: Ntinopoulos, V., Rodriguez Cetina, Biefer H., Tudorache, I., Papadopoulos, N., Odavic, D., Risteski, P., Haeussler, A., and Dzemali, O.
Subjects: *GENERATIVE pre-trained transformers, *DATA extraction, *CHATBOTS, *MEDICAL records
Abstract: This article discusses a study that explores the use of GPT-4, a generative pre-trained transformer, as a tool for extracting data from unstructured medical records. The study involved drafting fifty fictitious patient records in German and providing GPT-4 with instructions on how to process each one. The accuracy, recall, precision, and F1-Score of GPT-4 were assessed for various variables, and it was found that GPT-4 exhibited excellent performance in both text-mining and classification tasks. The study concludes that reliable data extraction from unstructured medical records is possible with the GPT-4 chatbot. [Extracted from the article]
Published: 2024
Full Text: View/download PDF

44. Application of Conversational Generative Pre-trained Transformer for improvement of information services in academic libraries.

Author: Modiba, Mashilo
Subjects: *GENERATIVE pre-trained transformers, *CHATGPT, *CHATBOTS, *LITERATURE reviews, *ARTIFICIAL intelligence
Abstract: Conversational Generative Pre-Trained Transformer (ChatGPT) can be applied in academic libraries to provide information services in the Fifth Industrial Revolution (5IR). In providing information services in academic libraries, ChatGPT refers to an artificial intelligence (AI)-powered chatbot or virtual assistant specifically tailored for library environments cybrarians can use to provide information services in academic libraries. The purpose of this study was to investigate the application of ChatGPT to provide information services in academic libraries in the 5IR. This qualitative study used literature review as methodology and incorporated insights from researchers' experiences with using AI chatbots such as ChatGPT to provide information services in academic libraries in the 5IR. The study findings revealed that ChatGPT can effectively provide information services in academic libraries in the 5IR. It can enable smooth interaction between the patrons and the library, and answer queries of patrons. It can provide information services around the clock without interruption. This study proposes a framework to apply ChatGPT to provide information services in academic libraries in the 5IR, which can guide academic libraries in the 5IR to adopt and utilise ChatGPT to provide the information services effectively and efficiently. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

45. ENHANCING USER EXPERIENCE AND INFORMATION RETRIEVAL IN LIBRARIES THROUGH CHATBOT INTEGRATION: A STUDY ON UTILIZING CHATGPT.

Author: Gupta, Mohit and Singh, Ritu
Subjects: *CHATGPT, *CHATBOTS, *GENERATIVE pre-trained transformers, *INFORMATION retrieval, *USER experience, *INTERNET privacy, *DATA privacy
Abstract: This research investigates the integration of ChatGPT, a generative pre-trained transformer model, within library systems to augment user experience and information retrieval efficacy. The study explores user interactions with ChatGPT, analyzes data security anxieties, assesses technical feasibility, and evaluates the influence of integration on users’ ability to find relevant resources. Findings reveal a significant portion of library patrons utilize chatbots for information retrieval, perceive ChatGPT as helpful for queries, and value personalized recommendations. Nevertheless, concerns regarding data privacy and the potential for diminished human interaction necessitate mitigation strategies. The study advocates for a measured approach to Generative Pre-trained Transformer (GPT) integration, emphasizing user training and transparent communication regarding data privacy practices. Furthermore, librarians should retain an active role to address complex inquiries demanding human expertise and attenuate potential biases within GPT responses. [ABSTRACT FROM AUTHOR]
Published: 2024

46. UA-LLM: ADVANCING CONTEXT-BASED QUESTION ANSWERING IN UKRAINIAN THROUGH LARGE LANGUAGE MODELS.

Author: M. V., Syromiatnikov and V. M., Ruvinskaya
Subjects: LANGUAGE models, QUESTION answering systems, NATURAL language processing, UKRAINIAN language, CHATBOTS, GENERATIVE pre-trained transformers
Abstract: Context-based question answering, a fundamental task in natural language processing, demands a deep understanding of the language’s nuances. While being a sophisticated task, it’s an essential part of modern search systems, intelligent assistants, chatbots, and the whole Conversational AI field. While English, Chinese, and other widely spoken languages have gathered an extensive number of datasets, algorithms, and benchmarks, the Ukrainian language, with its rich linguistic heritage and intricate syntax, has remained among low-resource languages in the NLP community, making the Question Answering problem even harder. Objective. The purpose of this work is to establish and benchmark a set of techniques, leveraging Large Language Models, combined in a single framework for solving the low-resource problem for Context-based question-answering task in Ukrainian. Method. A simple yet flexible framework for leveraging Large Language Models, developed as a part of this research work, enlights two key methods proposed and evaluated in this paper for dealing with a small amount of training data for context-based question-answering tasks. The first one utilizes Zero-shot and Few-shot learning – the two major subfields of N-shot learning, where N corresponds to the number of training samples, to build a bilingual instruction-based prompt strategy for language models inferencing in an extractive manner (find an answer span in context) instead of their natural generative behavior (summarize the context according to question). The second proposed method is based on the first one, but instead of just answering the question, the language model annotates the input context through the generation of question-answer pairs for the given paragraph. This synthetic data is used for extractive model training. This paper explores both augmentation-based training, when there is some annotated data already, and completely synthetic training, when no data is available. The key benefit of these two methods is the ability to obtain comparable prediction quality even without an expensive and long-term human annotation process. Results. Two proposed methods for solving the low-to-zero amount of training data problem for context-based questionanswering tasks in Ukrainian were implemented and combined into the flexible LLM experimentation framework. Conclusions. This research comprehensively studied OpenAI GPT-3.5, OpenAI GPT-4, Cohere Command, and Meta LLaMa-2 language understanding capabilities applied to context-based question answering in low-resource Ukrainian. The thorough evaluation of proposed methods on a diverse set of metrics proves their efficiency, unveiling the possibility of building components of search engines, chatbot applications, and standalone general-domain CBQA systems with Ukrainian language support while having almost zero annotated data. The prospect for further research is to extend the scope from the CBQA task evaluated in this paper to all major NLU tasks with the final goal of establishing a complete benchmark for LLMs’ capabilities evaluation in the Ukrainian language. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

47. GenAI et al.: Cocreation, Authorship, Ownership, Academic Ethics and Integrity in a Time of Generative AI.

Author: Bozkurt, Aras
Subjects: GENERATIVE artificial intelligence, EDUCATION ethics, INTEGRITY, HONESTY, LANGUAGE models, GENERATIVE pre-trained transformers, NATURAL language processing
Abstract: This paper investigates the complex interplay between generative artificial intelligence (AI) and human intellect in academic writing and publishing. It examines the 'organic versus synthetic' paradox, emphasizing the implications of using generative AI tools in educational and academic integrity contexts. The paper critiques the prevalent 'publish or perish' culture in academia, highlighting the need for systemic reevaluation due to generative AI's emerging role in academic writing and reporting. It delves into the legal and ethical challenges of authorship and ownership, especially in relation to copyright laws and AI-generated content. The paper discusses generative AI's diverse roles and advocates for transparent reporting to uphold academic integrity. Additionally, it calls for a broader examination of generative AI tools and stresses the need for new mechanisms to identify generative AI use and ensure adherence to academic integrity and ethics. The implications of generative AI are also explored, suggesting the need for innovative AI-inclusive strategies in academia. The paper concludes by emphasizing the significance of generative AI in various information-processing domains, highlighting the urgency to adapt and transform academic practices in an era of rapid generative AI-driven change. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

48. THE FUTURE OF AI: PATTERN DISCOVERY, EXPLAINABLE AI, AND ETHICAL AI.

Author: Anderson, Mark
Subjects: ARTIFICIAL intelligence, GENERATIVE pre-trained transformers, GENERATIVE artificial intelligence, CHATBOTS, LANGUAGE models
Abstract: The article focuses on the future of artificial intelligence (AI), exploring topics such as pattern discovery, explainable AI, and ethical AI. It discusses the need for trustworthy computing, the emergence of hypotheses-generating engines, and the importance of transparent and unbiased AI systems for scientific discovery and problem-solving.
Published: 2024

49. AI-generated feedback on writing: insights into efficacy and ENL student preference.

Author: Escalante, Juan, Pack, Austin, and Barrett, Alex
Subjects: GENERATIVE artificial intelligence, PSYCHOLOGICAL feedback, CHATBOTS, LANGUAGE models, CHATGPT, EDUCATIONAL outcomes, GENERATIVE pre-trained transformers, EDUCATIONAL evaluation, ARTIFICIAL intelligence
Abstract: The question of how generative AI tools, such as large language models and chatbots, can be leveraged ethically and effectively in education is ongoing. Given the critical role that writing plays in learning and assessment within educational institutions, it is of growing importance for educators to make thoughtful and informed decisions as to how and in what capacity generative AI tools should be leveraged to assist in the development of students' writing skills. This paper reports on two longitudinal studies. Study 1 examined learning outcomes of 48 university English as a new language (ENL) learners in a six-week long repeated measures quasi experimental design where the experimental group received writing feedback generated from ChatGPT (GPT-4) and the control group received feedback from their human tutor. Study 2 analyzed the perceptions of a different group of 43 ENLs who received feedback from both ChatGPT and their tutor. Results of study 1 showed no difference in learning outcomes between the two groups. Study 2 results revealed a near even split in preference for AI-generated or human-generated feedback, with clear advantages to both forms of feedback apparent from the data. The main implication of these studies is that the use of AI-generated feedback can likely be incorporated into ENL essay evaluation without affecting learning outcomes, although we recommend a blended approach that utilizes the strengths of both forms of feedback. The main contribution of this paper is in addressing generative AI as an automatic essay evaluator while incorporating learner perspectives. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

50. Retraining fact-checkers: The emergence of ChatGPT in information verification.

Author: Cuartielles, Roger, Ramon-Vegas, Xavier, and Pont-Sorribes, Carles
Subjects: *CHATGPT, *GENERATIVE artificial intelligence, *CHATBOTS, *ARTIFICIAL intelligence, *AMBIVALENCE, *GENERATIVE pre-trained transformers, *LANGUAGE models, *OCCUPATIONAL retraining
Abstract: The open launch of new artificial intelligence tools such as ChatGPT-3.5 (Generated Pre-trained Transformer) in November 2022 by the company OpenAI -and then its update to version GPT-4 in March 2023-poses new opportunities and challenges for journalism, and especially for professionals specifically focused on information verification. This research aims to understand and analyze the perceptions generated by the irruption of ChatGPT among fact-checking professionals in Spain with the aim of identifying disadvantages and advantages in its use, professional implications and desired functionalities. The study uses qualitative methodology with in-depth interviews with professionals from all Spanish fact-checking platforms belonging to the International Factchecking Network (IFCN) and the European Digital Media Observatory (EDMO). The results conclude that the use of ChatGPT presents notable ambivalences. On the one hand, there are perceived drawbacks in issues such as the transparency and reliability of sources, the scope of the data, and the format of the responses generated. However, fact-checkers also point to a possible auxiliary use of the chatbot in the tasks of gathering information, detecting falsehoods, and producing denials. The irruption of ChatGPT has a direct impact on the work routines of the fact-checkers, which can be made more difficult, reinforced or extended. Fact-checking professionals perceive themselves as "context agents" in a new ecosystem that also obliges them to further diversify their fields of action in the fight against disinformation and to accelerate the implementation of media education actions that empower citizens in the responsible use of artificial intelligence. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

109 results on '"GENERATIVE pre-trained transformers"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources