Author: "Luis Espinosa-Anke" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Luis Espinosa-Anke"' showing total 184 results

1. Enabling Early Health Care Intervention by Detecting Depression in Users of Web-Based Forums using Language Models: Longitudinal Analysis and Evaluation

Author: David Owen, Dimosthenis Antypas, Athanasios Hassoulas, Antonio F Pardiñas, Luis Espinosa-Anke, and Jose Camacho Collados
Subjects: Computer applications to medicine. Medical informatics, R858-859.7
Abstract: Background: Major depressive disorder is a common mental disorder affecting 5% of adults worldwide. Early contact with health care services is critical for achieving accurate diagnosis and improving patient outcomes. Key symptoms of major depressive disorder (depression hereafter) such as cognitive distortions are observed in verbal communication, which can also manifest in the structure of written language. Thus, the automatic analysis of text outputs may provide opportunities for early intervention in settings where written communication is rich and regular, such as social media and web-based forums. Objective: The objective of this study was 2-fold. We sought to gauge the effectiveness of different machine learning approaches to identify users of the mass web-based forum Reddit, who eventually disclose a diagnosis of depression. We then aimed to determine whether the time between a forum post and a depression diagnosis date was a relevant factor in performing this detection. Methods: A total of 2 Reddit data sets containing posts belonging to users with and without a history of depression diagnosis were obtained. The intersection of these data sets provided users with an estimated date of depression diagnosis. This derived data set was used as an input for several machine learning classifiers, including transformer-based language models (LMs). Results: Bidirectional Encoder Representations from Transformers (BERT) and MentalBERT transformer-based LMs proved the most effective in distinguishing forum users with a known depression diagnosis from those without. They each obtained a mean F1-score of 0.64 across the experimental setups used for binary classification. The results also suggested that the final 12 to 16 weeks (about 3-4 months) of posts before a depressed user’s estimated diagnosis date are the most indicative of their illness, with data before that period not helping the models detect more accurately. Furthermore, in the 4- to 8-week period before the user’s estimated diagnosis date, their posts exhibited more negative sentiment than any other 4-week period in their post history. Conclusions: Transformer-based LMs may be used on data from web-based social media forums to identify users at risk for psychiatric conditions such as depression. Language features picked up by these classifiers might predate depression onset by weeks to months, enabling proactive mental health care interventions to support those at risk for this condition.
Published: 2023
Full Text: View/download PDF

2. English–Welsh Cross-Lingual Embeddings

Author: Luis Espinosa-Anke, Geraint Palmer, Padraig Corcoran, Maxim Filimonov, Irena Spasić, and Dawn Knight
Subjects: natural language processing, distributional semantics, machine learning, language model, word embeddings, machine translation, Technology, Engineering (General). Civil engineering (General), TA1-2040, Biology (General), QH301-705.5, Physics, QC1-999, Chemistry, QD1-999
Abstract: Cross-lingual embeddings are vector space representations where word translations tend to be co-located. These representations enable learning transfer across languages, thus bridging the gap between data-rich languages such as English and others. In this paper, we present and evaluate a suite of cross-lingual embeddings for the English–Welsh language pair. To train the bilingual embeddings, a Welsh corpus of approximately 145 M words was combined with an English Wikipedia corpus. We used a bilingual dictionary to frame the problem of learning bilingual mappings as a supervised machine learning task, where a word vector space is first learned independently on a monolingual corpus, after which a linear alignment strategy is applied to map the monolingual embeddings to a common bilingual vector space. Two approaches were used to learn monolingual embeddings: word2vec and fastText. Three cross-language alignment strategies were explored: cosine similarity, inverted softmax and cross-domain similarity local scaling (CSLS). We evaluated different combinations of these approaches using two tasks: bilingual dictionary induction and cross-lingual sentiment analysis. The best results were achieved using monolingual fastText embeddings and the CSLS metric. We also demonstrated that by including a few automatically translated training documents, the performance of a cross-lingual text classifier for Welsh can increase by approximately 20 percentage points.
Published: 2021
Full Text: View/download PDF

3. Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models.

Author: Zara Siddique, Liam D. Turner, and Luis Espinosa Anke
Published: 2024

4. Overview of the CLEF 2024 LongEval Lab on Longitudinal Evaluation of Model Performance.

Author: Rabab Alkhalifa, Hsuvas Borkakoty, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa Anke, Tobias Fink, Petra Galuscáková, Gabriela González Sáez, Lorraine Goeuriot, David Iommi, Maria Liakata, Harish Tayyar Madabushi, Pablo Medina-Alias, Philippe Mulhem, Florina Piroi, Martin Popel, and Arkaitz Zubiaga
Published: 2024
Full Text: View/download PDF

5. Extended Overview of the CLEF 2024 LongEval Lab on Longitudinal Evaluation of Model Performance.

Author: Rabab Alkhalifa, Hsuvas Borkakoty, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa Anke, Tobias Fink, Petra Galuscáková, Gabriela González Sáez, Lorraine Goeuriot, David Iommi, Maria Liakata, Harish Tayyar Madabushi, Pablo Medina-Alias, Philippe Mulhem, Florina Piroi, Martin Popel, and Arkaitz Zubiaga
Published: 2024

6. WordNet under Scrutiny: Dictionary Examples in the Era of Large Language Models.

Author: Fatemah Yousef Almeman, Steven Schockaert, and Luis Espinosa Anke
Published: 2024

7. AMenDeD: Modelling Concepts by Aligning Mentions, Definitions and Decontextualised Embeddings.

Author: Amit Gajbhiye, Zied Bouraoui, Luis Espinosa Anke, and Steven Schockaert
Published: 2024

8. RAGAs: Automated Evaluation of Retrieval Augmented Generation.

Author: Shahul ES, Jithin James, Luis Espinosa Anke, and Steven Schockaert
Published: 2024

9. LongEval: Longitudinal Evaluation of Model Performance at CLEF 2024.

Author: Rabab Alkhalifa, Hsuvas Borkakoty, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa Anke, Tobias Fink, Gabriela González Sáez, Petra Galuscáková, Lorraine Goeuriot, David Iommi, Maria Liakata, Harish Tayyar Madabushi, Pablo Medina-Alias, Philippe Mulhem, Florina Piroi, Martin Popel, Christophe Servan, and Arkaitz Zubiaga
Published: 2024
Full Text: View/download PDF

10. Savana: Re-using Electronic Health Records with Artificial Intelligence

Author: Ignacio Hernández Medrano, Jorge Tello Guijarro, Cristóbal Belda, Alberto Ureña, Ignacio Salcedo, Luis Espinosa-Anke, and Horacio Saggion
Subjects: Artificial Intelligence, e-health, Electronic Records, Machine Learning, NLP, Technology
Abstract: Health information grows exponentially (doubling every 5 years), thus generating a sort of inflation of science, i.e. the generation of more knowledge than we can leverage. In an unprecedented data-driven shift, doctors today no longer have time to keep up to date. This fact explains why only one in every five medical decisions is based strictly on evidence, which inevitably leads to variability. A good solution lies in clinical decision support systems based on big data analysis. As the processing of large amounts of information gains relevance, automatic approaches become increasingly capable of seeing and correlating information further and better than the human mind can. In this context, healthcare professionals are increasingly counting on a new set of tools in order to deal with the growing information that becomes available to them on a daily basis. By allowing the grouping of collective knowledge and prioritizing “mindlines” against “guidelines”, these support systems are among the most promising applications of big data in health. In this demo paper we introduce Savana, an AI-enabled system based on Natural Language Processing (NLP) and Neural Networks, capable of, for instance, the automatic expansion of medical terminologies, thus enabling the re-use of information expressed in natural language in clinical reports. This automated and precise digital extraction allows the generation of a real-time information engine, which is currently being deployed in healthcare institutions, as well as clinical research and management.
Published: 2018
Full Text: View/download PDF

11. WiDe-analysis: Enabling One-click Content Moderation Analysis on Wikipedia's Articles for Deletion.

Author: Hsuvas Borkakoty and Luis Espinosa Anke
Published: 2024
Full Text: View/download PDF

12. Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset.

Author: Hsuvas Borkakoty and Luis Espinosa Anke
Published: 2024
Full Text: View/download PDF

13. CHEW: A Dataset of CHanging Events in Wikipedia.

Author: Hsuvas Borkakoty and Luis Espinosa Anke
Published: 2024
Full Text: View/download PDF

14. SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research.

Author: Dimosthenis Antypas, Asahi Ushio, Francesco Barbieri, Leonardo Neves, Kiamehr Rezaee, Luis Espinosa Anke, Jiaxin Pei, and José Camacho-Collados
Published: 2023
Full Text: View/download PDF

15. Construction Artifacts in Metaphor Identification Datasets.

Author: Joanne Boisson, Luis Espinosa Anke, and José Camacho-Collados
Published: 2023
Full Text: View/download PDF

16. What do Deck Chairs and Sun Hats Have in Common? Uncovering Shared Properties in Large Concept Vocabularies.

Author: Amit Gajbhiye, Zied Bouraoui, Na Li, Usashi Chatterjee, Luis Espinosa Anke, and Steven Schockaert
Published: 2023
Full Text: View/download PDF

17. Extended Overview of the CLEF-2023 LongEval Lab on Longitudinal Evaluation of Model Performance.

Author: Rabab Alkhalifa, Iman Munire Bilal, Hsuvas Borkakoty, José Camacho-Collados, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa Anke, Gabriela Nicole González Sáez, Petra Galuscáková, Lorraine Goeuriot, Elena Kochkina, Maria Liakata, Daniel Loureiro, Philippe Mulhem, Florina Piroi, Martin Popel, Christophe Servan, Harish Tayyar Madabushi, and Arkaitz Zubiaga
Published: 2023

18. Overview of the CLEF-2023 LongEval Lab on Longitudinal Evaluation of Model Performance.

Author: Rabab Alkhalifa, Iman Munire Bilal, Hsuvas Borkakoty, José Camacho-Collados, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa Anke, Gabriela González Sáez, Petra Galuscáková, Lorraine Goeuriot, Elena Kochkina, Maria Liakata, Daniel Loureiro, Philippe Mulhem, Florina Piroi, Martin Popel, Christophe Servan, Harish Tayyar Madabushi, and Arkaitz Zubiaga
Published: 2023
Full Text: View/download PDF

19. LongEval: Longitudinal Evaluation of Model Performance at CLEF 2023.

Author: Rabab Alkhalifa, Iman Munire Bilal, Hsuvas Borkakoty, José Camacho-Collados, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa Anke, Gabriela González Sáez, Petra Galuscáková, Lorraine Goeuriot, Elena Kochkina, Maria Liakata, Daniel Loureiro, Harish Tayyar Madabushi, Philippe Mulhem, Florina Piroi, Martin Popel, Christophe Servan, and Arkaitz Zubiaga
Published: 2023
Full Text: View/download PDF

20. WIKITIDE: A Wikipedia-Based Timestamped Definition Pairs Dataset.

Author: Hsuvas Borkakoty and Luis Espinosa Anke
Published: 2023

21. 3D-EX: A Unified Dataset of Definitions and Dictionary Examples.

Author: Fatemah Almeman, Hadi Sheikhi, and Luis Espinosa Anke
Published: 2023

22. Meemi: A simple method for post-processing and integrating cross-lingual word embeddings.

Author: Yerai Doval, José Camacho-Collados, Luis Espinosa Anke, and Steven Schockaert
Published: 2023
Full Text: View/download PDF

23. TweetNLP: Cutting-Edge Natural Language Processing for Social Media.

Author: José Camacho-Collados, Kiamehr Rezaee, Talayeh Riahi, Asahi Ushio, Daniel Loureiro, Dimosthenis Antypas, Joanne Boisson, Luis Espinosa Anke, Fangyu Liu, and Eugenio Martínez Cámara
Published: 2022
Full Text: View/download PDF

24. Multilingual Extraction and Categorization of Lexical Collocations with Graph-aware Transformers.

Author: Luis Espinosa Anke, Alexander V. Shvets, Alireza Mohammadshahi, James Henderson, and Leo Wanner
Published: 2022
Full Text: View/download PDF

25. Distilling Hypernymy Relations from Language Models: On the Effectiveness of Zero-Shot Taxonomy Induction.

Author: Devansh Jain and Luis Espinosa Anke
Published: 2022
Full Text: View/download PDF

26. Modelling Commonsense Properties Using Pre-Trained Bi-Encoders.

Author: Amit Gajbhiye, Luis Espinosa Anke, and Steven Schockaert
Published: 2022

27. Self-Supervised Intermediate Fine-Tuning of Biomedical Language Models for Interpreting Patient Case Descriptions.

Author: Israa Alghanmi, Luis Espinosa Anke, and Steven Schockaert
Published: 2022

28. TempoWiC: An Evaluation Benchmark for Detecting Meaning Shift in Social Media.

Author: Daniel Loureiro, Aminette D'Souza, Areej Nasser Muhajab, Isabella A. White, Gabriel Wong, Luis Espinosa Anke, Leonardo Neves, Francesco Barbieri, and José Camacho-Collados
Published: 2022

29. SemEval-2022 Task 4: Patronizing and Condescending Language Detection.

Author: Carla Pérez-Almendros, Luis Espinosa Anke, and Steven Schockaert
Published: 2022
Full Text: View/download PDF

30. CardiffNLP-Metaphor at SemEval-2022 Task 2: Targeted Fine-tuning of Transformer-based Language Models for Idiomaticity Detection.

Author: Joanne Boisson, José Camacho-Collados, and Luis Espinosa Anke
Published: 2022
Full Text: View/download PDF

31. Pre-Training Language Models for Identifying Patronizing and Condescending Language: An Analysis.

Author: Carla Pérez-Almendros, Luis Espinosa Anke, and Steven Schockaert
Published: 2022

32. Sentence Selection Strategies for Distilling Word Embeddings from BERT.

Author: Yixiao Wang, Zied Bouraoui, Luis Espinosa Anke, and Steven Schockaert
Published: 2022

33. XLM-T: Multilingual Language Models in Twitter for Sentiment Analysis and Beyond.

Author: Francesco Barbieri, Luis Espinosa Anke, and José Camacho-Collados
Published: 2022

34. TimeLMs: Diachronic Language Models from Twitter.

Author: Daniel Loureiro, Francesco Barbieri, Leonardo Neves, Luis Espinosa Anke, and José Camacho-Collados
Published: 2022
Full Text: View/download PDF

35. Putting WordNet's Dictionary Examples in the Context of Definition Modelling: An Empirical Analysis.

Author: Fatemah Almeman and Luis Espinosa Anke
Published: 2022

36. Interpreting Patient Descriptions using Distantly Supervised Similar Case Retrieval.

Author: Israa Alghanmi, Luis Espinosa Anke, and Steven Schockaert
Published: 2022
Full Text: View/download PDF

37. Tweet Insights: A Visualization Platform to Extract Temporal Insights from Twitter.

Author: Daniel Loureiro, Kiamehr Rezaee, Talayeh Riahi, Francesco Barbieri, Leonardo Neves, Luis Espinosa Anke, and José Camacho-Collados
Published: 2023
Full Text: View/download PDF

38. RAGAS: Automated Evaluation of Retrieval Augmented Generation.

Author: Shahul ES, Jithin James, Luis Espinosa Anke, and Steven Schockaert
Published: 2023
Full Text: View/download PDF

39. Modelling General Properties of Nouns by Selectively Averaging Contextualised Embeddings.

Author: Na Li, Zied Bouraoui, José Camacho-Collados, Luis Espinosa Anke, Qing Gu, and Steven Schockaert
Published: 2021
Full Text: View/download PDF

40. Evaluating language models for the retrieval and categorization of lexical collocations.

Author: Luis Espinosa Anke, Joan Codina-Filbà, and Leo Wanner
Published: 2021
Full Text: View/download PDF

41. BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies?

Author: Asahi Ushio, Luis Espinosa Anke, Steven Schockaert, and José Camacho-Collados
Published: 2021
Full Text: View/download PDF

42. Probing Pre-Trained Language Models for Disease Knowledge.

Author: Israa Alghanmi, Luis Espinosa Anke, and Steven Schockaert
Published: 2021
Full Text: View/download PDF

43. Deriving Word Vectors from Contextualized Language Models using Topic-Aware Mention Selection.

Author: Yixiao Wang, Zied Bouraoui, Luis Espinosa Anke, and Steven Schockaert
Published: 2021
Full Text: View/download PDF

44. TweetEval: Unified Benchmark and Comparative Evaluation for Tweet Classification.

Author: Francesco Barbieri, José Camacho-Collados, Luis Espinosa Anke, and Leonardo Neves
Published: 2020
Full Text: View/download PDF

45. Don't Patronize Me! An Annotated Dataset with Patronizing and Condescending Language towards Vulnerable Communities.

Author: Carla Pérez-Almendros, Luis Espinosa Anke, and Steven Schockaert
Published: 2020
Full Text: View/download PDF

46. Capturing Word Order in Averaging Based Sentence Embeddings.

Author: Jae Hee Lee, José Camacho-Collados, Luis Espinosa Anke, and Steven Schockaert
Published: 2020
Full Text: View/download PDF

47. Cardiff University at SemEval-2020 Task 6: Fine-tuning BERT for Domain-Specific Definition Classification.

Author: Shelan S. Jeawak, Luis Espinosa Anke, and Steven Schockaert
Published: 2020
Full Text: View/download PDF

48. On the Robustness of Unsupervised and Semi-supervised Cross-lingual Word Embedding Learning.

Author: Yerai Doval, José Camacho-Collados, Luis Espinosa Anke, and Steven Schockaert
Published: 2020

49. CollFrEn: Rich Bilingual English-French Collocation Resource.

Author: Beatríz Fisas, Joan Codina-Filbà, Luis Espinosa Anke, and Leo Wanner
Published: 2020

50. Learning Cross-Lingual Word Embeddings from Twitter via Distant Supervision.

Author: José Camacho-Collados, Yerai Doval, Eugenio Martínez-Cámara, Luis Espinosa Anke, Francesco Barbieri, and Steven Schockaert
Published: 2020
