Language: english / Publication Year Range: Last 50 years / Search Limiters: Full Text / Topic: 4 selected - Searchworks@Jio Institute Digital Library Search Results

Showing total 6 results

Start Over Search Limiters Full Text Topic business.industry Topic data mining Topic deep learning Topic natural language processing Publication Year Range Last 50 years Language english

6 results

1. BioBERT: a pre-trained biomedical language representation model for biomedical text mining

Author: Chan Ho So, Sungdong Kim, Jinhyuk Lee, Jaewoo Kang, Donghyeon Kim, Wonjin Yoon, and Sunkyu Kim
Subjects: Statistics and Probability, FOS: Computer and information sciences, Source code, Computer science, media_common.quotation_subject, computer.software_genre, Biochemistry, Named-entity recognition, Question answering, Data Mining, Molecular Biology, ComputingMilieux_MISCELLANEOUS, media_common, Language, Natural Language Processing, Computer Science - Computation and Language, business.industry, Deep learning, Biomedical text mining, Relationship extraction, Original Papers, Computer Science Applications, Computational Mathematics, Computational Theory and Mathematics, Artificial intelligence, Language model, Data and Text Mining, business, F1 score, computer, Computation and Language (cs.CL), Natural language processing, Software
Abstract: Biomedical text mining is becoming increasingly important as the number of biomedical documents rapidly grows. With the progress in natural language processing (NLP), extracting valuable information from biomedical literature has gained popularity among researchers, and deep learning has boosted the development of effective biomedical text mining models. However, directly applying the advancements in NLP to biomedical text mining often yields unsatisfactory results due to a word distribution shift from general domain corpora to biomedical corpora. In this article, we investigate how the recently introduced pre-trained language model BERT can be adapted for biomedical corpora. We introduce BioBERT (Bidirectional Encoder Representations from Transformers for Biomedical Text Mining), which is a domain-specific language representation model pre-trained on large-scale biomedical corpora. With almost the same architecture across tasks, BioBERT largely outperforms BERT and previous state-of-the-art models in a variety of biomedical text mining tasks when pre-trained on biomedical corpora. While BERT obtains performance comparable to that of previous state-of-the-art models, BioBERT significantly outperforms them on the following three representative biomedical text mining tasks: biomedical named entity recognition (0.62% F1 score improvement), biomedical relation extraction (2.80% F1 score improvement) and biomedical question answering (12.24% MRR improvement). Our analysis results show that pre-training BERT on biomedical corpora helps it to understand complex biomedical texts. We make the pre-trained weights of BioBERT freely available at https://github.com/naver/biobert-pretrained, and the source code for fine-tuning BioBERT available at https://github.com/dmis-lab/biobert., Bioinformatics
Published: 2019

2. Enhanced Word Embedding Variations for the Detection of Substance Abuse and Mental Health Issues on Social Media Writings

Author: Ana Freire, Christine Largeron, Julien Tissier, Diana Ramírez-Cifuentes, Ricardo Baeza-Yates, Laboratoire Hubert Curien [Saint Etienne] (LHC), and Université Jean Monnet [Saint-Étienne] (UJM)-Centre National de la Recherche Scientifique (CNRS)-Institut d'Optique Graduate School (IOGS)
Subjects: Vocabulary, Word embedding, General Computer Science, Computer science, 020209 energy, media_common.quotation_subject, 02 engineering and technology, computer.software_genre, supervised learning, Visual approach, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], 0202 electrical engineering, electronic engineering, information engineering, General Materials Science, Word2vec, natural language processing, ComputingMilieux_MISCELLANEOUS, media_common, Classification algorithms, business.industry, Deep learning, Cosine similarity, General Engineering, data mining, mental disorders, TK1-9971, Task analysis, 020201 artificial intelligence & image processing, Artificial intelligence, Electrical engineering. Electronics. Nuclear engineering, business, computer, Word (computer architecture), Natural language processing
Abstract: Substance abuse and mental health issues are severe conditions that affect millions. Signs of certain conditions have been traced on social media through the analysis of posts. In this paper we analyze textual cues that characterize and differentiate Reddit posts related to depression, eating disorders, suicidal ideation, and alcoholism, along with control posts. We also generate enhanced word embeddings for binary and multi-class classification tasks dedicated to the detection of these types of posts. Our enhancement method to generate word embeddings focuses on identifying terms that are predictive for a class and aims to move their vector representations close to each other while moving them away from the vectors of terms that are predictive for other classes. Variations of the embeddings are defined and evaluated through predictive tasks, a cosine similarity-based method, and a visual approach. We generate predictive models using variations of our enhanced representations with statistical and deep learning approaches. We also propose a method that leverages the properties of the enhanced embeddings in order to build features for predictive models. Results show that variations of our enhanced representations outperform in Recall, Accuracy, and F1-Score the embeddings learned with Word2vec , DistilBERT , GloVe ’s fine-tuned pre-learned embeddings and other methods based on domain adapted embeddings. The approach presented has the potential to be used on similar binary or multi-class classification tasks that deal with small domain-specific textual corpora. This work was supported by the University of Lyon IDEXLYON, the Auvergne-Rhône-Alpes Region, and the Spanish Ministry of Economy and Competitiveness through the Maria de Maeztu Units of Excellence Program under Grant MDM-2015-0502.
Published: 2021

3. A survey on fake news and rumour detection techniques

Author: Francesco Marcelloni and Alessandro Bondielli
Subjects: Information Systems and Management, Text mining, Computer science, Rumours, 02 engineering and technology, Public opinion, Classification, Data mining, Deep learning, Fake news, Machine learning, Natural language processing, Theoretical Computer Science, Artificial Intelligence, 0202 electrical engineering, electronic engineering, information engineering, Focus (computing), business.industry, 05 social sciences, 050301 education, Data science, Computer Science Applications, Control and Systems Engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, 0503 education, Software
Abstract: False or unverified information spreads just like accurate information on the web, thus possibly going viral and influencing the public opinion and its decisions. Fake news and rumours represent the most popular forms of false and unverified information, respectively, and should be detected as soon as possible for avoiding their dramatic effects. The interest in effective detection techniques has been therefore growing very fast in the last years. In this paper we survey the different approaches to automatic detection of fake news and rumours proposed in the recent literature. In particular, we focus on five main aspects. First, we report and discuss the various definitions of fake news and rumours that have been considered in the literature. Second, we highlight how the collection of relevant data for performing fake news and rumours detection is problematic and we present the various approaches, which have been adopted to gather these data, as well as the publicly available datasets. Third, we describe the features that have been considered in fake news and rumour detection approaches. Fourth, we provide a comprehensive analysis on the various techniques used to perform rumour and fake news detection. Finally, we identify and discuss future directions.
Published: 2019

4. Position-aware deep multi-task learning for drug–drug interaction extraction

Author: Deyu Zhou, Lei Miao, and Yulan He
Subjects: 0301 basic medicine, RM, Databases, Factual, Drug-Related Side Effects and Adverse Reactions, Computer science, Drug-drug interaction, Medicine (miscellaneous), Multi-task learning, 02 engineering and technology, computer.software_genre, ENCODE, QA76, 03 medical and health sciences, Deep Learning, Artificial Intelligence, Position (vector), 0202 electrical engineering, electronic engineering, information engineering, Data Mining, Drug Interactions, business.industry, Identification (information), 030104 developmental biology, 020201 artificial intelligence & image processing, Artificial intelligence, business, DrugBank, computer, Sentence, Natural language processing, Word (computer architecture)
Abstract: Objective\ud \ud A drug–drug interaction (DDI) is a situation in which a drug affects the activity of another drug synergistically or antagonistically when being administered together. The information of DDIs is crucial for healthcare professionals to prevent adverse drug events. Although some known DDIs can be found in purposely-built databases such as DrugBank, most information is still buried in scientific publications. Therefore, automatically extracting DDIs from biomedical texts is sorely needed.\ud \ud Methods and material\ud \ud In this paper, we propose a novel position-aware deep multi-task learning approach for extracting DDIs from biomedical texts. In particular, sentences are represented as a sequence of word embeddings and position embeddings. An attention-based bidirectional long short-term memory (BiLSTM) network is used to encode each sentence. The relative position information of words with the target drugs in text is combined with the hidden states of BiLSTM to generate the position-aware attention weights. Moreover, the tasks of predicting whether or not two drugs interact with each other and further distinguishing the types of interactions are learned jointly in multi-task learning framework.\ud \ud Results\ud \ud The proposed approach has been evaluated on the DDIExtraction challenge 2013 corpus and the results show that with the position-aware attention only, our proposed approach outperforms the state-of-the-art method by 0.99% for binary DDI classification, and with both position-aware attention and multi-task learning, our approach achieves a micro F-score of 72.99% on interaction type identification, outperforming the state-of-the-art approach by 1.51%, which demonstrates the effectiveness of the proposed approach.
Published: 2018

5. A Characterization Study of Arabic Twitter Data with a Benchmarking for State-of-the-Art Opinion Mining Models

Author: Georges El-Khoury, Ramy Baly, Nizar Habash, Wassim El-Hajj, Khaled Bashir Shaban, Rita Aoun, Hazem Hajj, Rawan Moukalled, and Gilbert Badaro
Subjects: Feature engineering, Morphology, Engineering, Source of noise, Arabic, Performance, computer.software_genre, State of the art, Task (project management), Opinion mining, Sentiment analysis, Dialectal variation, Code-switching, Analytical studies, Data mining, business.industry, Deep learning, Characterization studies, Social networking (online), Linguistics, Benchmarking, Advanced researches, Data science, language.human_language, ComputingMethodologies_PATTERNRECOGNITION, ComputingMethodologies_DOCUMENTANDTEXTPROCESSING, language, Artificial intelligence, Noise (video), State (computer science), business, Linguistic phenomena, computer, Natural language processing
Abstract: Opinion mining in Arabic is a challenging task given the rich morphology of the language. The task becomes more challenging when it is applied to Twitter data, which contains additional sources of noise, such as the use of unstandardized dialectal variations, the non-conformation to grammatical rules, the use of Arabizi and code-switching, and the use of non-text objects such as images and URLs to express opinion. In this paper, we perform an analytical study to observe how such linguistic phenomena vary across different Arab regions. This study of Arabic Twitter characterization aims at providing better understanding of Arabic Tweets, and fostering advanced research on the topic. Furthermore, we explore the performance of the two schools of machine learning on Arabic Twitter, namely the feature engineering approach and the deep learning approach. We consider models that have achieved state-of-the-art performance for opinion mining in English. Results highlight the advantages of using deep learning-based models, and confirm the importance of using morphological abstractions to address Arabic's complex morphology. 2017 Association for Computational Linguistics This work was made possible by NPRP 6-716-1-138 grant from the Qatar National Research Fund (a member of Qatar Foundation). The statements made herein are solely the responsibility of the authors. Scopus
Published: 2017

6. Active Semi-Supervised Learning Method with Hybrid Deep Belief Networks

Author: Shusen Zhou, Qingcai Chen, and Xiaolong Wang
Subjects: Computer Science::Machine Learning, Computer and Information Sciences, Neural Networks, Active learning (machine learning), Computer science, Culture, Boltzmann machine, lcsh:Medicine, Semi-supervised learning, Bioinformatics, Machine learning, computer.software_genre, Pattern Recognition, Automated, Machine Learning, Deep belief network, Artificial Intelligence, Data Mining, lcsh:Science, Natural Language Processing, Internet, Multidisciplinary, Artificial neural network, business.industry, Deep learning, Supervised learning, lcsh:R, Information Processing, Biology and Life Sciences, ComputingMethodologies_PATTERNRECOGNITION, Active learning, Cognitive Science, lcsh:Q, Artificial intelligence, business, Information Technology, computer, Algorithms, Research Article, Neuroscience
Abstract: In this paper, we develop a novel semi-supervised learning algorithm called active hybrid deep belief networks (AHD), to address the semi-supervised sentiment classification problem with deep learning. First, we construct the previous several hidden layers using restricted Boltzmann machines (RBM), which can reduce the dimension and abstract the information of the reviews quickly. Second, we construct the following hidden layers using convolutional restricted Boltzmann machines (CRBM), which can abstract the information of reviews effectively. Third, the constructed deep architecture is fine-tuned by gradient-descent based supervised learning with an exponential loss function. Finally, active learning method is combined based on the proposed deep architecture. We did several experiments on five sentiment classification datasets, and show that AHD is competitive with previous semi-supervised learning algorithm. Experiments are also conducted to verify the effectiveness of our proposed method with different number of labeled reviews and unlabeled reviews respectively.
Published: 2014

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

6 results

1. BioBERT: a pre-trained biomedical language representation model for biomedical text mining

2. Enhanced Word Embedding Variations for the Detection of Substance Abuse and Mental Health Issues on Social Media Writings

3. A survey on fake news and rumour detection techniques

4. Position-aware deep multi-task learning for drug–drug interaction extraction

5. A Characterization Study of Arabic Twitter Data with a Benchmarking for State-of-the-Art Opinion Mining Models

6. Active Semi-Supervised Learning Method with Hybrid Deep Belief Networks

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Database

Publisher

6 results

Search Results

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources