1. Collaborative Learning for Answer Selection in Question Answering
- Author
- Pengfei Zhang, Honghui Chen, Xiaoyan Kui, and Taihua Shao
- Subjects
- General Computer Science, Computer science, Collaborative learning, Convolutional neural network, Knowledge extraction, Answer selection, Question answering, Selection (linguistics), General Materials Science, Natural language processing, Artificial neural network, Deep learning, General Engineering, Task analysis, Embedding, Artificial intelligence, Sentence
- Abstract
Answer selection is an essential step in a question answering (QA) system. Traditional methods for this task mainly focus on developing linguistic features, which are of limited use in practice. With the great success of deep learning in distributed text representation, deep learning-based answer selection approaches have been widely investigated. However, most of them employ only one neural network, i.e., a convolutional neural network (CNN) or a long short-term memory (LSTM) network, which limits their ability to extract rich sentence features. In this paper, we therefore propose a collaborative learning-based answer selection model (QA-CL), in which a parallel training architecture lets a CNN and a bidirectional LSTM (BiLSTM) collaboratively learn from the initial word vector matrix of a sentence at the same time. In addition, we extend the model by incorporating the sentence embedding generated by QA-CL into a joint distributed sentence representation using a strong unsupervised baseline, weight removal (WR), yielding the QA-CLWR model. We evaluate our proposals on a popular QA dataset, InsuranceQA. The experimental results indicate that the proposed answer selection methods outperform several strong baselines. Finally, we investigate the models' performance with respect to different question types and find that question types with a medium number of questions achieve better and more stable performance than those with too many or too few questions.
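Below is a minimal sketch of the parallel CNN + BiLSTM encoder described in the abstract, written in PyTorch. The class name, kernel sizes, hidden dimension, and pooling choices are illustrative assumptions rather than details taken from the paper, and the WR-based QA-CLWR extension is not shown.

```python
# Hypothetical sketch of a parallel CNN + BiLSTM sentence encoder for answer
# selection; hyperparameters and names are assumptions, not from the paper.
import torch
import torch.nn as nn

class CollaborativeEncoder(nn.Module):
    def __init__(self, vocab_size, emb_dim=300, hidden=150, kernel_sizes=(2, 3, 5)):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        # CNN branch: one 1-D convolution per kernel size over the word matrix
        self.convs = nn.ModuleList(
            nn.Conv1d(emb_dim, hidden, k, padding=k // 2) for k in kernel_sizes
        )
        # BiLSTM branch over the same word vector matrix
        self.bilstm = nn.LSTM(emb_dim, hidden, batch_first=True, bidirectional=True)

    def forward(self, token_ids):
        x = self.embed(token_ids)                       # (batch, seq, emb_dim)
        # CNN features: max-pool each convolution over time, then concatenate
        c = torch.cat(
            [conv(x.transpose(1, 2)).amax(dim=2) for conv in self.convs], dim=1
        )
        # BiLSTM features: max-pool the hidden states over time
        h, _ = self.bilstm(x)                           # (batch, seq, 2 * hidden)
        r = h.amax(dim=1)
        # Joint sentence embedding from both branches
        return torch.cat([c, r], dim=1)

# A question and each candidate answer would be encoded with this module and
# scored, e.g., by cosine similarity between their sentence embeddings.
```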
- Published
- 2019