1. Building siamese attention-augmented recurrent convolutional neural networks for document similarity scoring.
- Author
-
Han, Sifei, Shi, Lingyun, Richie, Russell, and Tsui, Fuchiang R.
- Subjects
- *
CONVOLUTIONAL neural networks , *RECURRENT neural networks , *ARTIFICIAL neural networks , *NATURAL language processing , *JOB resumes - Abstract
• We proposed a new architecture - the Siamese attention-augmented recurrent convolutional neural network (S-ARCNN). • We compared the performance of S-ARCNN with eight popular models for measuring document similarity. • Our model outperformed the state-of-the-art Transformer based model (Sentence BERT) by over 5% in F1. • Simply fitting an optimal decision threshold can significantly improve pre-trained BERT model for the new task. • S-ARCNN performed best in longer question pairs (length > = 50 words). Automatically measuring document similarity is imperative in natural language processing, with applications ranging from recommendation to duplicate document detection. State-of-the-art approach in document similarity commonly involves deep neural networks, yet there is little study on how different architectures may be combined. Thus, we introduce the Siamese Attention-augmented Recurrent Convolutional Neural Network (S-ARCNN) that combines multiple neural network architectures. In each subnetwork of S-ARCNN, a document passes through a bidirectional Long Short-Term Memory (bi-LSTM) layer, which sends representations to local and global document modules. A local document module uses convolution, pooling, and attention layers, whereas a global document module uses last states of the bi-LSTM. Both local and global features are concatenated to form a single document representation. Using the Quora Question Pairs dataset, we evaluated S-ARCNN, Siamese convolutional neural networks (S-CNNs), Siamese LSTM, and two BERT models. While S-CNNs (82.02% F1) outperformed S-ARCNN (79.83% F1) overall, S-ARCNN slightly outperformed S-CNN on duplicate question pairs with more than 50 words (39.96% vs. 39.42% accuracy). With the potential advantage of S-ARCNN for processing longer documents, S-ARCNN may help researchers identify collaborators with similar research interests, help editors find potential reviewers, or match resumes with job descriptions. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF