Author: "Tian, Qi" / Search Limiters: Peer Reviewed / Topic: artificial neural networks and feature extraction - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Tian, Qi"' showing total 5 results


1. Learning Affective Features With a Hybrid Deep Model for Audio–Visual Emotion Recognition.

Author: Zhang, Shiqing; Zhang, Shiliang; Huang, Tiejun; Gao, Wen; and Tian, Qi
Subjects: EMOTION recognition, DEEP learning, AUDIOVISUAL materials, FEATURE extraction, IMAGE processing, ARTIFICIAL neural networks
Abstract: Emotion recognition is challenging due to the emotional gap between emotions and audio–visual features. Motivated by the powerful feature learning ability of deep neural networks, this paper proposes to bridge the emotional gap by using a hybrid deep model, which first produces audio–visual segment features with Convolutional Neural Networks (CNNs) and a 3D-CNN, then fuses the audio–visual segment features in a Deep Belief Network (DBN). The proposed method is trained in two stages. First, CNN and 3D-CNN models pre-trained on corresponding large-scale image and video classification tasks are fine-tuned on emotion recognition tasks to learn audio and visual segment features, respectively. Second, the outputs of the CNN and 3D-CNN models are combined in a fusion network built with a DBN model. The fusion network is trained to jointly learn a discriminative audio–visual segment feature representation. After average-pooling the segment features learned by the DBN to form a fixed-length global video feature, a linear Support Vector Machine is used for video emotion classification. Experimental results on three public audio–visual emotional databases, including the acted RML database, the acted eNTERFACE05 database, and the spontaneous BAUM-1s database, demonstrate the promising performance of the proposed method. To the best of our knowledge, this is an early work fusing audio and visual cues with CNN, 3D-CNN, and DBN for audio–visual emotion recognition. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

2. SIFT Meets CNN: A Decade Survey of Instance Retrieval.

Author: Zheng, Liang; Yang, Yi; and Tian, Qi
Subjects: CONTENT-based image retrieval, SIGNAL convolution, ARTIFICIAL neural networks, DETECTORS, DATA visualization
Abstract: In the early days, content-based image retrieval (CBIR) was studied with global features. Since 2003, image retrieval based on local descriptors (de facto SIFT) has been extensively studied for over a decade due to the advantage of SIFT in dealing with image transformations. Recently, image representations based on the convolutional neural network (CNN) have attracted increasing interest in the community and demonstrated impressive performance. Given this time of rapid evolution, this article provides a comprehensive survey of instance retrieval over the last decade. Two broad categories, SIFT-based and CNN-based methods, are presented. For the former, we organize the literature by codebook size: large, medium-sized, or small. For the latter, we discuss three lines of methods: those using pre-trained CNN models, those using fine-tuned CNN models, and hybrid methods. The first two perform a single pass of an image through the network, while the last category employs a patch-based feature extraction scheme. This survey presents milestones in modern instance retrieval, reviews a broad selection of previous works in different categories, and provides insights on the connection between SIFT and CNN-based methods. After analyzing and comparing retrieval performance of different categories on several datasets, we discuss promising directions towards generic and specialized instance retrieval. [ABSTRACT FROM PUBLISHER]
Published: 2018
Full Text: View/download PDF

3. Collaborative Index Embedding for Image Retrieval.

Author: Zhou, Wengang; Li, Houqiang; Sun, Jian; and Tian, Qi
Subjects: IMAGE retrieval, SIGNAL convolution, ARTIFICIAL neural networks, FEATURE extraction, MATHEMATICAL optimization, ACCURACY
Abstract: In content-based image retrieval, the SIFT feature and the feature from a deep convolutional neural network (CNN) have demonstrated promising performance. To fully explore both visual features in a unified framework for effective and efficient retrieval, we propose a collaborative index embedding method to implicitly integrate their index matrices. We formulate the index embedding as an optimization problem from the perspective of neighborhood sharing and solve it with an alternating index update scheme. After the iterative embedding, only the embedded CNN index is kept for on-line query, which demonstrates significant gain in retrieval accuracy with very economical memory cost. Extensive experiments have been conducted on public datasets with million-scale distractor images. The experimental results reveal that, compared with recent state-of-the-art retrieval algorithms, our approach achieves competitive accuracy with less memory overhead and efficient query computation. [ABSTRACT FROM PUBLISHER]
Published: 2018
Full Text: View/download PDF

4. Accurate Image Search with Multi-Scale Contextual Evidences.

Author: Zheng, Liang; Wang, Shengjin; Wang, Jingdong; and Tian, Qi
Subjects: IMAGE segmentation, ARTIFICIAL neural networks, FEATURE extraction, IMAGE registration, PROBABILITY theory
Abstract: This paper considers the task of image search using the Bag-of-Words (BoW) model. In this model, the precision of visual matching plays a critical role. Conventionally, local cues of a keypoint, e.g., SIFT, are employed. However, such a strategy does not consider the contextual evidence of a keypoint, a problem which would lead to the prevalence of false matches. To address this problem and enable accurate visual matching, this paper proposes to integrate discriminative cues from multiple contextual levels, i.e., local, regional, and global, via probabilistic analysis. A 'true match' is defined as a pair of keypoints corresponding to the same scene location on all three levels (Fig. 1). Specifically, the Convolutional Neural Network (CNN) is employed to extract features from regional and global patches. We show that the CNN feature is complementary to SIFT due to its semantic awareness and compares favorably to several other descriptors such as GIST, HSV, etc. To reduce memory usage, we propose to index CNN features outside the inverted file, communicated by memory-efficient pointers. Experiments on three benchmark datasets demonstrate that our method greatly improves search accuracy when the CNN feature is integrated. We show that our method is efficient in terms of time cost compared with the BoW baseline, and yields accuracy competitive with the state of the art. [ABSTRACT FROM AUTHOR]
Published: 2016
Full Text: View/download PDF

5. GaitGMT: Global feature mapping transformer for gait recognition.

Author: Chen, Guilong; Huang, Jiayi; Chen, Guanghai; Chen, Xin; Deng, Xiaoling; Lan, Yubin; Long, Yongbing; and Tian, Qi
Subjects: GAIT in humans, IMAGE recognition (Computer vision), DIGITAL image processing, CRIMINAL investigation, ARTIFICIAL neural networks, FEATURE extraction
Abstract: Gait recognition is an important biometric technology that allows for the remote collection of subjects' characteristics, without requiring their explicit cooperation. It has gained considerable attention in the fields of criminal investigation and intelligent security. Previous studies have shown that local gait features can enhance gait recognition performance by improving robustness to disturbances. However, global gait features also play a crucial role in gait recognition. Many researchers have utilized convolutional operations to extract global features, but these operations tend to focus on features within the receptive field, neglecting those outside of it. Therefore, the potential of global gait features has not been fully explored. In this paper, we propose a gait recognition framework based on vision transformers, aiming to enhance the extraction of global gait features. We introduce an adaptive multi-frame global feature mapping (AMGM) method to address the challenge of inconsistent feature dimensions caused by variations in the number of gait frames when fusing global and local features. We evaluate our model on the latest datasets, and the experimental results demonstrate a significant breakthrough. Notably, our model achieves state-of-the-art recognition accuracy, particularly in scenarios where subjects are wearing coats. Additionally, our model achieves remarkable improvements in recognition accuracy when trained with small sample sets. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF
