Author: "Ngan, King Ngi" / Topic: visualization - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Ngan, King Ngi"' showing total 10 results

1. POS-Trends Dynamic-Aware Model for Video Caption.

Author: Wang, Lanxiao, Li, Hongliang, Qiu, Heqian, Wu, Qingbo, Meng, Fanman, and Ngan, King Ngi
Subjects: METEORS, FEATURE extraction, VIDEOS, POCKET computers, PROBLEM solving
Abstract: Video captioning aims to generate descriptive sentences about a video, and the most critical problem is how to achieve accurate word prediction with a standardized and coherent syntactic structure, which requires the model to thoroughly understand the video content and precisely map it onto the corresponding sentence components. Many existing methods fuse different video features into a single visual feature for generating sentences. However, they ignore the prior information about words in the dataset annotations (such as part-of-speech, POS) and the association between sentence components and types of visual features. To solve these problems, we propose a POS-trends dynamic-aware model (PDA) that fully exploits the word prior information in the captions to predict POS tags and thereby assist caption generation. We propose a POS feature extraction (PFE) module that uses different filters to extract different POS-trends features, predict POS tags, and fuse visual features. Furthermore, we propose a visual-dynamic-aware (VDA) module to dynamically adjust the mapping of words and supplement the local features with visual information. The fused features provide directional visual information for generating correct words, and the predicted POS tags guide the decoding process toward a more standardized and coherent syntactic structure. Extensive experiments on MSVD, MSR-VTT, and VATEX demonstrate that our method outperforms state-of-the-art methods in BLEU-4, ROUGE-L, METEOR, and CIDEr. Code is available at: https://github.com/WangLanxiao/PDA-for-video-caption. [ABSTRACT FROM AUTHOR]
Published: 2022

2. Subjective and Objective De-Raining Quality Assessment Towards Authentic Rain Image.

Author: Wu, Qingbo, Wang, Lei, Ngan, King Ngi, Li, Hongliang, Meng, Fanman, and Xu, Linfeng
Subjects: AUTHENTIC assessment, RAINFALL, SOURCE code, ALGORITHMS, IMAGE
Abstract: Images acquired by outdoor vision systems easily suffer from poor visibility and annoying interference in rainy weather, which makes it very challenging to accurately understand and describe the visual content. Recent research has devoted great effort to the task of rain removal for improving image visibility. However, there has been very little exploration of the quality assessment of de-rained images, even though it is crucial for accurately measuring the performance of various de-raining algorithms. In this paper, we first create a de-raining quality assessment (DQA) database that collects 206 authentic rain images and their de-rained versions produced by 6 representative single-image rain removal algorithms. Then, a subjective study is conducted on our DQA database, which collects the subject-rated scores of all de-rained images. To quantitatively measure the quality of de-rained images with non-uniform artifacts, we propose a bi-directional feature embedding network (B-FEN) that integrates the features of global perception and local difference. Experiments confirm that the proposed method significantly outperforms many existing universal blind image quality assessment models. To support research toward perceptually preferred de-raining algorithms, we will publicly release our DQA database and B-FEN source code at https://github.com/wqb-uestc. [ABSTRACT FROM AUTHOR]
Published: 2020

3. No-Reference Retargeted Image Quality Assessment Based on Pairwise Rank Learning.

Author: Ma, Lin, Xu, Long, Zhang, Yichi, Yan, Yihua, and Ngan, King Ngi
Abstract: In this paper, we propose a novel no-reference image quality assessment method for retargeted images based on pairwise rank learning. Each retargeted image is first represented as a feature vector that both captures the image characteristics and is sensitive to distortions introduced during the retargeting process. To this end, we investigate different image representations for their ability to depict the perceptual quality of a retargeted image. Based on these image representations, we use pairwise rank learning to discriminate the perceptual quality between retargeted image pairs. Experimental results demonstrate that the proposed method can effectively depict the perceptual quality of retargeted images and can even perform comparably with full-reference quality assessment methods. [ABSTRACT FROM PUBLISHER]
Published: 2016

4. Free-Energy Principle Inspired Video Quality Metric and Its Use in Video Coding.

Author: Xu, Long, Lin, Weisi, Ma, Lin, Zhang, Yongbing, Fang, Yuming, Ngan, King Ngi, Li, Songnan, and Yan, Yihua
Abstract: In this paper, we extend the free-energy principle to video quality assessment (VQA) by incorporating a recent psychophysical study on human visual speed perception (HVSP). A novel video quality metric, namely the free-energy principle inspired video quality metric (FePVQ), is thereby developed and applied to perceptual video coding optimization. The free-energy principle suggests that the human visual system (HVS) can actively predict “orderly” information and avoid “disorderly” information in image perception. Basically, “orderly” is associated with the skeletons and edges of objects, and “disorderly” mostly concerns textures in images. Based on this principle, an image is separated into orderly and disorderly regions, which are processed differently in image quality assessment. For videos, visual attention, or fixation, is associated with objects with significant motion according to HVSP, resulting in a motion strength factor in the FePVQ so that the free-energy principle is extended into the spatio-temporal domain for VQA. In addition, we investigate the application of the FePVQ in perceptual rate distortion optimization (RDO). For this purpose, the FePVQ is realized with low computational cost by using the relative total variation model and the block-wise motion vectors of video coding to simulate the free-energy principle and the HVSP, respectively. The experimental results indicate that the proposed FePVQ is highly consistent with HVS perception. The linear correlation coefficient and Spearman's rank-order correlation coefficient are up to 0.8324 and 0.8281, respectively, on the LIVE video database. Better perceptual quality of encoded video sequences is achieved by FePVQ-motivated RDO in video coding. [ABSTRACT FROM PUBLISHER]
Published: 2016

5. Consistent Visual Quality Control in Video Coding.

Author: Xu, Long, Li, Songnan, Ngan, King Ngi, and Ma, Lin
Subjects: VIDEO coding, IMAGE quality analysis, VIDEO recording, BANDWIDTHS, COMPUTER algorithms, STREAMING technology
Abstract: Visual quality consistency is one of the most important issues in video quality assessment. When people view a sequential video, they may have an unpleasant perceptual experience if the video has an inconsistent visual quality even though the average visual quality of the video is not compromised. Thus, consistent visual quality control is mostly expected in general video encoding with limited channel bandwidth and buffer resources. However, there still has not been enough study on such an issue. In this paper, a new objective visual quality metric (VQM) is proposed first, which can easily be incorporated into video coding for guiding video coding. Second, a VQM-based window model is proposed to handle the tradeoff between visual quality consistency and buffer constraint in video coding. Third, a window-level rate control algorithm is developed to accomplish visual quality control based on the above two proposals. Finally, experimental results prove that consistent visual quality, high rate-distortion efficiency, accurate bit control, and compliant buffer constraint can be achieved by the proposed rate control algorithm. [ABSTRACT FROM PUBLISHER]
Published: 2013

6. Reduced-Reference Video Quality Assessment of Compressed Video Sequences.

Author: Ma, Lin, Li, Songnan, and Ngan, King Ngi
Subjects: VIDEO compression, SEQUENCES (Motion pictures), FEATURE extraction, HISTOGRAMS, DISCRETE cosine transforms, DATABASES
Abstract: In this paper, a novel reduced-reference (RR) video quality assessment (VQA) is proposed by exploiting the spatial information loss and the temporal statistical characteristics of the interframe histogram. From the spatial perspective, an energy variation descriptor (EVD) is proposed to measure the energy change of each individual encoded frame, which results from the quantization process. Besides depicting the energy change, EVD can further simulate the texture masking property of the human visual system (HVS). From the temporal perspective, the generalized Gaussian density (GGD) function is employed to capture the natural statistics of the interframe histogram distribution. The city-block distance (CBD) is used to calculate the histogram distance between the original video sequence and the encoded one. For simplicity, the difference image between adjacent frames is employed to characterize the temporal interframe relationship. By combining the spatial EVD together with the temporal CBD, an efficient RR VQA is developed. Evaluation on the subjective quality video database demonstrates that the proposed method outperforms the representative RR video quality metric and the full-reference VQAs, such as peak signal-to-noise ratio and structure similarity index in matching subjective ratings. This means that the proposed metric is more consistent with the HVS perception. Furthermore, as only a small number of RR features are extracted for representing the original video sequence (each frame requires only one parameter for describing EVD and three parameters for recording GGD), the RR features can be embedded into the video sequences or transmitted through the ancillary data channel, which can be used in the video quality monitoring system. [ABSTRACT FROM PUBLISHER]
Published: 2012

7. Full-Reference Video Quality Assessment by Decoupling Detail Losses and Additive Impairments.

Author: Li, Songnan, Ma, Lin, and Ngan, King Ngi
Subjects: VIDEO archives, COMMUNICATION, MATHEMATICAL decoupling, SPATIAL ability, ELECTRIC distortion, HUMAN behavior
Abstract: Video quality assessment plays a fundamental role in video processing and communication applications. In this paper, we study the use of motion information and temporal human visual system (HVS) characteristics for objective video quality assessment. In our previous work, two types of spatial distortions, i.e., detail losses and additive impairments, are decoupled and evaluated separately for spatial quality assessment. The detail losses refer to the loss of useful visual information that will affect the content visibility, and the additive impairments represent the redundant visual information in the test image, such as the blocking or ringing artifacts caused by data compression and so on. In this paper, a novel full-reference video quality metric is developed, which conceptually comprises the following processing steps: 1) decoupling detail losses and additive impairments within each frame for spatial distortion measure; 2) analyzing the video motion and using the HVS characteristics to simulate the human perception of the spatial distortions; and 3) taking into account cognitive human behaviors to integrate frame-level quality scores into sequence-level quality score. Distinguished from most studies in the literature, the proposed method comprehensively investigates the use of motion information in the simulation of HVS processing, e.g., to model the eye movement, to predict the spatio-temporal HVS contrast sensitivity, to implement the temporal masking effect, and so on. Furthermore, we also prove the effectiveness of decoupling detail losses and additive impairments for video quality assessment. The proposed method is tested on two subjective quality video databases, LIVE and IVP, and demonstrates the state-of-the-art performance in matching subjective ratings. [ABSTRACT FROM AUTHOR]
Published: 2012

8. A Co-Saliency Model of Image Pairs.

Author: Li, Hongliang and Ngan, King Ngi
Subjects: IMAGE registration, FEATURE extraction, ALGORITHMS, PYRAMIDS (Geometry), IMAGE segmentation, SIMILARITY (Geometry), IMAGE processing, PERFORMANCE evaluation
Abstract: In this paper, we introduce a method to detect co-saliency from an image pair that may have some objects in common. The co-saliency is modeled as a linear combination of the single-image saliency map (SISM) and the multi-image saliency map (MISM). The first term is designed to describe the local attention, which is computed by using three saliency detection techniques available in literature. To compute the MISM, a co-multilayer graph is constructed by dividing the image pair into a spatial pyramid representation. Each node in the graph is described by two types of visual descriptors, which are extracted from a representation of some aspects of local appearance, e.g., color and texture properties. In order to evaluate the similarity between two nodes, we employ a normalized single-pair SimRank algorithm to compute the similarity score. Experimental evaluation on a number of image pairs demonstrates the good performance of the proposed method on the co-saliency detection task. [ABSTRACT FROM AUTHOR]
Published: 2011

9. Image Quality Assessment by Separately Evaluating Detail Losses and Additive Impairments.

Author: Li, Songnan, Zhang, Fan, Ma, Lin, and Ngan, King Ngi
Abstract: In the research field of image processing, mean squared error (MSE) and peak signal-to-noise ratio (PSNR) are extensively adopted as the objective visual quality metrics, mainly because of their simplicity for calculation and optimization. However, it has been well recognized that these pixel-based difference measures correlate poorly with the human perception. Inspired by existing works [1], [2], [3], in this paper we propose a novel algorithm which separately evaluates detail losses and additive impairments for image quality assessment. The detail loss refers to the loss of useful visual information which affects the content visibility, and the additive impairment represents the redundant visual information whose appearance in the test image will distract viewer's attention from the useful contents causing unpleasant viewing experience. To separate detail losses and additive impairments, a wavelet-domain decoupling algorithm is developed which can be used for a host of distortion types. Two HVS characteristics, i.e., the contrast sensitivity function and the contrast masking effect, are taken into account to approximate the HVS sensitivities. We propose two simple quality measures to correlate detail losses and additive impairments with visual quality, respectively. Based on the findings in [3] that observers judge low-quality images in terms of the ability to interpret the content, the outputs of the two quality measures are adaptively combined to yield the overall quality index. By conducting experiments based on five subjectively-rated image databases, we demonstrate that the proposed metric has a better or similar performance in matching subjective ratings when compared with the state-of-the-art image quality metrics. [ABSTRACT FROM PUBLISHER]
Published: 2011

10. Reduced-Reference Image Quality Assessment Using Reorganized DCT-Based Image Representation.

Author: Ma, Lin, Li, Songnan, Zhang, Fan, and Ngan, King Ngi
Abstract: In this paper, a novel reduced-reference (RR) image quality assessment (IQA) is proposed by statistical modeling of the discrete cosine transform (DCT) coefficient distributions. In order to reduce the RR data rates and further exploit the identical nature of the coefficient distributions between adjacent DCT subbands, the DCT coefficients are reorganized into a three-level coefficient tree. Subsequently, generalized Gaussian density (GGD) is employed to model the coefficient distribution of each reorganized DCT subband. The city-block distance is employed to measure the difference between the two images. Experimental results demonstrate that only a small number of RR features is sufficient for representing the image perceptual quality. The proposed method outperforms the RR WNISM and even the full-reference (FR) quality metric PSNR. [ABSTRACT FROM PUBLISHER]
Published: 2011
