Author: "Davis, Larry S." / Database: Complementary Index - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Davis, Larry S."' showing total 134 results

Start Over Author "Davis, Larry S." Database Complementary Index

134 results on '"Davis, Larry S."'

1. Scale Normalized Image Pyramids With AutoFocus for Object Detection.

Author: Singh, Bharat, Najibi, Mahyar, Sharma, Abhishek, and Davis, Larry S.
Subjects: OBJECT recognition (Computer vision), PYRAMIDS, FEATURE extraction
Abstract: We present an efficient foveal framework to perform object detection. A scale normalized image pyramid (SNIP) is generated that, like human vision, only attends to objects within a fixed size range at different scales. Such a restriction of objects’ size during training affords better learning of object-sensitive filters, and therefore, results in better accuracy. However, the use of an image pyramid increases the computational cost. Hence, we propose an efficient spatial sub-sampling scheme which only operates on fixed-size sub-regions likely to contain objects (as object locations are known during training). The resulting approach, referred to as Scale Normalized Image Pyramid with Efficient Resampling or SNIPER, yields up to 3× speed-up during training. Unfortunately, as object locations are unknown during inference, the entire image pyramid still needs processing. To this end, we adopt a coarse-to-fine approach, and predict the locations and extent of object-like regions which will be processed in successive scales of the image pyramid. Intuitively, it's akin to our active human-vision that first skims over the field-of-view to spot interesting regions for further processing and only recognizes objects at the right resolution. The resulting algorithm is referred to as AutoFocus and results in a 2.5-5× speed-up during inference when used with SNIP. Code: https://github.com/mahyarnajibi/SNIPER. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

2. A Dynamic Frame Selection Framework for Fast Video Recognition.

Author: Wu, Zuxuan, Li, Hengduo, Xiong, Caiming, Jiang, Yu-Gang, and Davis, Larry S.
Subjects: REWARD (Psychology), FRAMES (Social sciences), DECISION making, REINFORCEMENT learning
Abstract: We introduce AdaFrame, a conditional computation framework that adaptively selects relevant frames on a per-input basis for fast video recognition. AdaFrame, which contains a Long Short-Term Memory augmented with a global memory to provide context information, operates as an agent to interact with video sequences aiming to search over time which frames to use. Trained with policy search methods, at each time step, AdaFrame computes a prediction, decides where to observe next, and estimates a utility, i.e., expected future rewards, of viewing more frames in the future. Exploring predicted utilities at testing time, AdaFrame is able to achieve adaptive lookahead inference so as to minimize the overall computational cost without incurring a degradation in accuracy. We conduct extensive experiments on two large-scale video benchmarks, FCVID and ActivityNet. With a vanilla ResNet-101 model, AdaFrame achieves similar performance of using all frames while only requiring, on average, 8.21 and 8.65 frames on FCVID and ActivityNet, respectively. We also demonstrate AdaFrame is compatible with modern 2D and 3D networks for video recognition. Furthermore, we show, among other things, learned frame usage can reflect the difficulty of making prediction decisions both at instance-level within the same class and at class-level among different categories. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

3. A Coarse-to-Fine Framework for Resource Efficient Video Recognition.

Author: Wu, Zuxuan, Li, Hengduo, Zheng, Yingbin, Xiong, Caiming, Jiang, Yu-Gang, and Davis, Larry S
Subjects: VIDEOS, DEFAULT (Finance), STREAMING video & television, CLASSIFICATION
Abstract: Deep neural networks have demonstrated remarkable recognition results on video classification, however great improvements in accuracies come at the expense of large amounts of computational resources. In this paper, we introduce LiteEval for resource efficient video recognition. LiteEval is a coarse-to-fine framework that dynamically allocates computation on a per-video basis, and can be deployed in both online and offline settings. Operating by default on low-cost features that are computed with images at a coarse scale, LiteEval adaptively determines on-the-fly when to read in more discriminative yet computationally expensive features. This is achieved by the interactions of a coarse RNN and a fine RNN, together with a conditional gating module that automatically learns when to use more computation conditioned on incoming frames. We conduct extensive experiments on three large-scale video benchmarks, FCVID, ActivityNet and Kinetics, and demonstrate, among other things, that LiteEval offers impressive recognition performance while using significantly less computation for both online and offline settings. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

4. A Generic Improvement to Deep Residual Networks Based on Gradient Flow.

Author: Santhanam, Venkataraman and Davis, Larry S.
Subjects: COMPUTER architecture
Abstract: Preactivation ResNets consistently outperforms the original postactivation ResNets on the CIFAR10/100 classification benchmark. However, these results surprisingly do not carry over to the standard ImageNet benchmark. First, we theoretically analyze this incongruity in terms of how the two variants differ in handling the propagation of gradients. Although identity shortcuts are critical in both variants for improving optimization and performance, we show that postactivation variants enable early layers to receive a diverse dynamic composition of gradients from effectively deeper paths in comparison to preactivation variants, enabling the network to make maximal use of its representational capacity. Second, we show that downsampling projections (while only a few in number) have a significantly detrimental effect on performance. We show that by simply replacing downsampling projections with identitylike dense-reshape shortcuts, the classification results of standard residual architectures such as ResNets, ResNeXts, and SE-Nets improve by up to 1.2% on ImageNet, without any increase in computational complexity (FLOPs). [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

5. Truncated Cauchy Non-Negative Matrix Factorization.

Author: Guan, Naiyang, Liu, Tongliang, Zhang, Yangmuzi, Tao, Dacheng, and Davis, Larry S.
Subjects: EUCLIDEAN algorithm, DATA corruption, MATRICES (Mathematics), SUBSPACES (Mathematics), APPROXIMATION theory
Abstract: Non-negative matrix factorization (NMF) minimizes the euclidean distance between the data matrix and its low rank approximation, and it fails when applied to corrupted data because the loss function is sensitive to outliers. In this paper, we propose a Truncated CauchyNMF loss that handle outliers by truncating large errors, and develop a Truncated CauchyNMF to robustly learn the subspace on noisy datasets contaminated by outliers. We theoretically analyze the robustness of Truncated CauchyNMF comparing with the competing models and theoretically prove that Truncated CauchyNMF has a generalization bound which converges at a rate of order $O(\sqrt{{\ln n}/{n}})$ , where $n$ is the sample size. We evaluate Truncated CauchyNMF by image clustering on both simulated and real datasets. The experimental results on the datasets containing gross corruptions validate the effectiveness and robustness of Truncated CauchyNMF for learning robust subspaces. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

6. Object detection in 20 questions.

Author: Chen, Xi Stephen, He, and Davis, Larry S.
Published: 2016
Full Text: View/download PDF

7. Weakly Supervised Learning of Heterogeneous Concepts in Videos.

Author: Shah, Sohil, Kulkarni, Kuldeep, Biswas, Arijit, Gandhi, Ankit, Deshmukh, Om, and Davis, Larry S.
Published: 2016
Full Text: View/download PDF

8. Modeling Context Between Objects for Referring Expression Understanding.

Author: Nagaraja, Varun K., Morariu, Vlad I., and Davis, Larry S.
Published: 2016
Full Text: View/download PDF

9. Adaptive Resource Management for Sensor Fusion in Visual Tracking.

Author: Han, Bohyung, Joo, Seong-Wook, and Davis, Larry S.
Published: 2016
Full Text: View/download PDF

10. Generating Discriminative Object Proposals via Submodular Ranking.

Author: Zhang, Yangmuzi, Jiang, Zhuolin, Chen, Xi, and Davis, Larry S.
Published: 2016
Full Text: View/download PDF

11. G-CNN: An Iterative Grid Based Object Detector.

Author: Najibi, Mahyar, Rastegari, Mohammad, and Davis, Larry S.
Published: 2016
Full Text: View/download PDF

12. Learning Temporal Regularity in Video Sequences.

Author: Hasan, Mahmudul, Choi, Jonghyun, Neumann, Jan, Roy-Chowdhury, Amit K., and Davis, Larry S.
Published: 2016
Full Text: View/download PDF

13. Joint Human Detection and Head Pose Estimation via Multistream Networks for RGB-D Videos.

Author: Zhang, Guyue, Liu, Jun, Li, Hengduo, Chen, Yan Qiu, and Davis, Larry S.
Subjects: POSE estimation (Computer vision), MACHINE learning, SCALE invariance (Statistical physics)
Abstract: We propose a multistream multitask deep network for joint human detection and head pose estimation in RGB-D videos. To achieve high accuracy, we jointly utilize appearance, shape, and motion information as inputs. Based on the depth information, we generate scale invariant proposals, which are then fed into a novel contextual region of interest pooling (CRP) layer in our deep network. This CRP has two branches to deal with contextual information for each subject. The proposed method outperforms state-of-the-art approaches on three public datasets. [ABSTRACT FROM AUTHOR]
Published: 2017
Full Text: View/download PDF

14. VRFP: On-the-Fly Video Retrieval Using Web Images and Fast Fisher Vector Products.

Author: Han, Xintong, Singh, Bharat, Morariu, Vlad I., and Davis, Larry S.
Abstract: On-the-fly video retrieval using web images and fast Fisher Vector products (VRFP) is a real-time video retrieval framework based on short text input queries, which obtains weakly labeled training images from the web after the query is known. The retrieved web images representing the query and each database video are treated as unordered collections of images, and each collection is represented using a single Fisher Vector built on CNN features. Our experiments show that a Fisher Vector is robust to noise present in web images and compares favorably in terms of accuracy to other standard representations. While a Fisher Vector can be constructed efficiently for a new query, matching against the test set is slow due to its high dimensionality. To perform matching in real time, we present a lossless algorithm that accelerates the inner product computation between high-dimensional Fisher Vectors. We prove that the expected number of multiplications required decreases quadratically with the sparsity of Fisher Vectors. We are not only able to construct and apply query models in real time, but with the help of a simple reranking scheme, we also outperform state-of-the-art automatic retrieval methods by a significant margin on TRECVID MED13 (3.5%), MED14 (1.3%), and CCV datasets (5.2%). We also provide a direct comparison on standard datasets between two different paradigms for automatic video retrieval: zero-shot learning and on-the-fly retrieval. [ABSTRACT FROM PUBLISHER]
Published: 2017
Full Text: View/download PDF

15. Introduction to the Special Section on Deep Learning for Visual Surveillance.

Author: Porikli, Fatih, Davis, Larry S., Wang, Qi, Li, Yi, and Regazzoni, Carlo
Subjects: VISUAL learning, VIDEO surveillance, STREAMING video & television, DEEP learning, ANIMAL behavior
Abstract: We are now living in an era of visual information where data is unceasingly generated and pushed into consumption at astounding rates. A remarkable portion of this sensory input comes in the form of videos streaming from large-scale surveillance infrastructures as well as consumer-grade monitoring systems. The sheer amount of ground-based, aerial and mobile video surveillance data demands fittingly competent, accurate, effective techniques to extract useful cues and provide assistance for detection, prevention, and intervention tasks in traffic, safety, security, defense, forensic, health, biology, ethology, and retail space management applications. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

16. Submodular Reranking with Multiple Feature Modalities for Image Retrieval.

Author: Yang, Fan, Jiang, Zhuolin, and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

17. PSPGC: Part-Based Seeds for Parametric Graph-Cuts.

Author: Singh, Bharat, Han, Xintong, Wu, Zhe, and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

18. Jointly Learning Dictionaries and Subspace Structure for Video-Based Face Recognition.

Author: Zhang, Guangxiao, He, Ran, and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

19. Joint Learning for Attribute-Consistent Person Re-Identification.

Author: Khamis, Sameh, Kuo, Cheng-Hao, Singh, Vivek K., Shet, Vinay D., and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

20. Vehicle detection using PLS Hough transform.

Author: Takeuchi, Remma, Kato, Kunihito, Harwood, David, and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

21. Selecting Relevant Web Trained Concepts for Automated Event Retrieval.

Author: Singh, Bharat, Han, Xintong, Wu, Zhe, Morariu, Vlad I., and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

22. Multi-Task Learning with Low Rank Attribute Embedding for Person Re-Identification.

Author: Su, Chi, Yang, Fan, Zhang, Shiliang, Tian, Qi, Davis, Larry S., and Gao, Wen
Published: 2015
Full Text: View/download PDF

23. Selective Encoding for Recognizing Unreliably Localized Faces.

Author: Li, Ang, Morariu, Vlad I., and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

24. Class consistent multi-modal fusion with binary features.

Author: Shrivastava, Ashish, Rastegari, Mohammad, Shekhar, Sumit, Chellappa, Rama, and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

25. Walking and talking: A bilinear approach to multi-label action recognition.

Author: Khamis, Sameh and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

26. Exploiting local features from deep networks for image retrieval.

Author: Ng, Joe Yue-Hei, Yang, Fan, and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

27. Feedback Loop Between High Level Semantics and Low Level Vision.

Author: Nagaraja, Varun K., Morariu, Vlad I., and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

28. Clauselets: Leveraging Temporally Related Actions for Video Event Analysis.

Author: Lee, Hyungtae, Morariu, Vlad I., and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

29. Re-ranking by Multi-feature Fusion with Diffusion for Image Retrieval.

Author: Yang, Fan, Matei, Bogdan, and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

30. Hierarchical Spherical Hashing for Compressing High Dimensional Vectors.

Author: Bondugula, Sravanthi and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

31. Unsupervised Feature Extraction Inspired by Latent Low-Rank Representation.

Author: Wang, Yaming, Morariu, Vlad I., and Davis, Larry S.
Published: 2015
Full Text: View/download PDF

32. Joint Image Clustering and Labeling by Matrix Factorization.

Author: Hong, Seunghoon, Choi, Jonghyun, Feyereisl, Jan, Han, Bohyung, and Davis, Larry S.
Subjects: ALGORITHM research, IMAGE processing, IMAGE databases, LEARNING, SEARCH engines
Abstract: We propose a novel algorithm to cluster and annotate a set of input images jointly, where the images are clustered into several discriminative groups and each group is identified with representative labels automatically. For these purposes, each input image is first represented by a distribution of candidate labels based on its similarity to images in a labeled reference image database. A set of these label-based representations are then refined collectively through a non-negative matrix factorization with sparsity and orthogonality constraints; the refined representations are employed to cluster and annotate the input images jointly. The proposed approach demonstrates performance improvements in image clustering over existing techniques, and illustrates competitive image labeling accuracy in both quantitative and qualitative evaluation. In addition, we extend our joint clustering and labeling framework to solving the weakly-supervised image classification problem and obtain promising results. [ABSTRACT FROM PUBLISHER]
Published: 2016
Full Text: View/download PDF

33. Multi-Directional Multi-Level Dual-Cross Patterns for Robust Face Recognition.

Author: Ding, Changxing, Choi, Jonghyun, Tao, Dacheng, and Davis, Larry S.
Subjects: HUMAN facial recognition software, ROBUST control, IMAGE recognition (Computer vision), TEXTURE analysis (Image processing), BINARY control systems
Abstract: To perform unconstrained face recognition robust to variations in illumination, pose and expression, this paper presents a new scheme to extract “Multi-Directional Multi-Level Dual-Cross Patterns” (MDML-DCPs) from face images. Specifically, the MDML-DCPs scheme exploits the first derivative of Gaussian operator to reduce the impact of differences in illumination and then computes the DCP feature at both the holistic and component levels. DCP is a novel face image descriptor inspired by the unique textural structure of human faces. It is computationally efficient and only doubles the cost of computing local binary patterns, yet is extremely robust to pose and expression variations. MDML-DCPs comprehensively yet efficiently encodes the invariant characteristics of a face image from multiple levels into patterns that are highly discriminative of inter-personal differences but robust to intra-personal variations. Experimental results on the FERET, CAS-PERL-R1, FRGC 2.0, and LFW databases indicate that DCP outperforms the state-of-the-art local descriptors (e.g., LBP, LTP, LPQ, POEM, tLBP, and LGXP) for both face identification and face verification tasks. More impressively, the best performance is achieved on the challenging LFW and FRGC 2.0 databases by deploying MDML-DCPs in a simple recognition scheme. [ABSTRACT FROM PUBLISHER]
Published: 2016
Full Text: View/download PDF

34. Composite Discriminant Factor analysis.

Author: Morariu, Vlad I., Ahmed, Ejaz, Santhanam, Venkataraman, Harwood, David, and Davis, Larry S.
Published: 2014
Full Text: View/download PDF

35. Object co-labeling in multiple images.

Author: Chen, Xi, Jain, Arpit, and Davis, Larry S.
Published: 2014
Full Text: View/download PDF

36. Online discriminative dictionary learning for visual tracking.

Author: Yang, Fan, Jiang, Zhuolin, and Davis, Larry S.
Published: 2014
Full Text: View/download PDF

37. Toward Sparse Coding on Cosine Distance.

Author: Choi, Jonghyun, Cho, Hyunjong, Kwac, Jungsuk, and Davis, Larry S.
Published: 2014
Full Text: View/download PDF

38. Robust Pose Features for Action Recognition.

Author: Lee, Hyungtae, Morariu, Vlad I., and Davis, Larry S.
Published: 2014
Full Text: View/download PDF

39. Planar Structure Matching under Projective Uncertainty for Geolocation.

Author: Li, Ang, Morariu, Vlad I., and Davis, Larry S.
Published: 2014
Full Text: View/download PDF

40. Jointly Optimizing 3D Model Fitting and Fine-Grained Classification.

Author: Lin, Yen-Liang, Morariu, Vlad I., Hsu, Winston, and Davis, Larry S.
Published: 2014
Full Text: View/download PDF

41. Unsupervised Abnormal Crowd Activity Detection Using Semiparametric Scan Statistic.

Author: Hu, Yang, Zhang, Yangmuzi, and Davis, Larry S.
Published: 2013
Full Text: View/download PDF

42. Representing Videos Using Mid-level Discriminative Patches.

Author: Jain, Arpit, Gupta, Abhinav, Rodriguez, Mikel, and Davis, Larry S.
Published: 2013
Full Text: View/download PDF

43. Submodular Salient Region Detection.

Author: Jiang, Zhuolin and Davis, Larry S.
Published: 2013
Full Text: View/download PDF

44. Adding Unlabeled Samples to Categories by Learned Attributes.

Author: Choi, Jonghyun, Rastegari, Mohammad, Farhadi, Ali, and Davis, Larry S.
Published: 2013
Full Text: View/download PDF

45. Learning Structured Low-Rank Representations for Image Classification.

Author: Zhang, Yangmuzi, Jiang, Zhuolin, and Davis, Larry S.
Published: 2013
Full Text: View/download PDF

46. Discriminative Dictionary Learning with Pairwise Constraints.

Author: Guo, Huimin, Jiang, Zhuolin, and Davis, Larry S.
Published: 2013
Full Text: View/download PDF

47. Online Semi-Supervised Discriminative Dictionary Learning for Sparse Representation.

Author: Zhang, Guangxiao, Jiang, Zhuolin, and Davis, Larry S.
Published: 2013
Full Text: View/download PDF

48. Domain adaptive object detection.

Author: Mirrashed, Fatemeh, Morariu, Vlad I., Siddiquie, Behjat, Feris, Rogerio S., and Davis, Larry S.
Abstract: We study the use of domain adaptation and transfer learning techniques as part of a framework for adaptive object detection. Unlike recent applications of domain adaptation work in computer vision, which generally focus on image classification, we explore the problem of extreme class imbalance present when performing domain adaptation for object detection. The main difficulty caused by this imbalance is that test images contain millions or billions of negative image subwindows but just a few image subwindows containing positive instances, which makes it difficult to adapt to changes in the positive classes present new domains by simple techniques such as random sampling. We propose an initial approach to addressing this problem and apply our technique to vehicle detection in a challenging urban surveillance dataset, demonstrating the performance of our approach with various amounts of supervision, including the fully unsupervised case. [ABSTRACT FROM PUBLISHER]
Published: 2013
Full Text: View/download PDF

49. Sampling for unsupervised domain adaptive object detection.

Author: Mirrashed, Fatemeh, Morariu, Vlad I., and Davis, Larry S.
Published: 2013
Full Text: View/download PDF

50. Qualitative Pose Estimation by Discriminative Deformable Part Models.

Author: Lee, Hyungtae, Morariu, Vlad I., and Davis, Larry S.
Published: 2013
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

134 results on '"Davis, Larry S."'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources