Search

Showing total 49 results
49 results

Search Results

1. Dual Encoding for Video Retrieval by Text.

2. Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey.

3. GPCA: A Probabilistic Framework for Gaussian Process Embedded Channel Attention.

4. How to Trust Unlabeled Data? Instance Credibility Inference for Few-Shot Learning.

5. Cross-Modal Progressive Comprehension for Referring Segmentation.

6. Two-Branch Relational Prototypical Network for Weakly Supervised Temporal Action Localization.

7. Attention in Attention Networks for Person Retrieval.

8. Self-Supervised Video Representation Learning by Uncovering Spatio-Temporal Statistics.

9. Referring Segmentation in Images and Videos With Cross-Modal Self-Attention Network.

10. Transferable Interactiveness Knowledge for Human-Object Interaction Detection.

11. OANet: Learning Two-View Correspondences and Geometry Using Order-Aware Network.

12. Learning Visual Instance Retrieval from Failure: Efficient Online Local Metric Adaptation from Negative Samples.

13. Unsupervised Learning of a Hierarchical Spiking Neural Network for Optical Flow Estimation: From Events to Global Motion Perception.

14. Revisiting Image-Language Networks for Open-Ended Phrase Detection.

15. Visual Grounding Via Accumulated Attention.

16. Power Normalizations in Fine-Grained Image, Few-Shot Image and Graph Classification.

17. P-CNN: Part-Based Convolutional Neural Networks for Fine-Grained Visual Categorization.

18. Hierarchical Deep Click Feature Prediction for Fine-Grained Image Recognition.

19. Extraction of an Explanatory Graph to Interpret a CNN.

20. Interpretable CNNs for Object Classification.

21. Relationship-Embedded Representation Learning for Grounding Referring Expressions.

22. Visual Scanpath Prediction Using IOR-ROI Recurrent Mixture Density Network.

23. Crafting GBD-Net for Object Detection.

24. Ordered or Orderless: A Revisit for Video Based Person Re-Identification.

25. Visual Tracking via Dynamic Memory Networks.

26. Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks.

27. Structured Label Inference for Visual Understanding.

28. Robust Visual Tracking via Hierarchical Convolutional Features.

29. Transferring Knowledge Fragments for Learning Distance Metric from a Heterogeneous Domain.

30. Learning Two-Branch Neural Networks for Image-Text Matching Tasks.

31. Head and Body Orientation Estimation Using Convolutional Random Projection Forests.

32. ELD-Net: An Efficient Deep Learning Architecture for Accurate Saliency Detection.

33. A Novel Linelet-Based Representation for Line Segment Detection.

34. Aligning Where to See and What to Tell: Image Captioning with Region-Based Attention and Scene-Specific Contexts.

35. Visually Grounded Meaning Representations.

36. Jointly Learning Heterogeneous Features for RGB-D Activity Recognition.

37. Cross-Convolutional-Layer Pooling for Image Recognition.

38. Video2vec Embeddings Recognize Events When Examples Are Scarce.

39. HOTS: A Hierarchy of Event-Based Time-Surfaces for Pattern Recognition.

40. Higher-Order Occurrence Pooling for Bags-of-Words: Visual Concept Detection.

41. Dynamic Scene Recognition with Complementary Spatiotemporal Features.

42. Max-Margin Action Prediction Machine.

43. Adopting Abstract Images for Semantic Scene Understanding.

44. One Shot Detection with Laplacian Object and Fast Matrix Cosine Similarity.

45. Weakly Supervised Large Scale Object Localization with Multiple Instance Learning and Bag Splitting.

46. Scalable Feature Matching by Dual Cascaded Scalar Quantization for Image Retrieval.

47. Multi-Camera Saliency.

48. Single-Pedestrian Detection Aided by Two-Pedestrian Detection.

49. Image Geo-Localization Based on MultipleNearest Neighbor Feature Matching UsingGeneralized Graphs.