Search

Your search for "Dong, Jianfeng" returned 35 results

Search Constraints

Author: "Dong, Jianfeng"; Database: arXiv

Search Results

1. Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model

2. Alleviating Hallucination in Large Vision-Language Models with Active Retrieval Augmentation

3. Mitigating Multilingual Hallucination in Large Vision-Language Models

4. Representation Alignment Contrastive Regularization for Multi-Object Tracking

5. Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval

6. CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer

7. Unified Multi-modal Unsupervised Representation Learning for Skeleton-based Action Understanding

8. Video Infringement Detection via Feature Disentanglement and Mutual Information Maximization

9. Dual-view Curricular Optimal Transport for Cross-lingual Cross-modal Retrieval

10. From Region to Patch: Attribute-Aware Foreground-Background Contrastive Learning for Fine-Grained Fashion Retrieval

11. Transform-Equivariant Consistency Learning for Temporal Sentence Grounding

12. Hierarchical Contrast for Unsupervised Skeleton-based Action Representation Learning

13. Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning

14. Partially Relevant Video Retrieval

15. Reading-strategy Inspired Visual Representation Learning for Text-to-Video Retrieval

16. Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval

17. Adaptive Proposal Generation Network for Temporal Sentence Localization in Videos

18. Fine-Grained Fashion Similarity Prediction by Attribute-Specific Embedding Learning

19. Context-aware Biaffine Localizing Network for Temporal Sentence Grounding

20. Hierarchical Similarity Learning for Language-based Product Image Retrieval

21. Progressive Localization Networks for Language-based Moment Localization

22. Dual Encoding for Video Retrieval by Text

23. Fine-grained Iterative Attention Network for Temporal Language Localization in Videos

24. Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization

25. Tree-Augmented Cross-Modal Encoding for Complex-Query Video Retrieval

26. Feature Re-Learning with Data Augmentation for Video Relevance Prediction

27. Fine-Grained Fashion Similarity Learning by Attribute-Specific Embedding Network

28. Dual Encoding for Zero-Example Video Retrieval

29. Exploring Human-like Attention Supervision in Visual Question Answering

30. Predicting Visual Features from Text for Image and Video Caption Retrieval

31. Cross-Media Similarity Evaluation for Web Image Retrieval in the Wild

32. Fluency-Guided Cross-Lingual Image Captioning

33. Learning Deep Representations Using Convolutional Auto-encoders with Symmetric Skip Connections

34. Word2VisualVec: Image and Video to Sentence Matching by Visual Feature Prediction

35. Negative refractive index due to chirality
