Search

Your search keyword '"Jin, Qin"' showing total 1,604 results

Search Constraints

Start Over You searched for: Author "Jin, Qin" Remove constraint Author: "Jin, Qin"
1,604 results on '"Jin, Qin"'

Search Results

1. Quo Vadis, Motion Generation? From Large Language Models to Large Motion Models

2. Revealing Personality Traits: A New Benchmark Dataset for Explainable Personality Recognition on Dialogues

3. ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech

4. Muskits-ESPnet: A Comprehensive Toolkit for Singing Voice Synthesis in New Paradigm

5. mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding

6. What Makes a Good Story and How Can We Measure It? A Comprehensive Survey of Story Evaluation

7. Unveiling Visual Biases in Audio-Visual Localization Benchmarks

8. QuadrupedGPT: Towards a Versatile Quadruped Agent in Open-ended Worlds

9. UBiSS: A Unified Framework for Bimodal Semantic Summarization of Videos

10. ESCoT: Towards Interpretable Emotional Support Dialogue Systems

11. SingMOS: An extensive Open-Source Singing Voice Dataset for MOS Prediction

12. Adaptive Temporal Motion Guided Graph Convolution Network for Micro-expression Recognition

13. SingOMD: Singing Oriented Multi-resolution Discrete Representation Construction from Speech Models

14. TokSing: Singing Voice Synthesis based on Discrete Tokens

15. The Interspeech 2024 Challenge on Speech Processing Using Discrete Units

16. EgoNCE++: Do Egocentric Video-Language Models Really Understand Hand-Object Interactions?

17. Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline

18. ECR-Chain: Advancing Generative Language Models to Better Emotion-Cause Reasoners through Reasoning Chains

19. TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning

20. Think-Program-reCtify: 3D Situated Reasoning with Large Language Models

21. Movie101v2: Improved Movie Narration Benchmark

22. mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding

23. SPAFormer: Sequential 3D Part Assembly with Transformers

24. POV: Prompt-Oriented View-Agnostic Learning for Egocentric Hand-Object Interaction in the Multi-View World

26. Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective

27. Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and ACE-KiSing

30. UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model

31. The development of a dietary nutrient density educational tool and the investigation of its acceptance by Chinese residents from Henan province

32. Explore and Tell: Embodied Visual Captioning in 3D Environments

33. A Systematic Exploration of Joint-training for Singing Voice Synthesis

34. Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences

35. No-frills Temporal Video Grounding: Multi-Scale Neighboring Attention and Zoom-in Boundary Detection

36. Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation

40. Movie101: A New Movie Understanding Benchmark

41. Edit As You Wish: Video Caption Editing with Multi-grained User Control

42. InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation

43. Knowledge Enhanced Model for Live Video Comment Generation

44. Rethinking Benchmarks for Cross-modal Image-text Retrieval

45. MPMQA: Multimodal Question Answering on Product Manuals

46. PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation with Phoneme Distribution Predictor

47. Accommodating Audio Modality in CLIP for Multimodal Processing

48. TikTalk: A Video-Based Dialogue Dataset for Multi-Modal Chitchat in Real World

49. MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

50. Systematic review and network meta-analysis of non-invasive respiratory support in paediatric patients with acute hypoxaemic respiratory failure: a protocol

Catalog

Books, media, physical & digital resources