Search

Your search for "Xue Wei" returned 144 results.

Search Constraints

Author: "Xue Wei"
Publication Type: Reports
Search Results

1. pTSE-T: Presentation Target Speaker Extraction using Unaligned Text Cues

2. EVA: An Embodied World Model for Future Video Anticipation

3. FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation

4. Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation

5. Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer

6. You Know What I'm Saying: Jailbreak Attack via Implicit Reference

7. PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion

8. HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts

9. Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

10. AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems

11. Importance Weighting Can Help Large Language Models Self-Improve

12. NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models

13. STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs

14. Can LLMs 'Reason' in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation

15. MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

16. M-LRM: Multi-view Large Reconstruction Model

17. VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling

18. LLMs Meet Multimodal Generation and Editing: A Survey

19. CoCoGesture: Toward Coherent Co-speech 3D Gesture Generation in the Wild

20. Gravitational Production of Heavy Particles during and after Inflation

21. VAE-Var: Variational-Autoencoder-Enhanced Variational Assimilation

22. FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation

23. ComposerX: Multi-Agent Symbolic Music Composition with LLMs

24. Information Re-Organization Improves Reasoning in Large Language Models

25. FlashSpeech: Efficient Zero-Shot Speech Synthesis

26. Kilometer-Level Coupled Modeling Using 40 Million Cores: An Eight-Year Journey of Model Development

27. Mixed-Precision Computing in the GRIST Dynamical Core for Weather and Climate Modelling

28. MuPT: A Generative Symbolic Music Pretrained Transformer

29. RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation

30. Towards Automatic Evaluation for LLMs' Clinical Capabilities: Metric, Data, and Algorithm

31. ChatMusician: Understanding and Generating Music Intrinsically with LLM

32. Ads Recommendation in a Collapsed and Entangled World

33. RJUA-MedDQA: A Multimodal Benchmark for Medical Document Question Answering and Clinical Reasoning

34. CoMoSVC: Consistency Model-based Singing Voice Conversion

35. FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection

36. FengWu-4DVar: Coupling the Data-driven Weather Forecasting Model with 4D Variational Assimilation

37. RJUA-QA: A Comprehensive QA Dataset for Urology

38. Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation

39. Gauged Global Strings

40. Effective Action Approach for Preheating

41. Continual Learning with Dirichlet Generative-based Rehearsal

42. O2ATH: An OpenMP Offloading Toolkit for the Sunway Heterogeneous Manycore Platform

43. PUMGPT: A Large Vision-Language Model for Product Understanding

44. ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate

45. LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT

46. MARBLE: Music Audio Representation Benchmark for Universal Evaluation

47. ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation

48. NAS-FM: Neural Architecture Search for Tunable and Interpretable Sound Synthesis based on Frequency Modulation

49. Insert or Attach: Taxonomy Completion via Box Embedding

50. CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
