Search

Your search keyword '"Zhuang, Yueting"' showing total 1,184 results

Search Constraints

Start Over You searched for: Author "Zhuang, Yueting" Remove constraint Author: "Zhuang, Yueting"
1,184 results on '"Zhuang, Yueting"'

Search Results

1. Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models

2. GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation

3. RADAR: Robust Two-stage Modality-incomplete Industrial Anomaly Detection

4. Align$^2$LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation

5. TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition

6. Logic Distillation: Learning from Code Function by Function for Planning and Decision-making

7. Distilling Vision-Language Foundation Models: A Data-Free Approach via Prompt Diversification

8. IDEAL: Leveraging Infinite and Dynamic Characterizations of Large Language Models for Query-focused Summarization

9. From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation

10. Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

11. Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference

12. Bridging Local Details and Global Context in Text-Attributed Graphs

13. Improving Large Models with Small models: Lower Costs and Better Performance

14. T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text

15. Stock Movement Prediction with Multimodal Stable Fusion via Gated Cross-Attention Mechanism

16. DuetRAG: Collaborative Retrieval-Augmented Generation

17. Auto-Encoding Morph-Tokens for Multimodal LLM

18. WorldGPT: Empowering LLM as Multimodal World Model

19. LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation

20. Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales

21. HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models

22. ProSwitch: Knowledge-Guided Instruction Tuning to Switch Between Professional and Non-Professional Answers

23. Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization

24. Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering

25. Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning

26. Explore Synergistic Interaction Across Frames for Interactive Video Object Segmentation

27. Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives

29. TaskBench: Benchmarking Large Language Models for Task Automation

30. HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data

31. Revisiting the Domain Shift and Sample Uncertainty in Multi-source Active Domain Transfer

32. De-fine: Decomposing and Refining Visual Programs with Auto-Feedback

33. Adapt Anything: Tailor Any Image Classifiers across Domains And Categories Using Text-to-Image Diffusion Models

34. Improving Vision Anomaly Detection with the Guidance of Language Modality

35. Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model

36. Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions

37. Degeneration-Tuning: Using Scrambled Grid shield Unwanted Concepts from Stable Diffusion

38. ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: TREK-150 Single Object Tracking

39. ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: Semi-Supervised Video Object Segmentation

40. Improving Reference-based Distinctive Image Captioning with Contrastive Rewards

41. Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow

43. PromptNER: Prompt Locating and Typing for Named Entity Recognition

44. Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs Collaboration

45. DiffusionNER: Boundary Diffusion for Named Entity Recognition

46. Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models

47. InstructVid2Vid: Controllable Video Editing with Natural Language Instructions

48. Continual Vision-Language Representation Learning with Off-Diagonal Information

49. DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition

50. Learning in Imperfect Environment: Multi-Label Classification with Long-Tailed Distribution and Partial Labels

Catalog

Books, media, physical & digital resources