Search

Your search keyword '"Shan, Shiguang"' showing total 1,672 results

Search Constraints

Start Over You searched for: Author "Shan, Shiguang" Remove constraint Author: "Shan, Shiguang"
1,672 results on '"Shan, Shiguang"'

Search Results

1. M$^3$oralBench: A MultiModal Moral Benchmark for LVLMs

2. Multi-P$^2$A: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models

3. RefHCM: A Unified Model for Referring Perceptions in Human-Centric Scenarios

4. Autoregressive Video Generation without Vector Quantization

5. UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

6. Semantic or Covariate? A Study on the Intractable Case of Out-of-Distribution Detection

7. UMFC: Unsupervised Multi-Domain Feature Calibration for Vision-Language Models

8. Smile upon the Face but Sadness in the Eyes: Emotion Recognition based on Facial Expressions and Eye Behaviors

9. Confidence Aware Learning for Reliable Face Anti-spoofing

10. Face-MLLM: A Large Face Perception Model

11. CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation

12. HERM: Benchmarking and Enhancing Multimodal LLMs for Human-Centric Understanding

13. Face Forgery Detection with Elaborate Backbone

14. Static for Dynamic: Towards a Deeper Understanding of Dynamic Facial Expressions Using Static Expression Data

15. Segment Anything for Videos: A Systematic Survey

16. T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models

17. Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs

18. Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models

19. VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model

20. Rethinking the Evaluation of Out-of-Distribution Detection: A Sorites Paradox

21. Generalized Semi-Supervised Learning via Self-Supervised Feature Adaptation

22. Anonymization Prompt Learning for Facial Privacy-Preserving Text-to-Image Generation

23. M$^3$GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation

24. BIMM: Brain Inspired Masked Modeling for Video Representation Learning

25. Task-adaptive Q-Face

26. Image to Pseudo-Episode: Boosting Few-Shot Segmentation by Unlabeled Data

28. Collaborative Domain Alignment for Multi-source Domain Adaptation

29. T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models

30. An Information Theoretical View for Out-of-Distribution Detection

31. Tokenize Anything via Prompting

32. Clothes-Changing Person Re-Identification with Feasibility-Aware Intermediary Matching

33. HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention

34. StylizedGS: Controllable Stylization for 3D Gaussian Splatting

35. GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing

36. Contrastive Learning of Person-independent Representations for Facial Action Unit Detection

37. Generalized Face Liveness Detection via De-fake Face Generator

38. Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness

39. FullLoRA-AT: Efficiently Boosting the Robustness of Pretrained Vision Transformers

40. Tokenize Anything via Prompting

41. From Static to Dynamic: Adapting Landmark-Aware Image Models for Facial Expression Recognition in Videos

42. Cooperative Dual Attention for Audio-Visual Speech Enhancement with Facial Cues

45. Learning Separable Hidden Unit Contributions for Speaker-Adaptive Lip-Reading

46. Dual Compensation Residual Networks for Class Imbalanced Learning

47. Patch Is Not All You Need

48. Triplet Knowledge Distillation

49. Function-Consistent Feature Distillation

50. CCLAP: Controllable Chinese Landscape Painting Generation via Latent Diffusion Model

Catalog

Books, media, physical & digital resources