Search

Your search keyword '"Chang, Xiaojun"' showing total 843 results

Search Constraints

Start Over You searched for: Author "Chang, Xiaojun" Remove constraint Author: "Chang, Xiaojun"
843 results on '"Chang, Xiaojun"'

Search Results

1. RealCustom++: Representing Images as Real-Word for Real-Time Customization

2. Disentangled Noisy Correspondence Learning

3. Contrastive Learning with Counterfactual Explanations for Radiology Report Generation

4. Label-anticipated Event Disentanglement for Audio-Visual Video Parsing

5. Teaching with Uncertainty: Unleashing the Potential of Knowledge Distillation in Object Detection

6. Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification

7. MLP Can Be A Good Transformer Learner

8. LongVLM: Efficient Long Video Understanding via Large Language Models

9. Self-Supervised Multi-Frame Neural Scene Flow

10. Unified Static and Dynamic Network: Efficient Temporal Filtering for Video Grounding

11. NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning

12. SWAP-NAS: Sample-Wise Activation Patterns for Ultra-fast NAS

13. DNA Family: Boosting Weight-Sharing NAS with Block-Wise Supervisions

14. MatchNAS: Optimizing Edge AI in Sparse-Label Data Contexts via Automating Deep Neural Network Porting for Mobile Deployment

15. Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation

16. Video Recognition in Portrait Mode

17. Shot2Story20K: A New Benchmark for Comprehensive Understanding of Multi-shot Videos

18. Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition

20. Disentangled Representation Learning with Transmitted Information Bottleneck

21. Mask Propagation for Efficient Video Semantic Segmentation

22. No Token Left Behind: Efficient Vision Transformer via Dynamic Token Idling

23. PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement

24. Normalized solutions for Sobolev critical Schr\'odinger-Bopp-Podolsky systems

26. ProAgent: Building Proactive Cooperative Agents with Large Language Models

27. SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation

28. FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration

29. Two-stream Multi-level Dynamic Point Transformer for Two-person Interaction Recognition

30. Convergence of least energy sign-changing solutions for logarithmic Schr\'{o}dinger equations on locally finite graphs

31. Maximum Entropy Heterogeneous-Agent Reinforcement Learning

32. Toward the Automated Construction of Probabilistic Knowledge Graphs for the Maritime Domain

33. Existence and instability of standing waves for the biharmonic nonlinear Schroedinger equation with combined nonlinearities

34. Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining

35. A Benchmark for Cycling Close Pass Near Miss Event Detection from Video Streams

36. Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation

37. Guided Image-to-Image Translation by Discriminator-Generator Communication

38. No Token Left Behind: Efficient Vision Transformer via Dynamic Token Idling

39. Origin and evolution of the triploid cultivated banana genome

41. ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency

42. Normalized solutions of $L^2$-supercritical NLS equations on noncompact metric graphs with localized nonlinearities

43. 3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation

44. Ground states for logarithmic Schr\'{o}dinger equations on locally finite graphs

45. Simple Primitives with Feasibility- and Contextuality-Dependence for Open-World Compositional Zero-shot Learning

46. Bounded Palais-Smale sequences with Morse type information for some constrained functionals

47. Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers

48. PAR: Political Actor Representation Learning with Social Context and Expert Knowledge

49. ViLPAct: A Benchmark for Compositional Generalization on Multimodal Human Activities

50. MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library

Catalog

Books, media, physical & digital resources