Search

Your search keyword '"Shan, Ying"' showing total 2,297 results

Search Constraints

Start Over You searched for: Author "Shan, Ying" Remove constraint Author: "Shan, Ying"
2,297 results on '"Shan, Ying"'

Search Results

101. Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation

102. TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter

103. InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions

104. Sticker820K: Empowering Interactive Retrieval with Stickers

105. SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation

106. PanoGRF: Generalizable Spherical Radiance Fields for Wide-baseline Panoramas

107. Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance

108. Inserting Anybody in Diffusion Models via Celeb Basis

109. GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction

110. Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models

111. TaleCrafter: Interactive Story Visualization with Multiple Characters

112. TVTSv2: Learning Out-of-the-box Spatiotemporal Visual Representations at Scale

113. A Confidence-based Partial Label Learning Model for Crowd-Annotated Named Entity Recognition

114. What Makes for Good Visual Tokenizers for Large Language Models?

115. SparseGNV: Generating Novel Views of Indoor Scenes with Sparse Input Views

116. $\pi$-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation

117. HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video

118. SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes

119. NeAI: A Pre-convoluted Representation for Plug-and-Play Neural Ambient Illumination

120. MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing

121. Improved Test-Time Adaptation for Domain Generalization

122. TagGPT: Large Language Models are Zero-shot Multimodal Taggers

123. Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos

124. DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models

125. Learning Anchor Transformations for 3D Garment Animation

126. DropMAE: Masked Autoencoders with Spatial-Attention Dropout for Tracking Tasks

131. LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation

132. VMesh: Hybrid Volume-Mesh Representation for Efficient View Synthesis

133. Accelerating Vision-Language Pretraining with Free Language Modeling

134. BoPR: Body-aware Part Regressor for Human Shape and Pose Estimation

135. HRDFuse: Monocular 360{\deg}Depth Estimation by Collaboratively Learning Holistic-with-Regional Depth Distributions

136. HMC: Hierarchical Mesh Coarsening for Skeleton-free Motion Retargeting

137. FateZero: Fusing Attentions for Zero-shot Text-based Video Editing

138. Skinned Motion Retargeting with Residual Perception of Motion Semantics & Geometry

139. Binary Embedding-based Retrieval at Tencent

140. T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models

141. OSRT: Omnidirectional Image Super-Resolution with Distortion-aware Transformer

142. Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval

143. RILS: Masked Visual Reconstruction in Language Semantic Space

144. DPE: Disentanglement of Pose and Expression for General Video Portrait Editing

145. Study on the mechanism of DDX6 promoting proliferation and migration of nasopharyngeal carcinoma cells by regulating stability of CKMT1A mRNA

148. Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models

149. Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

150. Mitigating Artifacts in Real-World Video Super-Resolution Models

Catalog

Books, media, physical & digital resources