974 results on '"Jiang, Yu-Gang"'
Search Results
202. Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language
203. A Coarse-to-Fine Framework for Resource Efficient Video Recognition
204. Deep Learning for Video Classification and Captioning
205. The THUMOS Challenge on Action Recognition for Videos 'in the Wild'
206. DB-LSTM: Densely-connected Bi-directional LSTM for human action recognition
207. Semi-supervised Single-View 3D Reconstruction via Prototype Shape Priors
208. MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes
209. A Survey on Video Diffusion Models
210. Ultrafast non-volatile flash memory based on van der Waals heterostructures
211. FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model
212. Multi-Trigger Backdoor Attacks: More Triggers, More Threats
213. Automating the Diagnosis of Human Vision Disorders by Cross-modal 3D Generation
214. Eyes Can Deceive: Benchmarking Counterfactual Reasoning Abilities of Multi-modal Large Language Models
215. Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and Summarization
216. Fusing Multi-Stream Deep Networks for Video Classification
217. Evaluating Two-Stream CNN for Video Classification
218. Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification
219. Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks
220. Text-Driven Video Prediction
221. Unified View Empirical Study for Large Pretrained Model on Cross-Domain Few-Shot Learning
222. Non-local NetVLAD Encoding for Video Classification
223. From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios
224. Dynamic Routing and Knowledge Re-Learning for Data-Free Black-Box Attack
225. Two-dimensional materials for next-generation computing technologies
226. Pose-Normalized Image Generation for Person Re-identification
227. Long-Term Cloth-Changing Person Re-identification
228. Generalizing Face Forgery Detection via Uncertainty Learning
229. Relation Triplet Construction for Cross-modal Text-to-Video Retrieval
230. On the Importance of Spatial Relations for Few-shot Action Recognition
231. GCMA: Generative Cross-Modal Transferable Adversarial Attacks from Images to Videos
232. Suspected Objects Matter: Rethinking Model's Prediction for One-stage Visual Grounding
233. Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language
234. Learning Modality Interaction for Temporal Sentence Localization and Event Captioning in Videos
235. Learning part-based mid-level representation for visual recognition
236. Small footprint transistor architecture for photoswitching logic and in situ memory
237. HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition
238. CDistNet: Perceiving Multi-domain Character Distance for Robust Text Recognition
239. TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition
240. Zn2+ reduction induces neuronal death with changes in voltage-gated potassium and sodium channel currents
241. Genistein inhibits hypoxia, ischemic-induced death, and apoptosis in PC12 cells
242. The THUMOS challenge on action recognition for videos “in the wild”
243. Extreme vocabulary learning
244. A comparative study of the effectiveness and safety of combined procarbazine, lomustine, and vincristine as a therapeutic method for recurrent high-grade glioma: A protocol for systematic review and meta-analysis
245. Stacked multichannel autoencoder – an efficient way of learning from synthetic data
246. Microarray expression profiling and co-expression network analysis of circulating LncRNAs and mRNAs associated with neurotoxicity induced by BPA
247. PolarFormer: Multi-Camera 3D Object Detection with Polar Transformer
248. Look Before You Match: Instance Understanding Matters in Video Object Segmentation
249. SVFormer: Semi-supervised Video Transformer for Action Recognition
250. Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning
Discovery Service for Jio Institute Digital Library