Search

Your search keyword '"Computer Science - Computer Vision and Pattern Recognition"' showing total 358,600 results

Search Constraints

Start Over You searched for: Descriptor "Computer Science - Computer Vision and Pattern Recognition" Remove constraint Descriptor: "Computer Science - Computer Vision and Pattern Recognition"
358,600 results on '"Computer Science - Computer Vision and Pattern Recognition"'

Search Results

201. Fall Leaf Adversarial Attack on Traffic Sign Classification

202. Multi-Task Learning for Integrated Automated Contouring and Voxel-Based Dose Prediction in Radiotherapy

203. CoVis: A Collaborative Framework for Fine-grained Graphic Visual Understanding

204. DiffMVR: Diffusion-based Automated Multi-Guidance Video Restoration

205. Multi-Task Model Merging via Adaptive Weight Disentanglement

206. Generative Visual Communication in the Era of Vision-Language Models

207. The Last Mile to Supervised Performance: Semi-Supervised Domain Adaptation for Semantic Segmentation

208. Evaluating Vision-Language Models as Evaluators in Path Planning

209. Random Walks with Tweedie: A Unified Framework for Diffusion Models

210. MatchDiffusion: Training-free Generation of Match-cuts

211. GaussianSpeech: Audio-Driven Gaussian Avatars

212. AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

213. Active Data Curation Effectively Distills Large-Scale Multimodal Models

214. FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models

215. TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video

216. SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality

217. Towards Chunk-Wise Generation for Long Videos

218. Point Cloud Unsupervised Pre-training via 3D Gaussian Splatting

219. SpotLight: Shadow-Guided Object Relighting via Diffusion

220. 3D Scene Graph Guided Vision-Language Pre-training

221. OOD-HOI: Text-Driven 3D Whole-Body Human-Object Interactions Generation Beyond Training Domains

222. Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling

223. HDI-Former: Hybrid Dynamic Interaction ANN-SNN Transformer for Object Detection Using Frames and Events

224. HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior

225. DHCP: Detecting Hallucinations by Cross-modal Attention Pattern in Large Vision-Language Models

226. AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward

227. Textured Gaussians for Enhanced 3D Scene Appearance Modeling

228. GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

229. Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

230. Leveraging Semi-Supervised Learning to Enhance Data Mining for Image Classification under Limited Labeled Data

231. Cross-modal Information Flow in Multimodal Large Language Models

232. Diffusion Self-Distillation for Zero-Shot Customized Image Generation

233. CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

234. Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective

235. Evaluating and Improving the Effectiveness of Synthetic Chest X-Rays for Medical Image Analysis

236. Structured light with a million light planes per second

237. Biomolecular Analysis of Soil Samples and Rock Imagery for Tracing Evidence of Life Using a Mobile Robot

238. Hierarchical Information Flow for Generalized Efficient Image Restoration

239. Exploring Depth Information for Detecting Manipulated Face Videos

240. DexDiffuser: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation

241. FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion

242. PhyCAGE: Physically Plausible Compositional 3D Asset Generation from a Single Image

243. AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans

244. Utilizing the Mean Teacher with Supcontrast Loss for Wafer Pattern Recognition

245. Enhancing weed detection performance by means of GenAI-based image augmentation

246. GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation

247. A comparison of extended object tracking with multi-modal sensors in indoor environment

248. HEMGS: A Hybrid Entropy Model for 3D Gaussian Splatting Data Compression

249. Weakly Supervised Framework Considering Multi-temporal Information for Large-scale Cropland Mapping with Satellite Imagery

250. Complexity Experts are Task-Discriminative Learners for Any Image Restoration

Catalog

Books, media, physical & digital resources