Search

Your search for "CLIP" returned 1,432 results

Search Results

1. VCP-CLIP: A Visual Context Prompting Model for Zero-Shot Anomaly Segmentation

2. CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP

3. Prompting Language-Informed Distribution for Compositional Zero-Shot Learning

4. Contour-Guided Context Learning for Scene Text Recognition

5. Boosting Fine-Grained Oriented Object Detection via Text Features

6. DATR: Domain Agnostic Text Recognizer

7. Teach CLIP to Develop a Number Sense for Ordinal Regression

8. Deciphering the Role of Representation Disentanglement: Investigating Compositional Generalization in CLIP Models

9. In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation

10. R²-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding

11. A Decoupling Video Frame Selection Method for Action Recognition

12. Zero-Shot Referring Image Segmentation with Hierarchical Prompts and Frequency Domain Fusion

13. A Proposal for Explainable Fruit Quality Recognition Using Multimodal Models

14. Expanding Design Horizons: Evolutionary Tool for Parametric Design Exploration with Interactive and CLIP-Based Evaluation

15. Enhancing Zero-Shot Anomaly Detection: CLIP-SAM Collaboration with Cascaded Prompts

16. Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation

17. SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference

18. Unleashing the Class-Incremental Learning Potential of Foundation Models by Virtual Feature Generation and Replay

19. Multi-layer Tuning CLIP for Few-Shot Image Classification

20. mCLIP: Multimodal Approach to Classify Memes

21. UniCrossAdapter: Multimodal Adaptation of CLIP for Radiology Report Generation

22. Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning

23. TmfimCLIP: Text-Driven Multi-Attribute Face Image Manipulation.

24. Local part attention for image stylization with text prompt.

25. Hierarchical bi-directional conceptual interaction for text-video retrieval.

26. Swimtrans Net: a multimodal robotic system for swimming action recognition driven via Swin-Transformer.

27. Development of a biocompatible 3D hydrogel scaffold using continuous liquid interface production for the delivery of cell therapies to treat recurrent glioblastoma.

28. DecoupleCLIP: A Novel Cross-Modality Decouple Model for Painting Captioning.

29. QR-CLIP: Introducing Explicit Knowledge for Location and Time Reasoning.

30. CAM-Vtrans: real-time sports training utilizing multi-modal robot data.

31. Grounded situation recognition under data scarcity.

32. Swimtrans Net: a multimodal robotic system for swimming action recognition driven via Swin-Transformer.

33. Endoclip-Assisted Cannulation for a Hidden Duodenal Papilla: Three Cases.

34. Open-world barely-supervised learning via augmented pseudo labels.

35. Dual-stream multi-label image classification model enhanced by feature reconstruction.

36. Grounded situation recognition under data scarcity

37. Based-CLIP early fusion transformer for image caption.

38. RBP-Tar – a searchable database for experimental RBP binding sites [version 3; peer review: 1 approved, 1 approved with reservations]

39. Efficacy of a novel traction method: outside-lesion clip-thread method for gastric endoscopic submucosal dissection of lesions of the greater curvature of the upper/middle stomach (with video).

40. WildCLIP: Scene and Animal Attribute Retrieval from Camera Trap Data with Domain-Adapted Vision-Language Models.

41. A Lightweight Enhancement Approach for Real-Time Semantic Segmentation by Distilling Rich Knowledge from Pre-Trained Vision-Language Model.

42. Unified View Empirical Study for Large Pretrained Model on Cross-Domain Few-Shot Learning.

43. RL-CWtrans Net: multimodal swimming coaching driven via robot vision.

44. Adapting CLIP for Action Recognition via Dual Semantic Supervision and Temporal Prompt Reparameterization.

45. Dose multimodal machine translation can improve translation performance?

46. CLIP feature-based randomized control using images and text for multiple tasks and robots.

47. Partial coil embolization before surgical clipping of ruptured intracranial aneurysms.

48. The Usefulness of Extradural Anterior Clinoidectomy for Low-Lying Posterior Communicating Artery Aneurysms : A Cadaveric Study.

49. Zero-shot urban function inference with street view images through prompting a pretrained vision-language model.

50. Detecting images generated by diffusers.
