Search

Author search for "Kuehne, Hilde": 178 results


Search Results

1. Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence

2. mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition

3. TimeLogic: A Temporal Logic Benchmark for Video QA

4. State-Space Large Audio Language Models

5. Teaching VLMs to Localize Specific Objects from In-context Examples

6. Convolutional Differentiable Logic Gate Networks

7. Newton Losses: Using Curvature Information for Learning with Differentiable Algorithms

8. MaskInversion: Localized Embeddings via Optimization of Explainability Maps

9. DASS: Distilled Audio State Space Models Are Stronger and More Duration-Scalable Learners

10. Meta-prompting for Automating Zero-Shot Visual Recognition with LLMs

11. Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation

12. ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs

13. LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity

14. Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs

15. Uncertainty Quantification via Stable Distribution Propagation

16. Grounding Everything: Emerging Localization Properties in Vision-Language Transformers

17. Learning Human Action Recognition Representations Without Real Humans

18. HowToCaption: Prompting LLMs to Transform Video Annotations at Scale

19. In-Style: Bridging Text and Uncurated Videos with Style Transfer for Text-Video Retrieval

20. Preserving Modality Structure Improves Multi-Modal Learning

21. What a MESS: Multi-Domain Evaluation of Zero-Shot Semantic Segmentation

22. Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages

23. ISAAC Newton: Input-based Approximate Curvature for Newton's Method

24. Learning Situation Hyper-Graphs for Video Question Answering

25. WEAR: An Outdoor Sports Dataset for Wearable and Egocentric Activity Recognition

26. What, when, and where? -- Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions

27. Temperature Schedules for Self-Supervised Contrastive Methods on Long-Tail Data

28. MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge

29. TAEC: Unsupervised Action Segmentation with Temporal-Aware Embedding and Clustering

30. Learning by Sorting: Self-supervised Learning with Group Ordering Constraints

31. Video Test-Time Adaptation for Action Recognition

32. Deep Differentiable Logic Gate Networks

33. C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval

34. Contrastive Audio-Visual Masked Autoencoder

35. VL-Taboo: An Analysis of Attribute-based Zero-shot Capabilities of Vision-Language Models

36. Augmentation Learning for Semi-Supervised Classification

37. Weakly Supervised Grounding for VQA in Vision-Language Transformers

38. Differentiable Top-k Classification Learning

39. CycDA: Unsupervised Cycle Domain Adaptation from Image to Video

40. Monotonic Differentiable Sorting Networks

41. Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval

42. Unsupervised Domain Generalization by Learning a Bridge Across Domains

43. Routing with Self-Attention for Multimodal Capsule Networks

44. Cascaded Multilingual Audio-Visual Learning from Videos

45. Style Agnostic 3D Reconstruction via Adversarial Style Transfer

46. Learning with Algorithmic Supervision via Continuous Relaxations

47. Generalized and Incremental Few-Shot Learning by Explicit Learning and Calibration without Forgetting

48. Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules

49. Differentiable Sorting Networks for Scalable Sorting and Ranking Supervision

50. Unsupervised Discriminative Embedding for Sub-Action Learning in Complex Activities
