Search

Your search keyword '"Sikka, Karan"' showing total 144 results

Search Constraints

Start Over You searched for: Author "Sikka, Karan" Remove constraint Author: "Sikka, Karan"
144 results on '"Sikka, Karan"'

Search Results

1. Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification

2. A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval

3. DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback

4. Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning

5. Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models

6. SayNav: Grounding Large Language Models for Dynamic Planning to Navigation in New Environments

7. TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models

8. Predicting Information Pathways Across Online Communities

9. Multilingual Content Moderation: A Case Study on Reddit

10. Dual-Key Multimodal Backdoors for Visual Question Answering

11. Challenges in Procedural Multimodal Machine Comprehension:A Novel Way To Benchmark

12. Towards Solving Multimodal Comprehension

13. MISA: Online Defense of Trojaned Models using Misattributions

14. Detecting Trojaned DNNs Using Counterfactual Attributions

15. Zero-Shot Learning with Knowledge Enhanced Visual Semantic Embeddings

16. RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization

17. Deep Adaptive Semantic Logic (DASL): Compiling Declarative Knowledge into Deep Neural Networks

18. Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation

19. FoodX-251: A Dataset for Fine-grained Food Classification

20. Deep Unified Multimodal Embeddings for Understanding both Content and Users in Social Media Networks

21. Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts

22. Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment

23. Semantically-Aware Attentive Neural Embeddings for Image-based Visual Localization

24. Understanding Visual Ads by Aligning Symbols and Objects using Co-Attention

25. Zero-Shot Object Detection

26. Combining Weakly and Webly Supervised Learning for Classifying Food Images

27. AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos

28. Discriminatively Trained Latent Ordinal Model for Video Classification

29. LOMo: Latent Ordinal Model for Facial Analysis in Videos

30. Deep Active Object Recognition by Joint Label and Action Prediction

31. Deep active object recognition by joint label and action prediction

32. Pseudo vs. True Defect Classification in Printed Circuits Boards using Wavelet Features

35. Zero-Shot Object Detection

36. Exploring Bag of Words Architectures in the Facial Expression Domain

38. Dual-Key Multimodal Backdoors for Visual Question Answering

40. Challenges in Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark

41. Latent Dynamic Space-Time Volumes for Predicting Human Facial Behavior in Videos

45. RGB2LIDAR

Catalog

Books, media, physical & digital resources