Author: "Dobhal, Daksh" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Dobhal, Daksh"' showing total 8 results

1. $\forall$uto$\exists$$\lor\!\land$L: Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks

Author: Karia, Rushang, Bramblett, Daniel, Dobhal, Daksh, and Srivastava, Siddharth
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: This paper presents $\forall$uto$\exists$$\lor\!\land$L, a novel benchmark for scaling Large Language Model (LLM) assessment in formal tasks with clear notions of correctness, such as truth maintenance in translation and logical reasoning. $\forall$uto$\exists$$\lor\!\land$L is the first benchmarking paradigm that offers several key advantages necessary for scaling objective evaluation of LLMs without human labeling: (a) ability to evaluate LLMs of increasing sophistication by auto-generating tasks at different levels of difficulty; (b) auto-generation of ground truth that eliminates dependence on expensive and time-consuming human annotation; (c) the use of automatically generated, randomized datasets that mitigate the ability of successive LLMs to overfit to static datasets used in many contemporary benchmarks. Empirical analysis shows that an LLM's performance on $\forall$uto$\exists$$\lor\!\land$L is highly indicative of its performance on a diverse array of other benchmarks focusing on translation and reasoning tasks, making it a valuable autonomous evaluation paradigm in settings where hand-curated datasets can be hard to obtain and/or update.
Published: 2024

2. Using Explainable AI and Hierarchical Planning for Outreach with Robots

Author: Dobhal, Daksh, Nagpal, Jayesh, Karia, Rushang, Verma, Pulkit, Nayyar, Rashmeet Kaur, Shah, Naman, and Srivastava, Siddharth
Subjects: Computer Science - Robotics
Abstract: Understanding how robots plan and execute tasks is crucial in today's world, where they are becoming more prevalent in our daily lives. However, teaching non-experts the complexities of robot planning can be challenging. This work presents an open-source platform that simplifies the process using a visual interface that completely abstracts the complex internals of hierarchical planning that robots use for performing task and motion planning. Using the principles developed in the field of explainable AI, this intuitive platform enables users to create plans for robots to complete tasks, and provides helpful hints and natural language explanations for errors. The platform also has a built-in simulator to demonstrate how robots execute submitted plans. This platform's efficacy was tested in a user study on university students with little to no computer science background. Our results show that this platform is highly effective in teaching novice users the intuitions of robot task planning.
Published: 2024

3. $\forall$uto$\exists$val: Autonomous Assessment of LLMs in Formal Synthesis and Interpretation Tasks

Author: Karia, Rushang, Bramblett, Daniel, Dobhal, Daksh, Verma, Pulkit, and Srivastava, Siddharth
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: This paper presents $\forall$uto$\exists$val, a new approach for scaling LLM assessment in translating formal syntax -- such as first-order logic, regular expressions, etc. -- to natural language (interpretation) or vice versa (compilation), thereby facilitating their use in applications such as generating/explaining logic and control flow for programs etc. Existing approaches for LLM assessment in these areas require labor-intensive ground-truth creation, the availability of which undermines the separation of training and test sets. Furthermore, such datasets typically include relatively few hand-coded test cases over which LLM accuracy is determined, thus making them inadequate for determining the safety or correctness of their generated outputs. We introduce a new approach that utilizes context-free grammars (CFGs) to generate out-of-distribution datasets on the fly and perform closed-loop testing of LLM capabilities using formal verifiers to guarantee the correctness of LLM outputs without any human intervention. We release our dataset and benchmark as open-source code at https://github.com/AAIR-lab/auto-llm-assessment. We also conduct an assessment of several SOTA closed and open-source LLMs to showcase the feasibility and scalability of this paradigm. Our experiments reveal that SOTA LLMs are unable to solve the formal translation task adequately.
Published: 2024

4. Experimental Validation of Fully Informed Particle Swarm Optimization Tuned Multi-Loop L-PID Controllers for Stabilization of Gantry Crane System

Author: Valluru, Sudarshan K., Singh, Madhusudan, Dobhal, Daksh, Kartikeya, Kumar, Kaur, Manpreet, and Goel, Arnav
Editor: Choudhury, Sushabhan, Mishra, Ranjan, Mishra, Raj Gaurav, and Kumar, Adesh
Series Editor: Kacprzyk, Janusz
Advisory Editor: Pal, Nikhil R., Bello Perez, Rafael, Corchado, Emilio S., Hagras, Hani, Kóczy, László T., Kreinovich, Vladik, Lin, Chin-Teng, Lu, Jie, Melin, Patricia, Nedjah, Nadia, Nguyen, Ngoc Thanh, and Wang, Jun
Published: 2020

5. Can LLMs Converse Formally? Automatically Assessing LLMs in Translating and Interpreting Formal Specifications

Author: Karia, Rushang, Dobhal, Daksh, Bramblett, Daniel, Verma, Pulkit, and Srivastava, Siddharth
Abstract: Stakeholders often describe system requirements using natural language, which is then converted to formal syntax by a domain expert, leading to increased design costs. This paper assesses the capabilities of Large Language Models (LLMs) in converting between natural language descriptions and formal specifications. Existing work has evaluated the capabilities of LLMs in generating formal syntax such as source code, but such experiments are typically hand-crafted, use problems that are likely to be in the training set of LLMs, and often require human-annotated datasets. We propose an approach that can use two copies of an LLM in conjunction with an off-the-shelf verifier to automatically evaluate its translation abilities without any additional human input. Our approach generates formal syntax using language grammars to automatically generate a dataset. We conduct an empirical evaluation to measure the accuracy of this translation task and show that SOTA LLMs cannot adequately solve this task, limiting their current utility in the design of complex systems.
Published: 2024

6. Experimental Investigation of Fully Informed Particle Swarm Optimization tuned Multi Loop L-PID and NL-PID Controllers for Gantry Crane System

Author: Valluru, Sudarshan K., Kaur, Manpreet, Kartikeya, Kumar, Goel, Arnav, and Dobhal, Daksh
Published: 2020

7. Experimental Validation of Fully Informed Particle Swarm Optimization Tuned Multi-Loop L-PID Controllers for Stabilization of Gantry Crane System

Author: Valluru, Sudarshan K., Singh, Madhusudan, Dobhal, Daksh, Kartikeya, Kumar, Kaur, Manpreet, and Goel, Arnav
Published: 2019

8. Design of Multi-Loop L-PID and NL-PID Controllers: An Experimental Validation

Author: Valluru, Sudarshan K., Singh, Madhusudan, Goel, Arnav, Kaur, Manpreet, Dobhal, Daksh, Kartikeya, Kumar, Verma, Aditya, and Gupta, Anshul
Published: 2018
