Search

Your search keyword '"Lee Donghwan"' showing total 50 results

Search Constraints

Start Over You searched for: Author "Lee Donghwan" Remove constraint Author: "Lee Donghwan" Database OAIster Remove constraint Database: OAIster
50 results on '"Lee Donghwan"'

Search Results

1. Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach

2. Analysis of Off-Policy Multi-Step TD-Learning with Linear Function Approximation

3. Finite-Time Error Analysis of Online Model-Based Q-Learning with a Relaxed Sampling Model

4. Harnessing Membership Function Dynamics for Stability Analysis of T-S Fuzzy Systems

5. A finite time analysis of distributed Q-learning

6. Unified ODE Analysis of Smooth Q-Learning Algorithms

7. Backstepping Temporal Difference Learning

8. Demystifying Disagreement-on-the-Line in High Dimensions

9. On Some Geometric Behavior of Value Iteration on the Orthant: Switching System Perspective

10. TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering

11. A Theory of Non-Linear Feature Learning with One Gradient Step in Two-Layer Neural Networks

12. Suppressing Overestimation in Q-Learning through Adversarial Behaviors

13. A primal-dual perspective for distributed TD-learning

14. Relaxed Conditions for Parameterized Linear Matrix Inequality in the Form of Nested Fuzzy Summations

15. On the Local Quadratic Stability of T-S Fuzzy Systems in the Vicinity of the Origin

16. Continuous-Time Distributed Dynamic Programming for Networked Multi-Agent Markov Decision Processes

17. Temporal Difference Learning with Experience Replay

18. Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach

19. Optimal Heterogeneous Collaborative Linear Regression and Contextual Bandits

20. Block Double-Submission Attack: Block Withholding Can Be Self-Destructive

21. Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View

22. Finite-Time Analysis of Temporal Difference Learning: Discrete-Time Linear System Perspective

23. A Single Correspondence Is Enough: Robust Global Registration to Avoid Degeneracy in Urban Environments

24. SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning

25. T-Cal: An optimal test for the calibration of predictive models

26. Regularized Q-learning

27. Collaborative Learning of Discrete Distributions under Heterogeneity and Communication Constraints

28. Investigating the Role of Image Retrieval for Visual Localization -- An exhaustive benchmark

29. Control Theoretic Analysis of Temporal Difference Learning

30. New Versions of Gradient Temporal Difference Learning

31. On the Semidefinite Duality of Finite-Horizon LQG Problem

32. DnD: Dense Depth Estimation in Crowded Dynamic Indoor Scenes

33. Convergence of Dynamic Programming on the Semidefinite Cone

34. Data-Driven Control Design with LMIs and Dynamic Programming

35. Multi-Objective LQG Design with Primal-Dual Method

36. Large-scale Localization Datasets in Crowded Indoor Spaces

37. Simulation Studies on Deep Reinforcement Learning for Building Control with Human Interaction

38. A Discrete-Time Switching System Analysis of Q-learning

39. HARMer: Cyber-attacks Automation and Evaluation

40. Periodic Q-Learning

41. SelfDeco: Self-Supervised Monocular Depth Completion in Challenging Indoor Environments

42. SAFENet: Self-Supervised Monocular Depth Estimation with Semantic-Aware Feature Extraction

43. DEP domain-containing mTOR-interacting protein suppresses lipogenesis and ameliorates hepatic steatosis and acute-on-chronic liver injury in alcoholic liver disease

44. A Unified Switching System Perspective and O.D.E. Analysis of Q-Learning Algorithms

45. Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents

46. Target-Based Temporal Difference Learning

47. Learning to Communicate: A Machine Learning Framework for Heterogeneous Multi-Agent Robotic Systems

48. Hidden Markov Model Estimation-Based Q-learning for Partially Observable Markov Decision Process

49. One CNV Discordance in NRXN1 Observed Upon Genome-wide Screening in 38 Pairs of Adult Healthy Monozygotic Twins

50. Rediscovery rate estimation for assessing the validation of significant findings in high-throughput studies

Catalog

Books, media, physical & digital resources