Author: "Huizhen Yu" / Journal: corr - Searchworks@Jio Institute Digital Library Search Results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Your search keyword '"Huizhen Yu"' showing total 13 results

Start Over Author "Huizhen Yu" Journal corr

13 results on '"Huizhen Yu"'

1. Asynchronous Stochastic Approximation and Average-Reward Reinforcement Learning.

Author: Huizhen Yu, Yi Wan 0004, and Richard S. Sutton
Published: 2024
Full Text: View/download PDF

2. On Convergence of Average-Reward Q-Learning in Weakly Communicating Markov Decision Processes.

Author: Yi Wan 0004, Huizhen Yu, and Richard S. Sutton
Published: 2024
Full Text: View/download PDF

3. A Note on Stability in Asynchronous Stochastic Approximation without Communication Delays.

Author: Huizhen Yu, Yi Wan 0004, and Richard S. Sutton
Published: 2023
Full Text: View/download PDF

4. Two geometric input transformation methods for fast online reinforcement learning with neural nets.

Author: Sina Ghiassian, Huizhen Yu, Banafsheh Rafiee, and Richard S. Sutton
Published: 2018

5. On Generalized Bellman Equations and Temporal-Difference Learning.

Author: Huizhen Yu, Ashique Rupam Mahmood, and Richard S. Sutton
Published: 2017

6. Multi-step Off-policy Learning Without Importance Sampling Ratios.

Author: Ashique Rupam Mahmood, Huizhen Yu, and Richard S. Sutton
Published: 2017

7. On Convergence of some Gradient-based Temporal-Differences Algorithms for Off-Policy Learning.

Author: Huizhen Yu
Published: 2017

8. Some Simulation Results for Emphatic Temporal-Difference Learning Algorithms.

Author: Huizhen Yu
Published: 2016

9. Emphatic Temporal-Difference Learning.

Author: Ashique Rupam Mahmood, Huizhen Yu, Martha White, and Richard S. Sutton
Published: 2015

10. On Convergence of Emphatic Temporal-Difference Learning.

Author: Huizhen Yu
Published: 2015

11. Weak Convergence Properties of Constrained Emphatic Temporal-difference Learning with Constant and Slowly Diminishing Stepsize.

Author: Huizhen Yu
Published: 2015

12. A Function Approximation Approach to Estimation of Policy Gradient for POMDP with Structured Policies

Author: Huizhen Yu
Published: 2012

13. Discretized Approximations for POMDP with Average Cost

Author: Huizhen Yu and Dimitri P. Bertsekas
Published: 2012

Catalog

Books, media, physical & digital resources

See catalog results