Search

Your search keyword '"Zheng, Zeyu"' showing total 614 results

Search Constraints

Start Over You searched for: Author "Zheng, Zeyu" Remove constraint Author: "Zheng, Zeyu"
614 results on '"Zheng, Zeyu"'

Search Results

1. Normalization and effective learning rates in reinforcement learning

2. Daily Physical Activity Monitoring -- Adaptive Learning from Multi-source Motion Sensor Data

3. Understanding the performance gap between online and offline alignment algorithms

4. Large Language Model Enhanced Machine Learning Estimators for Classification

5. Collaborative Intelligence in Sequential Experiments: A Human-in-the-Loop Framework for Drug Discovery

6. A Preliminary Study on Accelerating Simulation Optimization with GPU Implementation

7. Language Model Prompt Selection via Simulation Optimization

8. Human Alignment of Large Language Models through Online Preference Optimisation

9. Disentangling the Causes of Plasticity Loss in Neural Networks

10. Generalized Preference Optimization: A Unified Approach to Offline Alignment

12. Gemini: A Family of Highly Capable Multimodal Models

13. Causal inference with Machine Learning-Based Covariate Representation

17. Sandpile Prediction on Undirected Graphs

18. Regret Distribution in Stochastic Bandits: Optimal Trade-off between Expectation and Tail Risk

19. Best Arm Identification with Fairness Constraints on Subpopulations

20. Efficient targeted learning of heterogeneous treatment effects for multiple subgroups.

24. Understanding plasticity in neural networks

25. Efficient Targeted Learning of Heterogeneous Treatment Effects for Multiple Subgroups

26. Adaptive A/B Tests and Simultaneous Treatment Parameter Optimization

27. Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations

29. Gradient-Free Methods for Deterministic and Stochastic Nonsmooth Nonconvex Optimization

30. Extremal planar graphs with no cycles of particular lengths

31. A Short Proof of a Convex Representation for Stationary Distributions of Markov Chains with an Application to State Space Truncation

33. Inference on the Best Policies with Many Covariates

34. Common kings of a chain of cycles in a strong tournament

35. A Simple and Optimal Policy Design with Safety against Heavy-Tailed Risk for Stochastic Bandits

36. GrASP: Gradient-Based Affordance Selection for Planning

37. Selecting the Best Optimizing System

40. Note on the Tur\'an number of the $3$-linear hypergraph $C_{13}$

45. Role of aerobic exercise in ameliorating NASH: Insights into the hepatic thyroid hormone signaling and circulating thyroid hormones.

46. Offline Planning and Online Learning under Recovering Rewards

Catalog

Books, media, physical & digital resources