Search

Your search keyword '"Zhao, Tuo"' showing total 403 results

Search Constraints

Start Over You searched for: Author "Zhao, Tuo" Remove constraint Author: "Zhao, Tuo" Search Limiters Available in Library Collection Remove constraint Search Limiters: Available in Library Collection
403 results on '"Zhao, Tuo"'

Search Results

1. Provable Acceleration of Nesterov's Accelerated Gradient for Rectangular Matrix Factorization and Linear Neural Networks

2. Model Tells Itself Where to Attend: Faithfulness Meets Automatic Attention Steering

3. RNR: Teaching Large Language Models to Follow Roles and Rules

4. Robust Reinforcement Learning from Corrupted Human Feedback

5. RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning

6. Adaptive Preference Scaling for Reinforcement Learning with Human Feedback

7. To Cool or not to Cool? Temperature Network Meets Large Foundation Models via DRO

8. Stochastic Constrained Decentralized Optimization for Machine Learning with Fewer Data Oracles: a Gradient Sliding Approach

9. GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM

10. BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering

11. Data Diversity Matters for Robust Instruction Tuning

12. Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

13. Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms

14. Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult

15. SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process

16. Score Matching-based Pseudolikelihood Estimation of Neural Marked Spatio-Temporal Point Process with Uncertainty Quantification

17. Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer

18. Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms

19. LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

20. Module-wise Adaptive Distillation for Multimodality Foundation Models

21. Sample Complexity of Neural Policy Mirror Descent for Policy Optimization on Low-Dimensional Manifolds

22. Pivotal Estimation of Linear Discriminant Analysis in High Dimensions

23. Deep Reinforcement Learning from Hierarchical Preference Design

24. Provable Benefits of Policy Learning from Human Preferences in Contextual Bandit Problems

25. Nonparametric Classification on Low Dimensional Manifolds using Overparameterized Convolutional Residual Networks

26. Effective Minkowski Dimension of Deep Nonparametric Regression: Function Approximation and Statistical Theories

27. LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation

28. Machine Learning Force Fields with Data Cost Aware Training

30. AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning

31. On Deep Generative Models for Approximation and Estimation of Distributions on Manifolds

32. HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers

33. Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data

34. Efficient Long Sequence Modeling via State Space Augmented Transformer

35. High Dimensional Binary Classification under Label Shift: Phase Transition and Regularization

36. Less is More: Task-aware Layer-wise Distillation for Language Model Compression

37. First-order Policy Optimization for Robust Markov Decision Process

38. Context-Aware Query Rewriting for Improving Users' Search Experience on E-commerce Websites

39. DiP-GNN: Discriminative Pre-Training of Graph Neural Networks

40. Differentially Private Estimation of Hawkes Process

41. PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance

42. Benefits of Overparameterized Convolutional Residual Networks: Function Approximation under Smoothness Constraint

43. Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks

44. A Manifold Two-Sample Test Study: Integral Probability Metric with Neural Networks

45. MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation

46. CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing

47. CERES: Pretraining of Graph-Conditioned Transformer for Semi-Structured Session Data

48. Noise Regularizes Over-parameterized Rank One Matrix Recovery, Provably

49. No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models

50. Homotopic Policy Mirror Descent: Policy Convergence, Implicit Regularization, and Improved Sample Complexity

Catalog

Books, media, physical & digital resources