Search

Your search keyword '"Nguyen, Karina"' showing total 10 results

Search Constraints

Start Over You searched for: Author "Nguyen, Karina" Remove constraint Author: "Nguyen, Karina" Publication Type Reports Remove constraint Publication Type: Reports
10 results on '"Nguyen, Karina"'

Search Results

1. Evaluating and Mitigating Discrimination in Language Model Decisions

2. Specific versus General Principles for Constitutional AI

3. Studying Large Language Model Generalization with Influence Functions

4. Question Decomposition Improves the Faithfulness of Model-Generated Reasoning

5. Measuring Faithfulness in Chain-of-Thought Reasoning

6. Towards Measuring the Representation of Subjective Global Opinions in Language Models

7. Vision Transformers for Mobile Applications: A Short Survey

8. FAIR-Ensemble: When Fairness Naturally Emerges From Deep Ensembling

9. The Capacity for Moral Self-Correction in Large Language Models

10. Discovering Language Model Behaviors with Model-Written Evaluations

Catalog

Books, media, physical & digital resources