Search

Your search keyword '"Tamkin, Alex"' showing total 36 results

Search Constraints

Start Over You searched for: Author "Tamkin, Alex" Remove constraint Author: "Tamkin, Alex"
36 results on '"Tamkin, Alex"'

Search Results

1. Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

2. Collective Constitutional AI: Aligning a Language Model with Public Input

3. Bayesian Preference Elicitation with Language Models

4. Evaluating and Mitigating Discrimination in Language Model Decisions

5. Social Contract AI: Aligning AI Assistants with Implicit Group Norms

6. Codebook Features: Sparse and Discrete Interpretability for Neural Networks

7. Eliciting Human Preferences with Language Models

8. Turbulence in Focus: Benchmarking Scaling Behavior of 3D Volumetric Super-Resolution with BLASTNet 2.0 Data

9. Studying Large Language Model Generalization with Influence Functions

10. Towards Measuring the Representation of Subjective Global Opinions in Language Models

11. Operationalising the Definition of General Purpose AI Systems: Assessing Four Approaches

12. BenchMD: A Benchmark for Unified Learning on Medical Images and Sensors

13. Multispectral Contrastive Learning with Viewmaker Networks

14. Task Ambiguity in Humans and Language Models

15. Feature Dropout: Revisiting the Role of Augmentations in Contrastive Learning

16. Active Learning Helps Pretrained Models Learn the Intended Task

17. Oolong: Investigating What Makes Transfer Learning Hard with Controlled Studies

18. Tradeoffs Between Contrastive and Supervised Learning: An Empirical Study

19. DABS: A Domain-Agnostic Benchmark for Self-Supervised Learning

20. C5T5: Controllable Generation of Organic Molecules with Transformers

21. On the Opportunities and Risks of Foundation Models

22. Understanding the Capabilities, Limitations, and Societal Impact of Large Language Models

23. Language Through a Prism: A Spectral Approach for Multiscale Language Representations

24. Viewmaker Networks: Learning Views for Unsupervised Representation Learning

25. Investigating Transferability in Pretrained Language Models

26. Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy

31. Oolong: Investigating What Makes Crosslingual Transfer Hard with Controlled Studies

36. drone.io: A Gestural and Visual Interface for Human-Drone Interaction.

Catalog

Books, media, physical & digital resources