Search

Your search keyword '"von Oswald, Johannes"' showing total 26 results

Search Constraints

Start Over You searched for: Author "von Oswald, Johannes" Remove constraint Author: "von Oswald, Johannes"
26 results on '"von Oswald, Johannes"'

Search Results

1. Multi-agent cooperation through learning-aware policy gradients

2. Learning Randomized Algorithms with Transformers

3. When can transformers compositionally generalize in-context?

4. State Soup: In-Context Skill Learning, Retrieval and Mixing

5. Linear Transformers are Versatile In-Context Learners

6. Discovering modular solutions that generalize compositionally

7. Uncovering mesa-optimization algorithms in Transformers

8. Gated recurrent neural networks discover attention

9. Transformers learn in-context by gradient descent

10. Disentangling the Predictive Variance of Deep Ensembles through the Neural Tangent Kernel

11. Random initialisations performing above chance and how to find them

12. The least-control principle for local learning at equilibrium

13. Learning where to learn: Gradient sparsity in meta and continual learning

14. A contrastive rule for meta-learning

15. Posterior Meta-Replay for Continual Learning

16. Neural networks with late-phase weights

17. Continual Learning in Recurrent Neural Networks

18. Continual learning with hypernetworks

19. The least-control principle for learning at equilibrium

20. On the reversed bias-variance tradeoff in deep ensembles

21. Neural networks with late-phase weights

22. Continual Learning in Recurrent Neural Networks

23. Continual learning with hypernetworks

24. Meta-Learning via Hypernetworks

Catalog

Books, media, physical & digital resources