Search

Your search keyword '"Farajtabar, Mehrdad"' showing total 161 results

Search Constraints

Start Over You searched for: Author "Farajtabar, Mehrdad" Remove constraint Author: "Farajtabar, Mehrdad"
161 results on '"Farajtabar, Mehrdad"'

Search Results

1. From Dense to Dynamic: Token-Difficulty Driven MoEfication of Pre-Trained LLMs

2. SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF

3. Computational Bottlenecks of Training Small-scale Large Language Models

4. GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

5. Duo-LLM: A Framework for Studying Adaptive Computation in Large Language Models

6. Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization

7. CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

8. Weight subcloning: direct initialization of transformers using larger pretrained ones

9. LLM in a flash: Efficient Large Language Model Inference with Limited Memory

10. Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific Models

11. TiC-CLIP: Continual Training of CLIP Models

12. SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding

13. CLIP meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement

14. ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models

15. On the Efficacy of Multi-scale Data Samplers for Vision Applications

16. Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Reinforcement

17. An Empirical Study of Implicit Regularization in Deep Offline RL

18. Continual Learning Beyond a Single Model

19. Architecture Matters in Continual Learning

20. Wide Neural Networks Forget Less Catastrophically

21. Task-agnostic Continual Learning with Hybrid Probabilistic Models

22. Balance Regularized Neural Network Models for Causal Effect Estimation

23. Linear Mode Connectivity in Multitask and Continual Learning

24. The Effectiveness of Memory Replay in Large Scale Continual Learning

25. Optimization and Generalization of Regularization-Based Continual Learning: a Loss Approximation Viewpoint

26. A maximum-entropy approach to off-policy evaluation in average-reward MDPs

27. Understanding the Role of Training Regimes in Continual Learning

28. Learning to Incentivize Other Learning Agents

29. Dropout as an Implicit Gating Mechanism For Continual Learning

30. Self-Distillation Amplifies Regularization in Hilbert Space

31. Orthogonal Gradient Descent for Continual Learning

32. Cross-View Policy Learning for Street Navigation

33. Improved Knowledge Distillation via Teacher Assistant

34. Adapting Auxiliary Losses Using Gradient Similarity

35. Representation Learning over Dynamic Graphs

36. More Robust Doubly Robust Off-policy Evaluation

37. Hawkes Processes for Invasive Species Modeling and Management

38. Wasserstein Learning of Deep Generative Point Process Models

39. Joint Modeling of Event Sequence and Time Series with Attentional Twin Recurrent Neural Networks

40. Fake News Mitigation via Point Process Based Intervention

41. Recurrent Poisson Factorization for Temporal Recommendation

42. Distilling Information Reliability and Source Trustworthiness from Digital Traces

43. Multistage Campaigning in Social Networks

44. Smart broadcasting: Do you want to be seen?

45. Detecting weak changes in dynamic events over networks

46. Learning Granger Causality for Hawkes Processes

47. A Continuous-time Mutually-Exciting Point Process Framework for Prioritizing Events in Social Media

48. On The Network You Keep: Analyzing Persons of Interest using Cliqster

49. Correlated Cascades: Compete or Cooperate

50. COEVOLVE: A Joint Point Process Model for Information Diffusion and Network Co-evolution

Catalog

Books, media, physical & digital resources