arpita8 / Awesome-Mixture-of-Experts-Papers
Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.
☆41 · Updated last month
Related projects:
- Repository for the paper Stream of Search: Learning to Search in Language ☆70 · Updated last month
- ☆75 · Updated 3 weeks ago
- Set of scripts to finetune LLMs ☆36 · Updated 5 months ago
- Prune transformer layers ☆60 · Updated 3 months ago
- ☆91 · Updated last month
- ☆105 · Updated this week
- Just a bunch of benchmark logs for different LLMs ☆112 · Updated last month
- Comprehensive analysis of the performance differences between QLoRA, LoRA, and full fine-tunes. ☆81 · Updated last year
- ☆85 · Updated 7 months ago
- Attribute (or cite) statements generated by LLMs back to in-context information. ☆107 · Updated 2 weeks ago
- ☆82 · Updated 3 weeks ago
- Functional Benchmarks and the Reasoning Gap ☆74 · Updated last month
- Codebase accompanying the Summary of a Haystack paper. ☆65 · Updated 2 months ago
- ☆109 · Updated last month
- Code for the paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization' ☆140 · Updated 3 months ago
- Automating enterprise workflows with multimodal agents ☆83 · Updated last month
- ☆68 · Updated 2 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆158 · Updated 2 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆53 · Updated last month
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond) ☆81 · Updated this week
- A simple unified framework for evaluating LLMs ☆121 · Updated this week
- Small and Efficient Mathematical Reasoning LLMs ☆69 · Updated 7 months ago
- ☆48 · Updated 11 months ago
- ☆77 · Updated last month
- ☆89 · Updated 11 months ago
- An automated tool for discovering insights from research paper corpora ☆131 · Updated 3 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" ☆82 · Updated 2 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆106 · Updated last week
- ☆68 · Updated last month
- Understand and test language model architectures on synthetic tasks. ☆156 · Updated 4 months ago