kyegomez / Fuyu
Implementation of Adept's all-new Fuyu multi-modality model in PyTorch
☆24 · Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for Fuyu
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆130 · Updated 2 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆75 · Updated last month
- This is the official repository for Inheritune. ☆105 · Updated last month
- Unofficial Implementation of Evolutionary Model Merging ☆33 · Updated 7 months ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (Official Code) ☆135 · Updated last month
- Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI ☆26 · Updated 2 weeks ago
- FuseAI Project ☆76 · Updated 3 months ago
- ☆57 · Updated 2 weeks ago
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention" ☆93 · Updated last month
- Expert Specialized Fine-Tuning ☆148 · Updated 2 months ago
- augmented LLM with self reflection ☆103 · Updated last year
- My fork of Allen AI's OLMo for educational purposes. ☆28 · Updated last week
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆50 · Updated 7 months ago
- ☆116 · Updated 5 months ago
- A repository for research on medium sized language models. ☆74 · Updated 6 months ago
- ☆35 · Updated last year
- X-LoRA: Mixture of LoRA Experts ☆178 · Updated 3 months ago
- ☆90 · Updated 4 months ago
- Set of scripts to finetune LLMs ☆36 · Updated 7 months ago
- ☆42 · Updated 2 months ago
- Small and Efficient Mathematical Reasoning LLMs ☆71 · Updated 9 months ago
- Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24) ☆50 · Updated 7 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI ☆83 · Updated last week
- ☆53 · Updated 5 months ago
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models" ☆135 · Updated 5 months ago
- ☆112 · Updated last month
- ☆73 · Updated 10 months ago
- ☆126 · Updated 7 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM). ☆147 · Updated this week
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents ☆46 · Updated 2 weeks ago