kyegomez / Fuyu
Implementation of Adepts Fuyu all-new Multi-Modality model in pytorch
☆24Updated 2 months ago
Alternatives and similar repositories for Fuyu:
Users that are interested in Fuyu are comparing it to the libraries listed below
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆139Updated 4 months ago
- This is the official repository for Inheritune.☆109Updated 3 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated 10 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆77Updated 3 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated 11 months ago
- Unofficial Implementation of Evolutionary Model Merging☆33Updated 9 months ago
- ☆23Updated 4 months ago
- A repository for research on medium sized language models.☆76Updated 7 months ago
- ☆81Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- FuseAI Project☆75Updated last month
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)☆48Updated 5 months ago
- ☆74Updated last year
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆57Updated this week
- ☆90Updated this week
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆101Updated this week
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆106Updated 8 months ago
- ☆40Updated 8 months ago
- Reformatted Alignment☆113Updated 3 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 8 months ago
- My fork os allen AI's OLMo for educational purposes.☆30Updated last month
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated 2 months ago
- ☆69Updated this week
- ☆40Updated last month
- ☆120Updated 7 months ago
- ☆62Updated 3 months ago
- ☆116Updated 3 months ago
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"☆96Updated 3 months ago
- ☆47Updated last month
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆148Updated last month