Gemma 2B with 10M context length using Infini-attention.
☆936May 12, 2024Updated last year
Alternatives and similar repositories for gemma-2B-10M
Users that are interested in gemma-2B-10M are comparing it to the libraries listed below
Sorting:
- AutoGroq is a groundbreaking tool that revolutionizes the way users interact with Autogen™ and other AI assistants. By dynamically genera…☆1,498Dec 23, 2024Updated last year
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,405Dec 10, 2024Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,206Mar 1, 2026Updated last week
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,335May 4, 2024Updated last year
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user…☆1,329Feb 13, 2025Updated last year
- Reaching LLaMA2 Performance with 0.1M Dollars☆989Jul 23, 2024Updated last year
- A framework for Claude Opus to intelligently orchestrate subagents.☆4,324Jul 1, 2024Updated last year
- PyTorch native post-training library☆5,697Updated this week
- The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling☆725Nov 25, 2024Updated last year
- Large World Model -- Modeling Text and Video with Millions Context☆7,399Oct 19, 2024Updated last year
- Fast, flexible LLM inference☆6,653Feb 27, 2026Updated last week
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 …☆852Jul 6, 2024Updated last year
- Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with I…☆375Apr 23, 2024Updated last year
- Unofficial Implementation of Animate Anyone by Novita AI☆782May 31, 2024Updated last year
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,315Feb 26, 2026Updated last week
- ☆3,082Nov 21, 2025Updated 3 months ago
- A self-organizing file system with llama 3