mustafaaljadery / gemma-2B-10M
Gemma 2B with 10M context length using Infini-attention.
☆949Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for gemma-2B-10M
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆1,397Updated this week
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,602Updated 6 months ago
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,170Updated last week
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 …☆814Updated 4 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars☆960Updated 3 months ago
- ☆896Updated 6 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,351Updated 4 months ago
- Automate the analysis of GitHub repositories for LLMs with RepoToTextForLLMs. Fetch READMEs, structure, and non-binary files efficiently.…☆683Updated 5 months ago
- A series of math-specific large language models of our Qwen2 series.☆602Updated 3 weeks ago
- Port of OpenAI's Whisper model in C/C++ with xtts and wav2lip☆782Updated 3 months ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆687Updated 2 months ago
- The first open source Large Action Model generalist Artificial Narrow Intelligence agentic framework that controls completely human user …☆1,264Updated 5 months ago
- YaFSDP: Yet another Fully Sharded Data Parallel☆846Updated 2 weeks ago
- ☆718Updated 2 months ago
- Agent S: an open agentic framework that uses computers like a human☆606Updated this week
- An autoagentic AGI that is self-evolving and modular.☆892Updated 2 months ago
- AutoGroq is a groundbreaking tool that revolutionizes the way users interact with Autogen™ and other AI assistants. By dynamically genera…☆1,341Updated 3 months ago
- Codebase for Aria - an Open Multimodal Native MoE☆832Updated this week
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…☆496Updated 5 months ago
- ☆892Updated last month
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 30.67% tasks (pass@1) in SWE-b…☆2,723Updated 3 weeks ago
- the simplest self-building coding agent☆821Updated last month
- DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence☆2,197Updated last month
- Code for Quiet-STaR☆651Updated 3 months ago
- OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training☆317Updated last month
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones☆1,251Updated 7 months ago
- ☆641Updated this week