mustafaaljadery / gemma-2B-10M
Gemma 2B with 10M context length using Infini-attention.
☆947 (updated last year)
Alternatives and similar repositories for gemma-2B-10M
Users who are interested in gemma-2B-10M are comparing it to the libraries listed below.
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p… ☆1,259 (updated last month)
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist… ☆1,662 (updated last year)
- YaFSDP: Yet another Fully Sharded Data Parallel ☆968 (updated 2 weeks ago)
- Reaching LLaMA2 Performance with 0.1M Dollars ☆981 (updated 10 months ago)
- OmniFusion — a multimodal model to communicate using text and images ☆230 (updated last year)
- A series of math-specific large language models of our Qwen2 series. ☆938 (updated 4 months ago)
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 … ☆844 (updated 11 months ago)
- ☆974 (updated 4 months ago)
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch. ☆683 (updated 9 months ago)
- ☆447 (updated last year)
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments ☆1,898 (updated this week)
- LLM-powered lossless compression tool ☆285 (updated 9 months ago)
- ☆930 (updated last year)
- Automate the analysis of GitHub repositories for LLMs with RepoToTextForLLMs. Fetch READMEs, structure, and non-binary files efficiently.… ☆755 (updated last year)
- A framework for Claude Opus to intelligently orchestrate subagents. ☆4,233 (updated 11 months ago)
- Llama-3 agents that can browse the web by following instructions and talking to you ☆1,405 (updated 5 months ago)
- Gemma 2 optimized for your local machine. ☆370 (updated 10 months ago)
- ☆1,083 (updated last year)
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o… ☆347 (updated 4 months ago)
- Official repository of Evolutionary Optimization of Model Merging Recipes ☆1,329 (updated 6 months ago)
- Official implementation of Half-Quadratic Quantization (HQQ) ☆818 (updated this week)
- Thunder gives you PyTorch models superpowers for training and inference. Unlock out-of-the-box optimizations for performance, memory and … ☆1,358 (updated this week)
- A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI ☆767 (updated last year)
- Training LLMs with QLoRA + FSDP ☆1,483 (updated 6 months ago)
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,495 (updated last year)
- ☆707 (updated last year)
- A trivial programmatic Llama 3 jailbreak. Sorry Zuck! ☆553 (updated 4 months ago)
- Fast Matrix Multiplications for Lookup Table-Quantized LLMs ☆369 (updated last month)
- Inference code for Mistral and Mixtral hacked up into original Llama implementation ☆371 (updated last year)
- LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. … ☆821 (updated 6 months ago)