pyember / ember
☆153Updated 2 weeks ago
Alternatives and similar repositories for ember:
Users that are interested in ember are comparing it to the libraries listed below
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆168Updated last month
- ☆65Updated this week
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆116Updated 10 months ago
- Long context evaluation for large language models☆207Updated last month
- ☆37Updated 2 months ago
- Verdict is a library for scaling judge-time compute.☆197Updated this week
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆196Updated last week
- ☆122Updated last month
- r2e: turn any github repository into a programming agent environment☆112Updated last month
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆170Updated 3 months ago
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆39Updated 2 months ago
- ☆128Updated 3 weeks ago
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of…☆128Updated 8 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆126Updated 4 months ago
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆66Updated this week
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆178Updated this week
- ☆108Updated 4 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym☆438Updated 3 weeks ago
- Extract full next-token probabilities via language model APIs☆241Updated last year
- A MAD laboratory to improve AI architecture designs 🧪☆111Updated 4 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆105Updated 5 months ago
- ☆114Updated 2 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆87Updated last month
- seqax = sequence modeling + JAX☆154Updated 2 weeks ago
- Functional Benchmarks and the Reasoning Gap☆85Updated 6 months ago
- ☆72Updated 2 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆286Updated last week
- Can Language Models Solve Olympiad Programming?☆115Updated 3 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆187Updated 4 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆86Updated 2 weeks ago