pyember / emberLinks
☆232Updated 4 months ago
Alternatives and similar repositories for ember
Users that are interested in ember are comparing it to the libraries listed below
Sorting:
- Training-Ready RL Environments + Evals☆164Updated last week
- Storing long contexts in tiny caches with self-study☆213Updated 3 weeks ago
- ☆106Updated 3 weeks ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Updated 8 months ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆743Updated this week
- ☆59Updated 9 months ago
- Async RL Training at Scale☆749Updated last week
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆291Updated last week
- ☆135Updated 7 months ago
- ☆143Updated 2 months ago
- rl from zero pretrain, can it be done? yes.☆280Updated last month
- Inference-time scaling for LLMs-as-a-judge.☆308Updated last week
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆287Updated this week
- Simple & Scalable Pretraining for Neural Architecture Research☆299Updated 2 weeks ago
- Long context evaluation for large language models☆224Updated 8 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆149Updated last year
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆241Updated last week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆100Updated this week
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆138Updated 2 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆242Updated 3 weeks ago
- Training API☆214Updated this week
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆258Updated last week
- code for training & evaluating Contextual Document Embedding models☆200Updated 6 months ago
- Open source interpretability artefacts for R1.☆163Updated 6 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆97Updated 3 months ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆336Updated 11 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆252Updated this week
- SIMD quantization kernels☆92Updated 2 months ago
- ⚖️ Awesome LLM Judges ⚖️☆133Updated 6 months ago
- accompanying material for sleep-time compute paper☆117Updated 6 months ago