pyember / emberLinks
☆235Updated last week
Alternatives and similar repositories for ember
Users that are interested in ember are comparing it to the libraries listed below
Sorting:
- Curated collection of community environments☆200Updated last week
- Storing long contexts in tiny caches with self-study☆229Updated last month
- ☆116Updated this week
- Training API and CLI☆311Updated last month
- MoE training for Me and You and maybe other people☆319Updated last week
- Super basic implementation (gist-like) of RLMs with REPL environments.☆390Updated last week
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆825Updated this week
- Async RL Training at Scale☆985Updated this week
- Simple & Scalable Pretraining for Neural Architecture Research☆306Updated last month
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆343Updated 3 weeks ago
- ☆949Updated 2 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Updated 10 months ago
- Harbor is a framework for running agent evaluations and creating and using RL environments.☆306Updated last week
- rl from zero pretrain, can it be done? yes.☆286Updated 3 months ago
- PyTorch-native post-training at scale☆585Updated this week
- ☆151Updated 4 months ago
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆267Updated last week
- ☆135Updated 9 months ago
- code for training & evaluating Contextual Document Embedding models☆202Updated 8 months ago
- SIMD quantization kernels☆92Updated 4 months ago
- Inference-time scaling for LLMs-as-a-judge.☆324Updated 2 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆100Updated 5 months ago
- ☆57Updated 11 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆256Updated this week
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆153Updated last year
- Open source interpretability artefacts for R1.☆165Updated 8 months ago
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆134Updated this week
- ☆459Updated last month
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆308Updated 3 weeks ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆141Updated 4 months ago