pyember / emberLinks
☆229Updated 4 months ago
Alternatives and similar repositories for ember
Users that are interested in ember are comparing it to the libraries listed below
Sorting:
- Training-Ready RL Environments + Evals☆132Updated this week
- Storing long contexts in tiny caches with self-study☆201Updated last week
- An interface library for RL post training with environments.☆66Updated this week
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆187Updated 7 months ago
- ☆105Updated this week
- ☆135Updated 7 months ago
- Async RL Training at Scale☆722Updated this week
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆277Updated this week
- Simple & Scalable Pretraining for Neural Architecture Research☆297Updated 2 months ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆726Updated this week
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆260Updated this week
- ☆142Updated last month
- rl from zero pretrain, can it be done? yes.☆277Updated 3 weeks ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆133Updated last month
- Inference-time scaling for LLMs-as-a-judge.☆303Updated 3 weeks ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆204Updated last week
- ☆58Updated 8 months ago
- Training API☆172Updated last week
- Post-training with Tinker☆1,096Updated this week
- PyTorch Single Controller☆528Updated this week
- Long context evaluation for large language models☆224Updated 7 months ago
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆231Updated this week
- Train your own SOTA deductive reasoning model☆108Updated 7 months ago
- SIMD quantization kernels☆87Updated last month
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆145Updated last year
- ☆843Updated last week
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.☆103Updated 3 weeks ago
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.☆296Updated 2 months ago
- Open source interpretability artefacts for R1.☆163Updated 6 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆97Updated this week