tzafon / Tzafon-WayPointLinks
Tzafon-WayPoint is a robust, scalable solution for managing large fleets of browser instances. WayPoint stands out with unmatched cold‑start speed—launching up to a 1000 browser per second on standard GCP hardware.
☆74Updated 6 months ago
Alternatives and similar repositories for Tzafon-WayPoint
Users that are interested in Tzafon-WayPoint are comparing it to the libraries listed below
Sorting:
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆726Updated this week
- ☆229Updated 4 months ago
- Async RL Training at Scale☆722Updated this week
- Training-Ready RL Environments + Evals☆132Updated this week
- Plotting (entropy, varentropy) for small LMs☆98Updated 5 months ago
- A framework for optimizing DSPy programs with RL☆208Updated this week
- OSS RL environment + evals toolkit☆192Updated this week
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆124Updated 7 months ago
- Inference-time scaling for LLMs-as-a-judge.☆303Updated 3 weeks ago
- ☆68Updated 5 months ago
- ☆124Updated 10 months ago
- The State Of The Art, intelligence☆154Updated 2 months ago
- ⚖️ Awesome LLM Judges ⚖️☆132Updated 5 months ago
- smol models are fun too☆93Updated 11 months ago
- ☆135Updated 7 months ago
- rl from zero pretrain, can it be done? yes.☆277Updated 3 weeks ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers☆100Updated this week
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆450Updated last year
- Testing baseline LLMs performance across various models☆319Updated 2 weeks ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆204Updated last week
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆312Updated 4 months ago
- ☆62Updated 3 months ago
- 🦾💻🌐 distributed training & serverless inference at scale on RunPod☆18Updated last year
- An interface library for RL post training with environments.☆66Updated this week
- Train your own SOTA deductive reasoning model☆108Updated 7 months ago
- Claude Deep Research config for Claude Code.☆223Updated 7 months ago
- ☆105Updated this week
- Storing long contexts in tiny caches with self-study☆201Updated last week
- Open source interpretability artefacts for R1.☆163Updated 6 months ago
- smolLM with Entropix sampler on pytorch☆150Updated 11 months ago