Scaling Coding-Agent RL to 32x H100s. **Achieving 160% improvement** on Stanford's TerminalBench
☆98Nov 3, 2025Updated 5 months ago
Alternatives and similar repositories for Orca-Agent-RL
Users that are interested in Orca-Agent-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A meta-repo that watches karpathy/autoresearch and adjacent systems, distills portable patterns for bounded agent-verifier research lo…☆42Mar 11, 2026Updated last month
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 5 months ago
- Minimal Claude Code alternative powered by MLX☆46Jan 11, 2026Updated 3 months ago
- ☆20May 14, 2025Updated 11 months ago
- ☆12Mar 3, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- tuimorphic choose-your-own-adventure story game☆18Mar 3, 2026Updated last month
- Official Code Repository for the paper "Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes" (ICML 2024).☆16Jul 21, 2024Updated last year
- BERT Sentiment Classification on the IMDb Large Movie Review Dataset.☆16Sep 8, 2022Updated 3 years ago
- ☆15Oct 4, 2024Updated last year
- Data for the MTEB leaderboard☆50Updated this week
- ☆22Dec 18, 2025Updated 4 months ago
- lol☆10Mar 12, 2021Updated 5 years ago
- ☆14Dec 13, 2022Updated 3 years ago
- Fixes the rotation of the images based on EXIF data☆15Apr 6, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Claude Code Session Debugger & Performance Analyzer☆73Apr 5, 2026Updated 2 weeks ago
- CDK stack for deploying an s3 bucket with cloudfront for asset serving☆15Oct 24, 2023Updated 2 years ago
- JANG — GGUF for MLX. YOU MUST USE JANG_Q RUNTIME. Adaptive Mixed-Precision Quantization + Runtime for Apple Silicon☆99Apr 6, 2026Updated last week
- ☆27Oct 30, 2025Updated 5 months ago
- ☆38Oct 10, 2025Updated 6 months ago
- A Windows .mobileconfig generator for installing fonts on iOS device. 一个 Windows 上的 .mobileconfig 配置生成器,用于给 iOS 设备安装字体。☆13Feb 26, 2020Updated 6 years ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆23Dec 4, 2024Updated last year
- Procedural data generators suite for synthetic pretraining and formal reasoning☆36Updated this week
- Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily☆10Updated this week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆98Feb 26, 2026Updated last month
- ☆62Feb 6, 2026Updated 2 months ago
- Testing DeepSpeed integration in 🤗 Accelerate☆11Jun 28, 2022Updated 3 years ago
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis☆11Feb 17, 2023Updated 3 years ago
- Official implementation of SIGIR 2022 Paper "Task-Oriented Dialogue System as Natural Language Generation".☆14Apr 6, 2022Updated 4 years ago
- A Redis-compatible in-memory database server written in Rust with MLua-based Lua 5.1 scripting☆18Nov 28, 2025Updated 4 months ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11Mar 10, 2019Updated 7 years ago
- MULTIOPED: A Corpus of Multi-Perspective News Editorials.☆12Aug 25, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation☆11Mar 7, 2023Updated 3 years ago
- The official Open Component Model Specification☆16Mar 30, 2026Updated 2 weeks ago
- SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.☆118Apr 14, 2025Updated last year
- Code for the MTEB Arena☆24Jul 2, 2025Updated 9 months ago
- TypeScript port of Google's Agent Development Kit (ADK): An open-source, code-first toolkit for building, evaluating, and deploying AI ag…☆38Nov 4, 2025Updated 5 months ago
- pichuang personal website☆19Jun 10, 2025Updated 10 months ago
- Tensorflow implementation of "Meta Dropout: Learning to Perturb Latent Features for Generalization" (ICLR 2020)☆27Apr 27, 2020Updated 5 years ago