Qwen3-14B Orchestrator Agent Reinforcement Learning. **Achieved 160% improvement** on Stanford's TerminalBench
☆101 · Nov 3, 2025 · Updated 7 months ago
Alternatives and similar repositories for Orca-Agent-RL
Users that are interested in Orca-Agent-RL are comparing it to the libraries listed below.
- A meta-repo that watches karpathy/autoresearch and adjacent systems, distills portable patterns for bounded agent-verifier research lo… ☆43 · May 8, 2026 · Updated last month
- ☆13 · Mar 5, 2025 · Updated last year
- Tree-based speculative decoding for Apple Silicon (MLX). ~10-15% faster than DFlash on code, ~1.5x over autoregressive. First MLX port wi… ☆140 · Apr 15, 2026 · Updated 2 months ago
- ☆15 · May 17, 2022 · Updated 4 years ago
- Minimal Claude Code alternative powered by MLX ☆47 · Jan 11, 2026 · Updated 5 months ago
- ☆12 · Mar 3, 2023 · Updated 3 years ago
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA ☆16 · May 11, 2022 · Updated 4 years ago
- Official Code Repository for the paper "Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes" (ICML 2024). ☆16 · Jul 21, 2024 · Updated last year
- Local lightning-fast semantic code search built for agents ☆42 · Mar 16, 2026 · Updated 3 months ago
- Data for the MTEB leaderboard ☆58 · Jun 12, 2026 · Updated last week
- MEDIQA-Chat Shared Tasks @ ACL-ClinicalNLP 2023 ☆58 · May 15, 2023 · Updated 3 years ago
- LockManager with deadlock detection for implementing 2PL ☆13 · Mar 13, 2019 · Updated 7 years ago
- A macOS .mobileconfig generator for installing fonts on iOS device. 一个 macOS 上的 .mobileconfig 配置生成器,用于给 iOS 设备安装字体。 ☆12 · Jun 24, 2020 · Updated 5 years ago
- Official Implementation For PolarQuant ☆43 · Apr 2, 2026 · Updated 2 months ago
- ☆22 · Sep 29, 2025 · Updated 8 months ago
- ☆39 · Oct 10, 2025 · Updated 8 months ago
- ☆28 · Oct 30, 2025 · Updated 7 months ago
- BERT score for text generation ☆12 · Jan 15, 2025 · Updated last year
- A Windows .mobileconfig generator for installing fonts on iOS device. 一个 Windows 上的 .mobileconfig 配置生成器,用于给 iOS 设备安装字体。 ☆13 · Feb 26, 2020 · Updated 6 years ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut… ☆23 · Dec 4, 2024 · Updated last year
- Procedural data generators suite for synthetic pretraining and formal reasoning ☆42 · Updated this week
- ☆21 · Oct 2, 2025 · Updated 8 months ago
- Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily ☆10 · Jun 8, 2026 · Updated last week
- Code release for "TempLM: Distilling Language Models into Template-Based Generators" ☆14 · Jul 21, 2022 · Updated 3 years ago
- ☆65 · Feb 6, 2026 · Updated 4 months ago
- Testing DeepSpeed integration in 🤗 Accelerate ☆11 · Jun 28, 2022 · Updated 3 years ago
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis ☆11 · Feb 17, 2023 · Updated 3 years ago
- 🎭 Official code and dataset for our CCGPK@COLING 2022 paper - "PersonaChatGen: Generating Personalized Dialogue using GPT-3" ☆13 · Mar 26, 2024 · Updated 2 years ago
- Official implementation of SIGIR 2022 Paper "Task-Oriented Dialogue System as Natural Language Generation". ☆14 · Apr 6, 2022 · Updated 4 years ago
- The official public dataset for Famelack. ☆49 · Jun 12, 2026 · Updated last week
- A Redis-compatible in-memory database server written in Rust with MLua-based Lua 5.1 scripting ☆19 · Nov 28, 2025 · Updated 6 months ago
- MULTIOPED: A Corpus of Multi-Perspective News Editorials. ☆12 · Aug 25, 2021 · Updated 4 years ago
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation ☆11 · Jun 10, 2026 · Updated last week
- ☆243 · Oct 27, 2025 · Updated 7 months ago
- Code for the MTEB Arena ☆25 · Jul 2, 2025 · Updated 11 months ago
- ☆20 · Jan 4, 2026 · Updated 5 months ago
- pichuang personal website ☆19 · Jun 10, 2025 · Updated last year
- ClockBench - Visual Reasoning AI Benchmark ☆32 · Sep 4, 2025 · Updated 9 months ago
- Tool to perform paired evaluation of automatic systems ☆13 · Oct 20, 2021 · Updated 4 years ago