Accelerating RL for LLM Reasoning with Optimal Advantage Regression
☆35May 30, 2025Updated 8 months ago
Alternatives and similar repositories for A-PO
Users who are interested in A-PO are comparing it to the repositories listed below
- Reinforcement Learning via Regressing Relative Rewards☆39Dec 12, 2024Updated last year
- [IROS 2025] EgoLoc: Zero-Shot Temporal Interaction Localization for Egocentric Videos☆32Jan 13, 2026Updated last month
- ☆10Jun 14, 2024Updated last year
- Husky-LIO-SAM☆12Feb 23, 2023Updated 3 years ago
- Monorepo blueprint for developer platform☆11Dec 22, 2025Updated 2 months ago
- An android app with machine learning.☆11Jun 30, 2017Updated 8 years ago
- Mitigating the Filter Bubble while Maintaining Relevance: Targeted Diversification with VAE-based Recommender Systems☆10Mar 15, 2023Updated 2 years ago
- ☆74Jun 28, 2025Updated 7 months ago
- ☆10Jul 27, 2023Updated 2 years ago
- Synthetic Camera Simulator - Unreal Engine4 Plugin☆10Nov 2, 2019Updated 6 years ago
- C++ event and statemachine framework☆12Jan 7, 2026Updated last month
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025)☆15Jan 6, 2026Updated last month
- [ICLR 2026 🔥] Official pytorch implementation for "Attention Is All You Need for KV Cache in Diffusion LLMs"☆37Jan 23, 2026Updated last month
- CUDA Accelerated ORB-SLAM2☆10Sep 7, 2022Updated 3 years ago
- [CVPR 2024] PLGSLAM☆12Nov 24, 2025Updated 3 months ago
- A standalone Windows CROSSTOOL for Bazel☆11Jul 21, 2020Updated 5 years ago
- The code for On Robust Cross-View Consistency in Outdoor Self-Supervised Monocular Depth Estimation☆13Jun 2, 2023Updated 2 years ago
- ☆15Jul 31, 2025Updated 6 months ago
- Image readout, processing and SLAM library☆11Jun 3, 2022Updated 3 years ago
- ☆13Jul 17, 2024Updated last year
- ☆10Aug 7, 2024Updated last year
- [RA-L 2025] Bayesian NeRF☆15Jan 22, 2025Updated last year
- Information-based Active SLAM via Topological Feature Graphs☆11Aug 7, 2022Updated 3 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆13May 28, 2025Updated 8 months ago
- Android HAL☆10Dec 27, 2025Updated 2 months ago
- simple sensor calibration toolbox for camera, lidar, imu based on ROS2☆14Oct 2, 2025Updated 4 months ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆14Jun 28, 2025Updated 7 months ago
- 3D Scene Flow Estimation☆14Sep 24, 2025Updated 5 months ago
- Deprecated - see our other repos for Bazel examples☆10Mar 22, 2022Updated 3 years ago
- [CoRL 2024] Software and hardware instructions for SonicSense.☆16Mar 1, 2025Updated 11 months ago
- Deep Learning for 3D Point Clouds☆10Feb 26, 2020Updated 6 years ago
- Build dependencies and tools used across @typedb repositories (not for public)☆11Updated this week
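- Fairly verbose annotations of the monocular part of ORB-SLAM; uses the latest local version of g2o; includes some personal main programs for format conversion (Example/zzz_QXC_Test)☆10Nov 15, 2019Updated 6 years ago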
- Applied modern C/C++ in calculus, discrete mathematics, robotics and machine learning with CMake.☆11Jan 22, 2026Updated last month
- A mass-spring system simulator that animates realistic hanging, pinned, falling, colliding, and folding cloth behaviors.☆10May 18, 2021Updated 4 years ago
- Evaluate mapping quality using intrinsic and extrinsic metrics☆12Mar 17, 2022Updated 3 years ago
- Bazel defs and rules for building Python projects with nanobind extensions.☆12Feb 4, 2026Updated 3 weeks ago
- Demand Forecasting is the process in which historical sales data is used to develop an estimate of an expected forecast of customer deman…☆12Jul 13, 2020Updated 5 years ago
- A Homemade game engine written in C++23 with Vulkan☆10Aug 23, 2024Updated last year