☆31Dec 3, 2025Updated 3 months ago
Alternatives and similar repositories for VERL
Users who are interested in VERL are comparing it to the libraries listed below.
- ☆10Jan 19, 2022Updated 4 years ago
- Text2GraphRAG Disease Assistant builds a disease-focused retrieval-augmented generation workflow. It ingests structured Markdown (demo: o…☆42Nov 20, 2025Updated 3 months ago
- ☆55Mar 5, 2025Updated 11 months ago
- [ICDAR-DALL-2025] PALM-LAY is the first unified, cross-regional annotated dataset specifically designed for layout analysis of historical…☆40Dec 30, 2025Updated 2 months ago
- One-click synchronization tool for MCP configuration☆45Jan 29, 2026Updated last month
- Attention-based Deep Reinforcement Learning framework for portfolio allocation on S&P 500 equities. Includes custom environment, policy a…☆163Oct 16, 2025Updated 4 months ago
- CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics☆27Nov 1, 2025Updated 4 months ago
- AAAI2025☆11Apr 18, 2025Updated 10 months ago
- An open-source AI command-line tool that brings multi-model AI agents, intelligent workflows, and spec-driven development to your terminal.☆121Nov 23, 2025Updated 3 months ago
- A navigation algorithm based on CMU team's open-source local planner☆118Oct 9, 2025Updated 4 months ago
- A repository made to host something like a tiny framework to apply heuristics and metaheuristics to the multidimensional knapsack problem…☆10Aug 16, 2015Updated 10 years ago
- 🏅Toss NEXT ML CHALLENGE: submission repository for the 5th-place model in the ad click prediction (CTR) competition🏅☆26Feb 2, 2026Updated last month
- PyTorch implementation of Detective☆12Jul 11, 2024Updated last year
- ☆15Nov 18, 2025Updated 3 months ago
- A blockchain simulator based on SimPy in Python.☆14Dec 18, 2018Updated 7 years ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Feb 9, 2026Updated 3 weeks ago
- Official Repository of LatentSeek☆77Jun 6, 2025Updated 8 months ago
- ☆112Oct 16, 2025Updated 4 months ago
- Agentic Virtual Lab☆19Nov 30, 2025Updated 3 months ago
- ☆18Apr 10, 2025Updated 10 months ago
- PyCausalSim is a Python framework for discovering and validating causal relationships through simulation. Unlike traditional analytics th…☆32Dec 8, 2025Updated 2 months ago
- Code for NeurIPS 2024 paper — Cross-Device Collaborative Test-Time Adaptation☆13Feb 28, 2025Updated last year
- [CVPR2025] Code Release of Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception☆21Jun 17, 2025Updated 8 months ago
- A Google Chrome extension that displays the browser's bookmarks bar as a sequential tiled layout and replaces the current new tab page.☆33Oct 28, 2025Updated 4 months ago
- ☆11Feb 21, 2022Updated 4 years ago
- The official implementation of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration"☆24Feb 4, 2026Updated 3 weeks ago
- [EMNLP'22] Weakly-Supervised Temporal Article Grounding☆14Nov 25, 2023Updated 2 years ago
- This repo is the official implementation of "Euclid’s Gift: Enhancing Spatial Perception and Reasoning in Vision‑Language Models via Geom…☆27Nov 7, 2025Updated 3 months ago
- This repo is a reproduction of Channel_pruning☆11May 17, 2018Updated 7 years ago
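- 📚 TG-EDU comprehensive education platform | Supports assignment submission 📝, batch grading ✅, late-submission requests 🔄, team collaboration 👥, and grade statistics 📊☆111Dec 3, 2025Updated 3 months ago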
- ☆12Oct 30, 2021Updated 4 years ago
- [ICLR 2025] COME: Test-time Adaption by Conservatively Minimizing Entropy☆18Mar 5, 2025Updated 11 months ago
- ☆30Feb 15, 2026Updated 2 weeks ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆31Updated this week
- PyTorch implementation of QKAN "Quantum-inspired Kolmogorov-Arnold Network" https://arxiv.org/abs/2509.14026☆20Updated this week
- Example code repository for <Hands-On LLM> (Hanbit Media, 2025)☆34Jan 4, 2026Updated 2 months ago
- Working Memory Attack on LLMs☆17May 27, 2025Updated 9 months ago
- ☆10Oct 30, 2021Updated 4 years ago
- Face Identification using ONNX Runtime☆13Jul 4, 2024Updated last year