☆17 · Dec 23, 2025 · Updated 3 months ago
Alternatives and similar repositories for self-verification
Users that are interested in self-verification are comparing it to the libraries listed below.
- ☆29 · Oct 8, 2025 · Updated 5 months ago
- ☆26 · Jan 5, 2026 · Updated 2 months ago
- ☆19 · Oct 27, 2025 · Updated 4 months ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D… ☆18 · Nov 8, 2024 · Updated last year
- Official Repository of "Learning what reinforcement learning can't" ☆80 · Dec 30, 2025 · Updated 2 months ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation" ☆18 · Oct 5, 2024 · Updated last year
- The implementation of IJCAI'22 paper "Multi-Agent Concentrative Coordination with Decentralized Task Representation". ☆18 · May 1, 2022 · Updated 3 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration” ☆26 · Mar 6, 2023 · Updated 3 years ago
- [ICML 2024] Learning Reward for Robot Skills Using Large Language Models via Self-Alignment ☆18 · Aug 22, 2024 · Updated last year
- Code accompanying paper titled "Algorithmic Guarantees for Inverse Imaging with Untrained Network Priors" ☆10 · Sep 8, 2019 · Updated 6 years ago
- A python module designed for agile RL algorithm developing. ☆26 · Jul 11, 2024 · Updated last year
- ☆24 · Jul 26, 2025 · Updated 7 months ago
- Source code for the paper "Memory-Efficient Fine-Tuning via Low-Rank Activation Compression" ☆14 · Aug 1, 2025 · Updated 7 months ago
- ☆34 · Jan 27, 2025 · Updated last year
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection ☆22 · Feb 5, 2026 · Updated last month
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning ☆45 · Jun 24, 2025 · Updated 9 months ago
- ☆16 · Jun 5, 2020 · Updated 5 years ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch. ☆27 · Jan 27, 2026 · Updated last month
- ☆10 · Jul 11, 2022 · Updated 3 years ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training ☆47 · Jul 18, 2025 · Updated 8 months ago
- Generate descriptions automatically for 3D shapes in ShapeNet via cross-modal joint embedding ☆16 · Jan 4, 2019 · Updated 7 years ago
- [NeurIPS 2024] Official code for the paper 'RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier' ☆14 · Aug 22, 2025 · Updated 7 months ago
- ☆10 · Jan 28, 2024 · Updated 2 years ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space" ☆25 · Jul 21, 2025 · Updated 8 months ago
- ☆19 · Mar 14, 2023 · Updated 3 years ago
- Brax + Pufferlib + CARBS for gpu-accelerated robotics RL ☆12 · Jun 12, 2025 · Updated 9 months ago
- Imitation learning from multiple experts ☆13 · Aug 29, 2022 · Updated 3 years ago
- Official Code For "Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models" ☆20 · Mar 25, 2025 · Updated last year
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training ☆13 · Feb 15, 2025 · Updated last year
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning> ☆11 · Oct 8, 2021 · Updated 4 years ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization ☆24 · Mar 6, 2026 · Updated 2 weeks ago
- ☆15 · Nov 29, 2024 · Updated last year
- Re-implementations of SOTA RL algorithms. ☆137 · Sep 7, 2023 · Updated 2 years ago
- This repository is the official implementation of ED-NeRF. ☆12 · Apr 24, 2024 · Updated last year
- ☆12 · May 14, 2024 · Updated last year
- Cloud task scheduling simulation platform ☆13 · Mar 11, 2020 · Updated 6 years ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality ☆21 · Oct 8, 2024 · Updated last year
- Format your bibtex (.bib) file to help standardize citations for conference and journal submissions ☆14 · Nov 23, 2025 · Updated 4 months ago
- ☆14 · Oct 11, 2023 · Updated 2 years ago