RL algorithm: Advantage-Induced Policy Alignment
☆66Aug 11, 2023Updated 2 years ago
Alternatives and similar repositories for RLHF-APA
Users that are interested in RLHF-APA are comparing it to the libraries listed below.
- ☆15Jun 29, 2024Updated 2 years ago
- ☆18May 9, 2025Updated last year
- This data set contains accelerometer and gyroscope recordings from over 200 participants performing various gym exercises. This data set …☆37Jun 16, 2023Updated 3 years ago
- ☆16Jul 29, 2025Updated 11 months ago
- Scalable Educational Experiences with Digital Scaffolding☆14Jun 5, 2026Updated 3 weeks ago
- [IEEE TMI 2024] Prototype-Guided Graph Reasoning Network for Few-Shot Medical Image Segmentation☆13Jun 13, 2025Updated last year
- ☆284Jan 6, 2025Updated last year
- Woodgrove groceries custom authentication extension REST API demo☆36Oct 10, 2025Updated 8 months ago
- Direct preference optimization with f-divergences.☆17Nov 3, 2024Updated last year
- Content built for the community, with love, by The Fabric Customer Advisory Team!☆26Aug 18, 2025Updated 10 months ago
- Monorepo for contributor extension packages to Fluent UI☆55Updated this week
- Invite OpenAI to your Teams calls to assist with Q&A right in chat.☆28Jan 9, 2024Updated 2 years ago
- Examples, samples and write ups to help educate and accelerate development and adoption of power platform including Canvas Apps, Model Ap…☆37Jan 18, 2024Updated 2 years ago
- [TIP 2025] This is an official PyTorch implementation of "Zero-Shot Skeleton-Based Action Recognition With Prototype-Guided Feature Align…☆36Jul 24, 2025Updated 11 months ago
- [IEEE TBD 2023] IEMask R-CNN: Information-enhanced Mask R-CNN☆16Mar 14, 2023Updated 3 years ago
- ☆98May 30, 2023Updated 3 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆21Jul 18, 2023Updated 2 years ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Jun 10, 2024Updated 2 years ago
- Woodgrove groceries demo web application☆77Aug 7, 2025Updated 10 months ago
- A multi-threaded C++ implementation of Nickel & Kiela's "Poincare Embeddings" paper from NIPS 2017, following the implementation of the a…☆18Jun 6, 2018Updated 8 years ago
- Dataset with coverage annotations for HumanEval dataset☆25Aug 17, 2023Updated 2 years ago
- Self-Alignment with Principle-Following Reward Models☆170Sep 18, 2025Updated 9 months ago
- Code and files for the paper "Are Emergent Abilities in Large Language Models just In-Context Learning"☆33Jan 9, 2025Updated last year
- ☆22Aug 30, 2021Updated 4 years ago
- ☆35Jan 29, 2023Updated 3 years ago
- Docker for Visual Studio Code: Extensibility Model☆20Jun 19, 2026Updated last week
- Repository for the paper Stream of Search: Learning to Search in Language☆153Feb 3, 2025Updated last year
- GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!☆20Oct 29, 2022Updated 3 years ago
- An up-to-date list of progress made in next-generation AI.☆11Apr 2, 2023Updated 3 years ago
- MLOps Model Factory is an end-to-end workflow that supports generating multiple models and deploying them to any target.☆10May 9, 2024Updated 2 years ago
- Implementation of MixCE method described in ACL 2023 paper by Zhang et al.☆20May 29, 2023Updated 3 years ago
- Clone-voice-with-Pytorch☆11Sep 9, 2022Updated 3 years ago
- Azure Machine Learning - MLOps Python SDKv2☆10Jul 24, 2023Updated 2 years ago
- GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators☆62Dec 23, 2025Updated 6 months ago
- Academic Resources for the Courses at IIITD - Monsoon 2021 onwards☆10Sep 23, 2023Updated 2 years ago
- GenRM-CoT: Data release for verification rationales☆68Oct 16, 2024Updated last year
- Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023☆28Mar 7, 2024Updated 2 years ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆11Jul 1, 2024Updated last year
- ☆125Jun 2, 2026Updated 3 weeks ago