ZYN: Zero-Shot Reward Models with Yes-No Questions
☆35Aug 15, 2023Updated 2 years ago
Alternatives and similar repositories for zero-shot-reward-models
Users that are interested in zero-shot-reward-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A repository for transformer critique learning and generation☆89Dec 7, 2023Updated 2 years ago
- Synthetic Data Generation with Execution-Based Verification and Grounding for LLM Training.☆22Feb 7, 2025Updated last year
- Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)☆12Oct 11, 2023Updated 2 years ago
- Serial Contrastive Knowledge Distillation for Continual Few-shot Relation Extraction, Findings of ACL 2023☆14May 12, 2023Updated 3 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆26Aug 23, 2024Updated last year
- Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"☆12Nov 26, 2024Updated last year
- ProofNet dataset ported into Lean 4☆31Jun 9, 2025Updated last year
- A Library for Scaling Mixed-Integer Optimization-Based Machine Learning.☆11Jun 24, 2024Updated last year
- K12高中数学试题数据集☆17Aug 16, 2023Updated 2 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Python tools☆14Oct 22, 2023Updated 2 years ago
- Official code release of AAAI 2024 paper SayCanPay.☆54Oct 22, 2025Updated 7 months ago
- Code for the papers "Induction of Subgoal Automata for Reinforcement Learning" (AAAI-20) and "Induction and Exploitation of Subgoal Autom…☆14Aug 15, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [TPAMI] "Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search"…☆18Jan 4, 2023Updated 3 years ago
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆208May 24, 2023Updated 3 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- ☆98May 30, 2023Updated 3 years ago
- ☆12Jan 17, 2025Updated last year
- Unofficial baselines for ManiSkill, including RL and BC algorithms.☆21Jun 6, 2024Updated 2 years ago
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆13Oct 9, 2024Updated last year
- Fill up the `model_list` field in your LiteLLM proxy configuration file☆10Sep 7, 2024Updated last year
- ☆11Nov 8, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?"☆17Jan 31, 2024Updated 2 years ago
- fast trainer for educational purposes☆26Jun 4, 2026Updated 2 weeks ago
- Neuro-Symbolic Hierarchical Rule Induction☆14Dec 31, 2022Updated 3 years ago
- ☆13Apr 12, 2024Updated 2 years ago
- A Visualizer for prosodically annotated speech corpora☆12Oct 27, 2021Updated 4 years ago
- ☆33May 23, 2023Updated 3 years ago
- ☆28May 8, 2024Updated 2 years ago
- ☆26May 30, 2023Updated 3 years ago
- Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning☆29Jul 4, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Evaluation Pipeline for medical tasks.☆12Apr 8, 2026Updated 2 months ago
- Chain of Images for Intuitively Reasoning☆10Nov 29, 2023Updated 2 years ago
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following☆138Jul 8, 2024Updated last year
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆19Jun 3, 2025Updated last year
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆18Jun 9, 2024Updated 2 years ago
- The Search for Sparse, Robustness Neural Networks☆11Mar 24, 2023Updated 3 years ago
- 南京大学教务网抢课系统☆15Mar 4, 2021Updated 5 years ago