ZYN: Zero-Shot Reward Models with Yes-No Questions
☆35 · Aug 15, 2023 · Updated 2 years ago
Alternatives and similar repositories for zero-shot-reward-models
Users that are interested in zero-shot-reward-models are comparing it to the libraries listed below.
- A repository for transformer critique learning and generation ☆89 · Dec 7, 2023 · Updated 2 years ago
- Attentional Neural Network that translates text to phones. ☆11 · Jan 25, 2018 · Updated 8 years ago
- Serial Contrastive Knowledge Distillation for Continual Few-shot Relation Extraction, Findings of ACL 2023 ☆14 · May 12, 2023 · Updated 3 years ago
- ☆26 · Aug 23, 2024 · Updated last year
- ☆39 · Aug 9, 2022 · Updated 3 years ago
- Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models" ☆12 · Nov 26, 2024 · Updated last year
- ProofNet dataset ported into Lean 4 ☆31 · Jun 9, 2025 · Updated 11 months ago
- K12 high school mathematics exam question dataset ☆17 · Aug 16, 2023 · Updated 2 years ago
- Offline RL experiments ☆15 · Oct 1, 2022 · Updated 3 years ago
- Python tools ☆14 · Oct 22, 2023 · Updated 2 years ago
- Official code release of AAAI 2024 paper SayCanPay. ☆54 · Oct 22, 2025 · Updated 7 months ago
- [TPAMI] "Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search"… ☆18 · Jan 4, 2023 · Updated 3 years ago
- Code for Arxiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback ☆208 · May 24, 2023 · Updated 3 years ago
- Scripts to parse arxiv documents for NLP tasks ☆19 · Jun 12, 2023 · Updated 2 years ago
- ☆10 · Oct 11, 2022 · Updated 3 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge ☆21 · Jul 25, 2022 · Updated 3 years ago
- An example FastAPI server that streams messages from Autogen using OpenAI API format ☆15 · Jul 3, 2024 · Updated last year
- ☆98 · May 30, 2023 · Updated 3 years ago
- ☆12 · Jan 17, 2025 · Updated last year
- Unofficial baselines for ManiSkill, including RL and BC algorithms. ☆21 · Jun 6, 2024 · Updated last year
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'. ☆13 · Oct 9, 2024 · Updated last year
- This is the official code base for the paper "Think Before You Act: Decision Transformers with Internal Working Memory" ☆23 · Jul 12, 2024 · Updated last year
- Efficient Conway's Game of Life implemented in Python using NumPy. ☆14 · May 1, 2024 · Updated 2 years ago
- ☆11 · Nov 8, 2023 · Updated 2 years ago
- Interpretability of Machine Learning-Visualizations ☆13 · Jul 9, 2018 · Updated 7 years ago
- fast trainer for educational purposes ☆26 · May 4, 2026 · Updated 3 weeks ago
- Neuro-Symbolic Hierarchical Rule Induction ☆14 · Dec 31, 2022 · Updated 3 years ago
- ☆13 · Apr 12, 2024 · Updated 2 years ago
- A Visualizer for prosodically annotated speech corpora ☆12 · Oct 27, 2021 · Updated 4 years ago
- ☆33 · May 23, 2023 · Updated 3 years ago
- ☆28 · May 8, 2024 · Updated 2 years ago
- ☆26 · May 30, 2023 · Updated 3 years ago
- Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning ☆27 · Jul 4, 2025 · Updated 10 months ago
- ☆15 · Oct 26, 2021 · Updated 4 years ago
- Chain of Images for Intuitively Reasoning ☆10 · Nov 29, 2023 · Updated 2 years ago
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following ☆138 · Jul 8, 2024 · Updated last year
- A Keras-based and TensorFlow-backend NLP Models Toolkit. ☆12 · Jul 7, 2022 · Updated 3 years ago
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? ☆19 · Jun 3, 2025 · Updated 11 months ago
- Reproduce the paper Distributed Representations of Sentences and Documents in tensorflow ☆14 · Apr 8, 2017 · Updated 9 years ago