ZYN: Zero-Shot Reward Models with Yes-No Questions
☆35Aug 15, 2023Updated 2 years ago
Alternatives and similar repositories for zero-shot-reward-models
Users that are interested in zero-shot-reward-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A repository for transformer critique learning and generation☆89Dec 7, 2023Updated 2 years ago
- ☆14Aug 15, 2024Updated last year
- Synthetic Data Generation with Execution-Based Verification and Grounding for LLM Training.☆19Feb 7, 2025Updated last year
- Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)☆12Oct 11, 2023Updated 2 years ago
- Implementation of our ICCV 2023 paper DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation☆20Jul 24, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Attentional Neural Network that translates text to phones.☆11Jan 25, 2018Updated 8 years ago
- Serial Contrastive Knowledge Distillation for Continual Few-shot Relation Extraction, Findings of ACL 2023☆13May 12, 2023Updated 2 years ago
- ☆25Aug 23, 2024Updated last year
- Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"☆12Nov 26, 2024Updated last year
- ProofNet dataset ported into Lean 4☆29Jun 9, 2025Updated 9 months ago
- A Library for Scaling Mixed-Integer Optimization-Based Machine Learning.☆11Jun 24, 2024Updated last year
- K12高中数学试题数据集☆16Aug 16, 2023Updated 2 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Official code release of AAAI 2024 paper SayCanPay.☆54Oct 22, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Python tools☆14Oct 22, 2023Updated 2 years ago
- Official repository for "Stylized Adversarial Training" (TPAMI 2022)☆11Dec 30, 2022Updated 3 years ago
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆208May 24, 2023Updated 2 years ago
- ☆98May 30, 2023Updated 2 years ago
- Fill up the `model_list` field in your LiteLLM proxy configuration file☆10Sep 7, 2024Updated last year
- ☆11Nov 8, 2023Updated 2 years ago
- Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?"☆17Jan 31, 2024Updated 2 years ago
- fast trainer for educational purposes☆24Updated this week
- ☆12Jul 7, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆13Apr 12, 2024Updated last year
- ☆32May 23, 2023Updated 2 years ago
- ☆26May 30, 2023Updated 2 years ago
- ☆29May 8, 2024Updated last year
- ☆120May 26, 2025Updated 10 months ago
- Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning☆27Jul 4, 2025Updated 8 months ago
- Evaluation Pipeline for medical tasks.☆12Feb 13, 2026Updated last month
- ☆15Oct 26, 2021Updated 4 years ago
- Chain of Images for Intuitively Reasoning☆10Nov 29, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following☆137Jul 8, 2024Updated last year
- A Keras-based and TensorFlow-backend NLP Models Toolkit.☆12Jul 7, 2022Updated 3 years ago
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆18Jun 3, 2025Updated 9 months ago
- Reproduce the paper Distributed Representations of Sentences and Documents in tensorflow☆14Apr 8, 2017Updated 8 years ago
- The Search for Sparse, Robustness Neural Networks☆11Mar 24, 2023Updated 3 years ago
- ☆11Dec 8, 2022Updated 3 years ago
- A seq2seq with attention dialogue/MT model implemented by TensorFlow.☆11Jul 17, 2018Updated 7 years ago