ZYN: Zero-Shot Reward Models with Yes-No Questions
☆35Aug 15, 2023Updated 2 years ago
Alternatives and similar repositories for zero-shot-reward-models
Users that are interested in zero-shot-reward-models are comparing it to the libraries listed below.
- A repository for transformer critique learning and generation☆89Dec 7, 2023Updated 2 years ago
- ☆14Aug 15, 2024Updated last year
- Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)☆12Oct 11, 2023Updated 2 years ago
- Implementation of our ICCV 2023 paper DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation☆20Jul 24, 2023Updated 2 years ago
- Attentional Neural Network that translates text to phones.☆11Jan 25, 2018Updated 8 years ago
- Serial Contrastive Knowledge Distillation for Continual Few-shot Relation Extraction, Findings of ACL 2023☆14May 12, 2023Updated 2 years ago
- [COLING 2025🔥] Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection☆17Jan 21, 2025Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- ☆39Aug 9, 2022Updated 3 years ago
- ☆25Aug 23, 2024Updated last year
- Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"☆12Nov 26, 2024Updated last year
- A Library for Scaling Mixed-Integer Optimization-Based Machine Learning.☆11Jun 24, 2024Updated last year
- K12高中数学试题数据集☆17Aug 16, 2023Updated 2 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Python tools☆14Oct 22, 2023Updated 2 years ago
- Official code release of AAAI 2024 paper SayCanPay.☆54Oct 22, 2025Updated 6 months ago
- Code for the papers "Induction of Subgoal Automata for Reinforcement Learning" (AAAI-20) and "Induction and Exploitation of Subgoal Autom…☆14Aug 15, 2023Updated 2 years ago
- [TPAMI] "Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search"…☆17Jan 4, 2023Updated 3 years ago
- ☆10Dec 3, 2020Updated 5 years ago
- Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback☆208May 24, 2023Updated 2 years ago
- ☆13Dec 22, 2021Updated 4 years ago
- A Task of Fictitious Unlearning for VLMs☆27Apr 6, 2025Updated last year
- Scripts to parse arxiv documents for NLP tasks☆19Jun 12, 2023Updated 2 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- ☆98May 30, 2023Updated 2 years ago
- ☆12Jan 17, 2025Updated last year
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆13Oct 9, 2024Updated last year
- This is the official code base for the paper "Think Before You Act: Decision Transformers with Internal Working Memory"☆23Jul 12, 2024Updated last year
- Efficient Conway's Game of Life implemented in Python using NumPy.☆14May 1, 2024Updated 2 years ago
- ☆11Nov 8, 2023Updated 2 years ago
- Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?"☆17Jan 31, 2024Updated 2 years ago
- Neuro-Symbolic Hierarchical Rule Induction☆14Dec 31, 2022Updated 3 years ago
- ☆13Apr 12, 2024Updated 2 years ago
- A Visualizer for prosodically annotated speech corpora☆12Oct 27, 2021Updated 4 years ago
- ☆28May 8, 2024Updated 2 years ago
- ☆26May 30, 2023Updated 2 years ago
- ☆121May 26, 2025Updated 11 months ago
- Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning☆27Jul 4, 2025Updated 10 months ago