RewardAnything: Generalizable Principle-Following Reward Models
☆45Jun 11, 2025Updated 9 months ago
Alternatives and similar repositories for RewardAnything
Users that are interested in RewardAnything are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆35Jun 3, 2025Updated 10 months ago
- How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning☆26Aug 29, 2024Updated last year
- ☆44Oct 29, 2025Updated 5 months ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- ☆14Sep 19, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ACL2023] Preserving Commonsense Knowledge from Pre-trained Language Models via Causal Inference☆24Dec 25, 2023Updated 2 years ago
- Data and Baselines for AStitchInLanguageModels dataset☆12Oct 31, 2022Updated 3 years ago
- ☆24Nov 20, 2021Updated 4 years ago
- Enhaced version of Wikiextrator: A wikipedia dumps extractor☆28Sep 17, 2025Updated 6 months ago
- Text-driven human motion generation surveys, datasets and models.☆81Aug 17, 2025Updated 7 months ago
- unofficial implementation of https://arxiv.org/pdf/2301.08871v1.pdf on pytorch☆15Apr 20, 2023Updated 2 years ago
- ☆358Jul 29, 2025Updated 8 months ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆33Feb 28, 2026Updated last month
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆25Feb 16, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆34Jun 29, 2024Updated last year
- Offical Repository of MetaAgent Program☆45Dec 2, 2025Updated 4 months ago
- Very concise example of integrated gradients (a method to reveal areas of attention in input images)☆10Jun 17, 2019Updated 6 years ago
- Structured outputs from DSPy and Jinja2☆27Jun 27, 2025Updated 9 months ago
- [ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners☆19May 28, 2024Updated last year
- Newton–Cotes Graph Neural Networks: On the Time Evolution of Dynamic Systems☆11Oct 19, 2023Updated 2 years ago
- ☆16May 30, 2019Updated 6 years ago
- ☆21Feb 12, 2025Updated last year
- This is the source code of our paper PALT in EMNLP2022.☆13Nov 19, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆16Mar 18, 2026Updated 3 weeks ago
- Enhanced BiLSTM Inference Model for Natural Language Inference☆26May 23, 2018Updated 7 years ago
- ☆16Jul 29, 2025Updated 8 months ago
- [ICLR 2026]🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, mul…☆206Dec 10, 2025Updated 3 months ago
- Multifactor Sequential Disentanglement via Structured Koopman Autoencoders☆20Dec 2, 2024Updated last year
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…☆16Feb 15, 2024Updated 2 years ago
- Rethinking Urban Mobility Prediction: A Super-Multivariate Time Series Forecasting Approach (TITS)☆20Nov 22, 2024Updated last year
- ☆16Jun 19, 2023Updated 2 years ago
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆10Nov 15, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆34Nov 19, 2025Updated 4 months ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated last year
- ☆15Dec 2, 2025Updated 4 months ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆31Aug 18, 2024Updated last year
- I-SHEEP: Iterative Self-enHancEmEnt Paradigm of LLMs through Self-Instruct and Self-Assessment☆17Jan 16, 2025Updated last year
- The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"☆113Sep 29, 2025Updated 6 months ago
- Wuxia style novel generation using T5-PEGASUS model. 中文武侠小说续写☆12Nov 22, 2022Updated 3 years ago