RewardAnything: Generalizable Principle-Following Reward Models
☆44Jun 11, 2025Updated 10 months ago
Alternatives and similar repositories for RewardAnything
Users that are interested in RewardAnything are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆36Jun 3, 2025Updated 10 months ago
- How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning☆26Aug 29, 2024Updated last year
- The official implementation of InfoRM [NeurIPS 2024].☆15Oct 25, 2025Updated 6 months ago
- ☆45Oct 29, 2025Updated 6 months ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Apr 2, 2024Updated 2 years ago
- [ACL2023] Preserving Commonsense Knowledge from Pre-trained Language Models via Causal Inference☆24Dec 25, 2023Updated 2 years ago
- This repository contains all the source code needed to reproduce the experiments or review the results obtained in the research paper "…☆13Dec 9, 2023Updated 2 years ago
- ☆13Oct 2, 2024Updated last year
- ☆26Nov 20, 2021Updated 4 years ago
- Data and Baselines for AStitchInLanguageModels dataset☆12Oct 31, 2022Updated 3 years ago
- ☆13Jul 2, 2025Updated 9 months ago
- 中文公开聊天语料库☆11Nov 5, 2018Updated 7 years ago
- ☆359Jul 29, 2025Updated 9 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆35Feb 28, 2026Updated 2 months ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆25Feb 16, 2026Updated 2 months ago
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆35Jun 29, 2024Updated last year
- Offical Repository of MetaAgent Program☆46Dec 2, 2025Updated 4 months ago
- [ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners☆20May 28, 2024Updated last year
- clip retrieval benchmark☆17May 4, 2022Updated 3 years ago
- (WSDM'24) Cross-modal Self-Supervised Learning for Time-series through Latent Masking☆21Feb 20, 2024Updated 2 years ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆31Oct 27, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆16May 30, 2019Updated 6 years ago
- PyTorch Implementation of Lusch et al DeepKoopman☆18Jan 5, 2023Updated 3 years ago
- ☆21Feb 12, 2025Updated last year
- [ACL2023] WhitenedCSE: Whitening-based Contrastive Learning of Sentence Embeddings☆18Sep 12, 2023Updated 2 years ago
- This is the source code of our paper PALT in EMNLP2022.☆13Nov 19, 2022Updated 3 years ago
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆16Mar 18, 2026Updated last month
- 🔧 Custom utils. 供日常使用的脚本小工具。☆10Jun 14, 2024Updated last year
- Enhanced BiLSTM Inference Model for Natural Language Inference☆26May 23, 2018Updated 7 years ago
- ☆16Jul 29, 2025Updated 9 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Multifactor Sequential Disentanglement via Structured Koopman Autoencoders☆20Dec 2, 2024Updated last year
- ☆16Jun 19, 2023Updated 2 years ago
- Code for "APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training"☆41Dec 23, 2025Updated 4 months ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated last year
- ☆15Apr 11, 2026Updated 2 weeks ago
- I-SHEEP: Iterative Self-enHancEmEnt Paradigm of LLMs through Self-Instruct and Self-Assessment☆17Jan 16, 2025Updated last year
- The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"☆115Sep 29, 2025Updated 7 months ago