☆20May 16, 2024Updated last year
Alternatives and similar repositories for Value-Augmented-Sampling
Users that are interested in Value-Augmented-Sampling are comparing it to the libraries listed below
Sorting:
- The rule-based evaluation subset and code implementation of Omni-MATH☆27Dec 23, 2024Updated last year
- This repo explores how AMR to address tasks difficult for LLMs☆13Jan 15, 2024Updated 2 years ago
- 这是我的博客《不用框架,使用Python搭建基于numpy的卷积神经网络来进行cifar-10分类的深度学习系统》的代码实现。☆10Jul 1, 2019Updated 6 years ago
- Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"☆16Mar 18, 2025Updated last year
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆39Jan 16, 2026Updated 2 months ago
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆26Feb 25, 2025Updated last year
- The official repository of "Document Image Machine Translation with Dynamic Multi-pre-trained Models Assembling"☆14Nov 26, 2025Updated 3 months ago
- AAAI 2025: Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs☆18Nov 9, 2024Updated last year
- Code for: "Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models"☆19Feb 2, 2022Updated 4 years ago
- Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment☆82Jun 19, 2024Updated last year
- ☆46Feb 8, 2024Updated 2 years ago
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"☆25Mar 4, 2025Updated last year
- ☆10Mar 1, 2025Updated last year
- ☆13Jul 2, 2025Updated 8 months ago
- [NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li…☆21Jan 9, 2024Updated 2 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- ☆19Aug 4, 2025Updated 7 months ago
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)☆16Oct 23, 2021Updated 4 years ago
- ☆17Jun 11, 2025Updated 9 months ago
- Forcing Diffuse Distributions out of Language Models☆18Sep 10, 2024Updated last year
- The code implementation of the paper Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks (A…☆13Jul 16, 2024Updated last year
- 从零开始无框架python实现卷积神经网络☆13Aug 24, 2020Updated 5 years ago
- This Project extends the concepts of the Anymal C robot developed by the ANYmal group and uses the simulation of this robot coupled with …☆14Jan 9, 2023Updated 3 years ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆23Jun 19, 2023Updated 2 years ago
- Code release for "TempLM: Distilling Language Models into Template-Based Generators"☆14Jul 21, 2022Updated 3 years ago
- SuperTerrain+: A real-time procedural 3D infinite terrain engine with geographical features and photorealistic rendering.☆17Apr 6, 2023Updated 2 years ago
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.☆21Apr 3, 2025Updated 11 months ago
- This repository includes the code implementation of the paper Improving Pacing in Long-Form Story Planning by Yichen Wang, Kevin Yang, Xi…☆17Nov 19, 2024Updated last year
- [MM 2024 Oral] Diffusion Posterior Proximal Sampling for Image Restoration☆18Nov 19, 2024Updated last year
- Official implementation of the paper: [EMNLP 2025] RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruct…☆21Dec 9, 2025Updated 3 months ago
- A comprehensive collection of process reward models.☆141Oct 4, 2025Updated 5 months ago
- Code release for the paper "Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control"☆17Apr 9, 2024Updated last year
- A simple Python wrapper for the ClearNLP constituents-to-dependencies converter☆11Nov 2, 2015Updated 10 years ago
- Accompanying repo for the DP2O paper accepted by AAAI 2024 main conference☆17Mar 28, 2024Updated last year
- MoCo: A One-Stop Shop for Model Collaboration Research☆51Feb 24, 2026Updated 3 weeks ago
- PyDictionary is an offline English dictionary made using Python along with the Wordnet Lexical Database and Enchant Spell Dictionary. The…☆19May 16, 2021Updated 4 years ago
- Official Repository for Westlake Deep Learning Course (2024)☆14Jun 6, 2024Updated last year
- Supporting code for ReCEval paper☆31Sep 14, 2024Updated last year