☆20May 16, 2024Updated 2 years ago
Alternatives and similar repositories for Value-Augmented-Sampling
Users that are interested in Value-Augmented-Sampling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The rule-based evaluation subset and code implementation of Omni-MATH☆27Dec 23, 2024Updated last year
- This repo explores how AMR to address tasks difficult for LLMs☆13Jan 15, 2024Updated 2 years ago
- Central repository for all public AIDA resources☆13Mar 1, 2021Updated 5 years ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆39Jan 16, 2026Updated 4 months ago
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆27Feb 25, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment☆82Jun 19, 2024Updated last year
- Code for: "Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models"☆19Feb 2, 2022Updated 4 years ago
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"☆27Mar 4, 2025Updated last year
- ☆10Mar 1, 2025Updated last year
- ☆46Feb 8, 2024Updated 2 years ago
- ☆13Jul 2, 2025Updated 11 months ago
- [EMNLP 2025] RouterLens☆29Sep 15, 2025Updated 8 months ago
- [NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li…☆21Jan 9, 2024Updated 2 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆19Aug 4, 2025Updated 10 months ago
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)☆16Oct 23, 2021Updated 4 years ago
- Forcing Diffuse Distributions out of Language Models☆18Sep 10, 2024Updated last year
- The code implementation of the paper Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks (A…☆13Jul 16, 2024Updated last year
- Finding of ACL2023: Clustering-Aware Negative Sampling for Unsupervised Sentence Representation☆13Oct 16, 2023Updated 2 years ago
- 从零开始无框架python实现卷积神经网络☆13Aug 24, 2020Updated 5 years ago
- ☆14Nov 14, 2023Updated 2 years ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆23Jun 19, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code release for "TempLM: Distilling Language Models into Template-Based Generators"☆14Jul 21, 2022Updated 3 years ago
- [MM 2024 Oral] Diffusion Posterior Proximal Sampling for Image Restoration☆18Nov 19, 2024Updated last year
- Official implementation of the paper: [EMNLP 2025] RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruct…☆21Dec 9, 2025Updated 6 months ago
- ☆12Mar 14, 2024Updated 2 years ago
- This repository includes the code implementation of the paper Improving Pacing in Long-Form Story Planning by Yichen Wang, Kevin Yang, Xi…☆17Nov 19, 2024Updated last year
- A simple Python wrapper for the ClearNLP constituents-to-dependencies converter☆11Nov 2, 2015Updated 10 years ago
- Code release for the paper "Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control"☆17Apr 9, 2024Updated 2 years ago
- Accompanying repo for the DP2O paper accepted by AAAI 2024 main conference☆17Mar 28, 2024Updated 2 years ago
- Official Repository for Westlake Deep Learning Course (2024)☆14Jun 6, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An implementation of SEAL: Safety-Enhanced Aligned LLM fine-tuning via bilevel data selection.☆24Feb 20, 2025Updated last year
- MoCo: A One-Stop Shop for Model Collaboration Research☆56May 27, 2026Updated 2 weeks ago
- Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"☆21Feb 23, 2021Updated 5 years ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆54Jun 24, 2024Updated last year
- The code of paper "Nonlinear Hybrid Planning with Deep Net Learned Transition Models and Mixed-Integer Linear Programming." published on …☆10Apr 27, 2018Updated 8 years ago
- ☆31Nov 27, 2025Updated 6 months ago
- The MiniAgents visualization tool for simulacra.☆18Apr 18, 2024Updated 2 years ago