☆12Aug 6, 2024Updated last year
Alternatives and similar repositories for LionAlignment
Users that are interested in LionAlignment are comparing it to the libraries listed below
Sorting:
- ACL24☆11Jun 7, 2024Updated last year
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆15Aug 15, 2025Updated 6 months ago
- Code for the paper Multi-Armed Bandits with Correlated Arms☆10Jun 3, 2021Updated 4 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- ☆12Apr 24, 2024Updated last year
- Code for Expert Supervised Reinforcement Learning☆10Apr 7, 2021Updated 4 years ago
- ROCC: Reinforcement learning for the Optimisation of Co-Cultures☆13Nov 17, 2020Updated 5 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- ☆10Nov 29, 2024Updated last year
- A PyTorch Deep Learning Kit☆12Apr 30, 2023Updated 2 years ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 4 months ago
- PyTorch implementation of the estimator proposed in the paper "Estimating Differential Entropy under Gaussian Convolutions"☆13Oct 22, 2020Updated 5 years ago
- This repository contains code for the paper "Better Estimation of the KL Divergence Between Language Models"☆18May 30, 2025Updated 8 months ago
- ☆16Jul 10, 2023Updated 2 years ago
- ☆15Mar 12, 2024Updated last year
- ☆16May 31, 2024Updated last year
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆22Jan 6, 2026Updated last month
- Debate interface, experiments, etc.☆10Mar 12, 2024Updated last year
- Codebase for Prioritizing samples in Reinforcement Learning with Reducible Loss☆12Oct 10, 2022Updated 3 years ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"☆14Sep 12, 2022Updated 3 years ago
- The data and the PyTorch implementation for the models and experiments in the paper "Language Model Decoding as Likelihood–Utility Alignm…☆14Sep 7, 2023Updated 2 years ago
- ☆18Oct 12, 2022Updated 3 years ago
- ☆19Aug 4, 2025Updated 6 months ago
- Supervised Speech Representation Learning for Parkinson's Disease Classification☆17Oct 26, 2021Updated 4 years ago
- ☆15Feb 11, 2022Updated 4 years ago
- Code related to different aspects of conformal learning☆17Jan 28, 2025Updated last year
- Python wrapper for lean-gym☆12Apr 5, 2023Updated 2 years ago
- (ACL 2025 Main) Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification - Offici…☆18Dec 26, 2025Updated 2 months ago
- We define and estimate smooth unique information of samples with respect to classifier weights and predictions. We compute these quantiti…☆11Mar 9, 2021Updated 4 years ago
- ☆15Sep 7, 2022Updated 3 years ago
- This repository contains a Python implementation that allows you to use gorilla-llm/gorilla-openfunctions-v2 LLM to perform function call…☆17Apr 7, 2024Updated last year
- Official Code Repository for paper "HYDRA: Model Factorization Framework for Black-Box LLM Personalization"☆16Oct 7, 2024Updated last year
- Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"☆12Nov 26, 2024Updated last year
- Lectures on NLP☆13Aug 18, 2023Updated 2 years ago
- Extract your SlidesLive presentation.☆15Apr 19, 2024Updated last year
- ☆21Feb 8, 2025Updated last year
- ☆14Jun 17, 2024Updated last year
- [NeurIPS '25] Multi-Token Prediction Needs Registers☆26Dec 14, 2025Updated 2 months ago