Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs
☆22Apr 24, 2025Updated 11 months ago
Alternatives and similar repositories for embedding-based-llm-alignment
Users that are interested in embedding-based-llm-alignment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆72Apr 2, 2025Updated 11 months ago
- ☆26Oct 26, 2020Updated 5 years ago
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models☆21Aug 1, 2025Updated 7 months ago
- Simulation and power analysis of panel/hierarchical data that allows for independently generating effects by cross-section (between-subje…☆18May 14, 2025Updated 10 months ago
- [MICCAI-2023]Visual-Attribute Prompt Learning for Progressive Mild Cognitive Impairment Prediction☆15Dec 12, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)☆10Dec 12, 2023Updated 2 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- Code for AISTATS'25 paper - On the Power of Adaptive Weighted Aggregation in Heterogeneous Federated Learning and Beyond☆13Sep 23, 2025Updated 6 months ago
- ☆15Apr 18, 2019Updated 6 years ago
- ☆42Nov 8, 2025Updated 4 months ago
- Distributed Feedback-Looped Networks☆10Jan 15, 2020Updated 6 years ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics☆15Jan 7, 2020Updated 6 years ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆38Aug 11, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- code for ACL2024-main: BatchEval: Towards Human-like Text Evaluation☆19May 20, 2024Updated last year
- time-varying connectivity benchmarker (simulation tool for neuroimaging/fmri)☆12Jan 5, 2019Updated 7 years ago
- Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021)☆12Nov 30, 2021Updated 4 years ago
- ☆10Jul 11, 2022Updated 3 years ago
- Pandoc filter for D2☆22Sep 3, 2024Updated last year
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)☆17Aug 22, 2025Updated 7 months ago
- ☆13Feb 24, 2026Updated last month
- [EMNLP 2025] Dataset and Code of "PersonaGym: Evaluating Persona Agents and LLMs"☆40Aug 21, 2025Updated 7 months ago
- Steve's {ggplot2} themes and related theme elements☆12Apr 27, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A tutorial on Bayesian multilevel modeling using R and Stan.☆14Nov 19, 2021Updated 4 years ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆20Feb 26, 2025Updated last year
- ☆12Nov 26, 2024Updated last year
- An R corpus class for tokenized texts☆32Jul 10, 2025Updated 8 months ago
- ☆16Oct 21, 2024Updated last year
- ☆19Jun 19, 2022Updated 3 years ago
- Behavioral analysis via self-supervised pretraining of transformers☆23Feb 27, 2026Updated 3 weeks ago
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.☆23Jul 26, 2023Updated 2 years ago
- ☆44Jul 28, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [ECCV 2024] Official implementation of "Uncertainty Calibration with Energy Based Instance-wise Scaling in the Wild Dataset"☆11Aug 13, 2024Updated last year
- Computing resources for peace of mind☆13Feb 23, 2023Updated 3 years ago
- Single-cell Consensus Clusters of Encoded Subspaces☆14Aug 14, 2023Updated 2 years ago
- ☆13Nov 28, 2025Updated 3 months ago
- papers related to Direct Preference Optimization(DPO)☆19Jul 16, 2024Updated last year
- ☆10Sep 19, 2022Updated 3 years ago
- A small implementation of SPSA in Python☆15Jun 13, 2018Updated 7 years ago