Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs
☆22 · Apr 24, 2025 · Updated last year
Alternatives and similar repositories for embedding-based-llm-alignment
Users that are interested in embedding-based-llm-alignment are comparing it to the libraries listed below.
- ☆17 · Jul 23, 2025 · Updated 9 months ago
- ☆26 · Oct 26, 2020 · Updated 5 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL ☆29 · Oct 29, 2023 · Updated 2 years ago
- Simulation and power analysis of panel/hierarchical data that allows for independently generating effects by cross-section (between-subje… ☆18 · May 14, 2025 · Updated 11 months ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset. ☆18 · Apr 22, 2025 · Updated last year
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models ☆22 · Aug 1, 2025 · Updated 9 months ago
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs" ☆16 · May 24, 2025 · Updated 11 months ago
- A Controllable Model of Grounded Response Generation (AAAI 21) ☆13 · Oct 25, 2022 · Updated 3 years ago
- Tutorials for Stance Detection: A practical guide ☆25 · Oct 12, 2022 · Updated 3 years ago
- Code for Neural Networks journal paper - StoCFL: A stochastically clustered federated learning framework for Non-IID data with dynamic cl… ☆13 · Apr 28, 2024 · Updated 2 years ago
- ☆42 · Nov 8, 2025 · Updated 5 months ago
- Streaming, Distributed, Asynchronous Bayesian Nonparametric Inference ☆12 · Nov 2, 2015 · Updated 10 years ago
- Distributed Feedback-Looped Networks ☆10 · Jan 15, 2020 · Updated 6 years ago
- PyTorch implementation of Swap-VAE: A self-supervised approach for generating neural activity ☆13 · Nov 17, 2021 · Updated 4 years ago
- ☆15 · Feb 18, 2021 · Updated 5 years ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics ☆15 · Jan 7, 2020 · Updated 6 years ago
- time-varying connectivity benchmarker (simulation tool for neuroimaging/fmri) ☆12 · Jan 5, 2019 · Updated 7 years ago
- Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021) ☆12 · Nov 30, 2021 · Updated 4 years ago
- A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks" ☆13 · Jul 19, 2021 · Updated 4 years ago
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025) ☆17 · Aug 22, 2025 · Updated 8 months ago
- ☆13 · Feb 24, 2026 · Updated 2 months ago
- Let's make good things! ☆13 · Aug 22, 2018 · Updated 7 years ago
- Official implementation of Neural Episodic Control with State Abstraction ☆13 · Aug 3, 2023 · Updated 2 years ago
- Steve's {ggplot2} themes and related theme elements ☆13 · Apr 27, 2023 · Updated 3 years ago
- A tutorial on Bayesian multilevel modeling using R and Stan. ☆14 · Nov 19, 2021 · Updated 4 years ago
- ☆11 · Jun 15, 2019 · Updated 6 years ago
- pytorch implementation of XMC-GAN ☆11 · Jun 2, 2021 · Updated 4 years ago
- An R corpus class for tokenized texts ☆32 · Jul 10, 2025 · Updated 9 months ago
- ☆19 · Jun 19, 2022 · Updated 3 years ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning". ☆22 · Feb 26, 2025 · Updated last year
- ☆12 · Apr 27, 2026 · Updated last week
- [ECCV 2024] Official implementation of "Uncertainty Calibration with Energy Based Instance-wise Scaling in the Wild Dataset" ☆11 · Aug 13, 2024 · Updated last year
- Computing resources for peace of mind ☆13 · Feb 23, 2023 · Updated 3 years ago
- PyTorch Implementation of “Unsupervised learning by competing hidden units” MNIST classifier ☆12 · May 6, 2019 · Updated 7 years ago
- ☆13 · Nov 28, 2025 · Updated 5 months ago
- ✒️ ChatGPT as a writing partner. ☆14 · Mar 6, 2023 · Updated 3 years ago
- [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197) ☆41 · Sep 8, 2025 · Updated 7 months ago
- R package for 'Efficient Learning of Word Representations and Sentence Classification' ☆45 · Mar 4, 2026 · Updated 2 months ago
- A small implementation of SPSA in Python ☆15 · Jun 13, 2018 · Updated 7 years ago