jasonyux / TriPosTLinks
☆12Updated last year
Alternatives and similar repositories for TriPosT
Users that are interested in TriPosT are comparing it to the libraries listed below
Sorting:
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆16Updated last year
- Directional Preference Alignment☆58Updated last year
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging☆113Updated 2 years ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆76Updated 6 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆45Updated 7 months ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆38Updated 2 years ago
- ☆28Updated 10 months ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆83Updated last year
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022)☆54Updated 4 months ago
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆53Updated 9 months ago
- Code for paper "Patch-Level Training for Large Language Models"☆95Updated last month
- [EMNLP 2023]Context Compression for Auto-regressive Transformers with Sentinel Tokens☆25Updated 2 years ago
- A collection of instruction data and scripts for machine translation.☆20Updated 2 years ago
- Evaluate your agent memory on real-world dialogues, not LLM-simulated dialogues.☆35Updated 5 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆62Updated 3 months ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆19Updated 9 months ago
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)☆70Updated last year
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…