jasonyux / TriPosT
☆12 Updated last year
Alternatives and similar repositories for TriPosT
Users that are interested in TriPosT are comparing it to the libraries listed below
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo… ☆16 Updated last year
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective ☆39 Updated 2 years ago
- Directional Preference Alignment ☆58 Updated last year
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model" ☆53 Updated 10 months ago
- Evaluate your agent memory on real-world dialogues, not LLM-simulated dialogues. ☆35 Updated 6 months ago
- A collection of instruction data and scripts for machine translation. ☆20 Updated 2 years ago
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes" ☆29 Updated last year
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment" ☆76 Updated 7 months ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs? ☆19 Updated 10 months ago
- The original Backpack Language Model implementation, a fork of FlashAttention ☆71 Updated 2 years ago
- ☆28 Updated 10 months ago
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models ☆78 Updated last year
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning" ☆84 Updated last year
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity… ☆28 Updated 2 months ago
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)" ☆48 Updated 6 months ago
- [EMNLP 2023] Context Compression for Auto-regressive Transformers with Sentinel Tokens ☆25 Updated 2 years ago
- WorldSense benchmark for grounded reasoning in language models ☆22 Updated 2 years ago
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment" ☆69 Updated 2 years ago
- [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions" ☆38 Updated last year
- Code for the ICML 2025 paper "SelfCite Self-Supervised Alignment for Context Attribution in Large Language Models" ☆21 Updated 3 weeks ago
- Code for paper "Patch-Level Training for Large Language Models" ☆96 Updated last month
- Long Context Extension and Generalization in LLMs ☆62 Updated last year
- Code for Zero-Shot Tokenizer Transfer ☆142 Updated 11 months ago
- ☆130 Updated 3 years ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆46 Updated 8 months ago
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024] ☆38 Updated last year
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆63 Updated 4 months ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers". ☆80 Updated last year
- Function Vectors in Large Language Models (ICLR 2024) ☆189 Updated 8 months ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24) ☆27 Updated 3 months ago