alisawuffles/proxy-tuning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alisawuffles/proxy-tuning)

alisawuffles / proxy-tuning

Code associated with Tuning Language Models by Proxy (Liu et al., 2024)

☆134

Alternatives and similar repositories for proxy-tuning

Users that are interested in proxy-tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

liujch1998 / memo-trap
View on GitHub
☆23Jan 25, 2023Updated 3 years ago
vidhishanair / FactEdit
View on GitHub
☆14Aug 30, 2023Updated 2 years ago
srzer / MOD
View on GitHub
Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".
☆30Oct 30, 2024Updated last year
clinicalml / co-llm
View on GitHub
Co-LLM: Learning to Decode Collaboratively with Multiple Language Models
☆128May 7, 2024Updated 2 years ago
yurujiang2003 / sparta
View on GitHub
NeurIPS 2025
☆15Feb 4, 2026Updated 5 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
cloudygoose / blindspot_nlg
View on GitHub
☆21Jan 15, 2024Updated 2 years ago
ZHZisZZ / weak-to-strong-search
View on GitHub
[NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
☆67Dec 10, 2024Updated last year
cmu-llms-class / cmu-llm-class-website-2023
View on GitHub
The course website for Large Language Models Methods and Applications
☆28May 6, 2024Updated 2 years ago
yale-nlp / ODSum
View on GitHub
Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"
☆11Sep 20, 2024Updated last year
haotiansun14 / BBox-Adapter
View on GitHub
Lightweight Adapting for Black-Box Large Language Models
☆25Feb 15, 2024Updated 2 years ago
BunsenFeng / model_collaboration
View on GitHub
MoCo: A One-Stop Shop for Model Collaboration Research
☆60Updated this week
ZHZisZZ / modpo
View on GitHub
[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
☆101Aug 20, 2024Updated last year
yaojin17 / Unlearning_LLM
View on GitHub
[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"
☆68Sep 30, 2024Updated last year
BunsenFeng / FactKB
View on GitHub
Code for "FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge". EMNLP 2023.
☆20Dec 25, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lyh6560new / BotMoE
View on GitHub
The code for the paper "BotMoE: Twitter Bot Detection with Community-Aware Mixtures of Modal-Specific Experts"
☆28Sep 16, 2023Updated 2 years ago
alisawuffles / DExperts
View on GitHub
code associated with ACL 2021 DExperts paper
☆119May 24, 2023Updated 3 years ago
sail-sg / lorahub
View on GitHub
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
☆671Jul 22, 2024Updated 2 years ago
GeorgeVern / smala
View on GitHub
Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".
☆13Sep 17, 2021Updated 4 years ago
XiangLi1999 / ContrastiveDecoding
View on GitHub
contrastive decoding
☆206Nov 14, 2022Updated 3 years ago
Re-Align / URIAL
View on GitHub
☆316Jun 9, 2024Updated 2 years ago
thomasgauthier / LLM-self-play
View on GitHub
Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)
☆29Mar 1, 2024Updated 2 years ago
yikee / Knowledge_Conflict
View on GitHub
Resolving Knowledge Conflicts in Large Language Models, COLM 2024
☆18Oct 7, 2025Updated 9 months ago
microsoft / rho
View on GitHub
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆470Apr 18, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
alon-albalak / online-data-mixing
View on GitHub
An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.
☆14Jan 9, 2024Updated 2 years ago
maszhongming / ParaKnowTransfer
View on GitHub
Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"
☆33May 9, 2024Updated 2 years ago
starrYYxuan / LeCo
View on GitHub
This the implementation of LeCo
☆33Jan 20, 2025Updated last year
swj0419 / detect-pretrain-code
View on GitHub
This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…
☆243Nov 3, 2023Updated 2 years ago
kaistAI / LangBridge
View on GitHub
[ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision
☆97Oct 30, 2024Updated last year
leopoldwhite / GraphDancer
View on GitHub
GraphDancer: Training LLMs to Explore and Reason over Graphs via Curriculum Reinforcement Learning
☆20May 25, 2026Updated last month
yikee / ScienceMeter
View on GitHub
ScienceMeter: Tracking Scientific Knowledge Updates in Language Models, COLM 2026
☆17Jun 28, 2025Updated last year
FreedomIntelligence / Apollo
View on GitHub
Multilingual Medicine: Model, Dataset, Benchmark, Code
☆200Oct 15, 2024Updated last year
LHRYANG / Generalization_of_FT-LLM
View on GitHub
Implementation of NAACL 2024 paper Unveiling the Generalization Power of Fine-Tuned Large Language Models
☆11Mar 14, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
HKUNLP / ChunkLlama
View on GitHub
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
☆450Oct 16, 2024Updated last year
xhan77 / jpeg-lm
View on GitHub
JPEG-LM: LLMs as Image Generators with Canonical Codec Representations
☆16Sep 29, 2024Updated last year
eth-sri / language-model-arithmetic
View on GitHub
Controlled Text Generation via Language Model Arithmetic
☆224Sep 15, 2024Updated last year
abhishekpanigrahi1996 / Skill-Localization-by-grafting
View on GitHub
☆52Jan 1, 2024Updated 2 years ago
lili-chen / rltf
View on GitHub
Reinforcement Learning from Text Feedback
☆49Feb 17, 2026Updated 5 months ago
jonnypei / acl23-preadd
View on GitHub
☆12Jul 25, 2023Updated 2 years ago
CHEN-YIZHU / GACL
View on GitHub
[NeurIPS 2024] GACL: Exemplar-Free Generalized Analytic Continual Learning
☆18Nov 5, 2024Updated last year