Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467
☆311, updated Feb 14, 2025
Alternatives and similar repositories for gisting
Users interested in gisting are comparing it to the libraries listed below.
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts ☆333, updated Sep 9, 2024
- PyTorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR '24) ☆63, updated Apr 18, 2024
- ☆306, updated Jul 10, 2025
- ☆18, updated Dec 2, 2024
- The repo for In-context Autoencoder ☆166, updated May 11, 2024
- [EMNLP 2023] Context Compression for Auto-regressive Transformers with Sentinel Tokens ☆25, updated Nov 6, 2023
- Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024) ☆18, updated Jun 12, 2024
- ☆10, updated Feb 12, 2024
- TART: A plug-and-play Transformer module for task-agnostic reasoning ☆202, updated Jun 22, 2023
- [NeurIPS '23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models ☆507, updated Aug 1, 2024
- Learning adapter weights from task descriptions ☆19, updated Nov 12, 2023
- Leveraging passage embeddings for efficient listwise reranking with large language models ☆50, updated Dec 7, 2024
- The official repo for "LLoCO: Learning Long Contexts Offline" ☆118, updated Jun 15, 2024
- Understand and test language model architectures on synthetic tasks ☆262, updated Mar 16, 2026
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space" ☆25, updated Jul 21, 2025
- ☆14, updated Oct 3, 2024
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models" ☆14, updated May 31, 2023
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023) ☆26, updated Aug 25, 2024
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,685, updated Apr 17, 2024
- Code for the arXiv 2023 paper "Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback" ☆209, updated May 24, 2023
- Salesforce open-source LLMs with 8k sequence length ☆726, updated Jan 31, 2025
- [ACL 2025 Main] Repository for the paper "500xCompressor: Generalized Prompt Compression for Large Language Models" ☆56, updated Mar 9, 2026
- ☆21, updated Jan 16, 2025
- Official implementation for [ICLR '26] "DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference" ☆30, updated Mar 15, 2026
- Code repository for the c-BTM paper ☆108, updated Sep 26, 2023
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang ☆14, updated Jan 4, 2024
- Aioli: A unified optimization framework for language model data mixing ☆32, updated Jan 17, 2025
- LongBench v2 and LongBench (ACL '25 & '24) ☆1,122, updated Jan 15, 2025
- Running inference on the ZeroSCROLLS benchmark ☆20, updated Apr 18, 2024
- BABILong: a benchmark for LLM evaluation using the needle-in-a-haystack approach ☆241, updated Sep 2, 2025
- Prompt programming with FMs ☆444, updated Jul 22, 2024
- Tools for understanding how transformer predictions are built layer by layer ☆574, updated Aug 7, 2025
- [EMNLP 2022] Code for the paper "ZeroGen: Efficient Zero-shot Learning via Dataset Generation" ☆16, updated Feb 18, 2022
- Open-sourced GEC system submitted by THU KELab (sz) to CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex… ☆15, updated Nov 25, 2023
- ☆95, updated Dec 19, 2024
- Simple next-token-prediction for RLHF ☆229, updated Sep 30, 2023
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting ☆2,769, updated Aug 4, 2024
- Official implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024) ☆11, updated Jun 20, 2025
- [NeurIPS 2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token ☆173, updated Jul 4, 2024