mukhal / intrinsic-source-citation
[COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models
☆15 Updated 3 months ago
Related projects
Alternatives and complementary repositories for intrinsic-source-citation
- Code for our paper "Resources and Evaluations for Multi-Distribution Dense Information Retrieval" ☆14 Updated 9 months ago
- Training hybrid models for dummies. ☆15 Updated last week
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆32 Updated last year
- GoldFinch and other hybrid transformer components ☆39 Updated 3 months ago
- Reference implementation for "Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model" ☆41 Updated 9 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆44 Updated 11 months ago
- ☆44 Updated 5 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆39 Updated 4 months ago
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076 ☆23 Updated 9 months ago
- ☆30 Updated last month
- ☆39 Updated 2 weeks ago
- Entailment self-training ☆25 Updated last year
- Minimal implementation of the paper "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models" (arXiv 2401.01335) ☆28 Updated 8 months ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluation ☆30 Updated this week
- Demonstration that fine-tuning a RoPE model on sequences longer than those used in pre-training extends the model's context limit ☆63 Updated last year
- A repository for research on medium-sized language models. ☆74 Updated 5 months ago
- ☆33 Updated 5 months ago
- ☆25 Updated 11 months ago
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023) ☆22 Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆17 Updated 3 weeks ago
- ☆20 Updated this week
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆22 Updated 7 months ago
- ☆15 Updated 4 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning ☆60 Updated 3 months ago
- Distill ChatGPT's coding ability into a small model (1B) ☆24 Updated last year
- Measuring and Controlling Persona Drift in Language Model Dialogs ☆11 Updated 8 months ago
- Official code repo for the paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs" ☆18 Updated 2 months ago
- PyTorch code for "System-1.x: Learning to Balance Fast and Slow Planning with Language Models" ☆19 Updated 3 months ago
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of … ☆21 Updated last month