facebookresearch / SelfCiteLinks
Code for the ICML 2025 paper "SelfCite Self-Supervised Alignment for Context Attribution in Large Language Models"
☆21Updated 3 weeks ago
Alternatives and similar repositories for SelfCite
Users that are interested in SelfCite are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆39Updated 2 years ago
- The original Backpack Language Model implementation, a fork of FlashAttention☆71Updated 2 years ago
- Code for EMNLP 2023 findings paper "A Closer Look into Using Large Language Models for Automatic Evaluation"☆19Updated 2 years ago
- Evaluate your agent memory on real-world dialogues, not LLM-simulated dialogues.☆35Updated 6 months ago
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022)☆54Updated 5 months ago
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆16Updated last year
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control☆76Updated 3 years ago
- contrastive decoding☆205Updated 3 years ago
- ☆12Updated last year
- PyTorch reimplementation of REALM and ORQA☆22Updated 3 years ago
- Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models☆143Updated 3 years ago
- Measuring the Mixing of Contextual Information in the Transformer☆34Updated 2 years ago
- ☆44Updated last year
- ☆68Updated 2 years ago
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆48Updated 6 months ago
- ☆103Updated 2 years ago
- Long Context Extension and Generalization in LLMs☆62Updated last year
- [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"☆38Updated last year
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆23Updated last year
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Updated 3 years ago
- Directional Preference Alignment☆58Updated last year
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆37Updated 3 weeks ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆84Updated last year
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings☆58Updated last year
- ☆19Updated 4 months ago
- Tasks for describing differences between text distributions.☆17Updated last year
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆69Updated 2 years ago
- ☆111Updated 2 years ago
- [AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning☆15Updated last year
- Randomized Positional Encodings Boost Length Generalization of Transformers☆82Updated last year