facebookresearch/SelfCite

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/SelfCite)

facebookresearch / SelfCite

Code for the ICML 2025 paper "SelfCite Self-Supervised Alignment for Context Attribution in Large Language Models"

☆30

Alternatives and similar repositories for SelfCite

Users that are interested in SelfCite are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ruizheliUOA / ARC_JSD
View on GitHub
A Jensen-Shannon Divergence Driven Mechanistic Study of Context Attribution in Retrieval-Augmented Generation
☆15Aug 28, 2025Updated 10 months ago
vectominist / spin
View on GitHub
Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…
☆65May 19, 2023Updated 3 years ago
lingjzhu / spoken_sent_embedding
View on GitHub
Unsupervised spoken sentence embeddings
☆14Dec 14, 2022Updated 3 years ago
Philip-MIT / rover-vlm
View on GitHub
☆18Dec 1, 2025Updated 7 months ago
vectominist / End-to-end-ASR-Pytorch-DLHLP
View on GitHub
Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)
☆17Nov 22, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wbopan / flashtrace
View on GitHub
Efficient multi-token attribution for reasoning language models — Python package, CLI, and HTML token traces
☆32Updated this week
lovodkin93 / attribute-first-then-generate
View on GitHub
Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024
☆30Dec 19, 2024Updated last year
voidism / EAR
View on GitHub
Code for the ACL 2023 long paper - Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering
☆38May 30, 2023Updated 3 years ago
allenai / smashed
View on GitHub
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…
☆35May 24, 2024Updated 2 years ago
github / octo-recipes
View on GitHub
A GitHub repository used to collaborate on recipes
☆11Sep 1, 2015Updated 10 years ago
zijwang / talkdown
View on GitHub
Dataset and pre-trained model of EMNLP-IJCNLP 2019 paper "TalkDown: A Corpus for Condescension Detection in Context."
☆10Jan 26, 2020Updated 6 years ago
Alexander-H-Liu / dinosr
View on GitHub
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning
☆53Jan 18, 2024Updated 2 years ago
Philip-MIT / thread
View on GitHub
☆22Aug 18, 2024Updated last year
isjwdu / DFADD
View on GitHub
Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset
☆16Apr 7, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
roudimit / Omni-R1
View on GitHub
[ASRU 2025] Omni-R1: Do You Really Need Audio to Fine-Tune Your Audio LLM?
☆47Nov 21, 2025Updated 8 months ago
TristaCao / into_inclusivecoref
View on GitHub
☆14Jul 27, 2020Updated 5 years ago
lucasmllr / xsbert
View on GitHub
explainable Siamese sentence transformers
☆13Mar 26, 2024Updated 2 years ago
lwq20020127 / OmniDrag
View on GitHub
[IJCV 2025] OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation
☆16Feb 13, 2026Updated 5 months ago
myracheng / lm_caricature
View on GitHub
code and data associated with CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations
☆11Oct 13, 2023Updated 2 years ago
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
View on GitHub
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12Mar 23, 2021Updated 5 years ago
selab-hcmus / AI_City_2021
View on GitHub
Traffic Video Event Retrieval via Text Query using Vehicle Appearance and Motion Attributes
☆10May 31, 2026Updated last month
IliaLarchenko / lehome_solution
View on GitHub
My solution for the LeHome challenge (1st place online, 2nd place in real world round)
☆62Jun 26, 2026Updated 3 weeks ago
allenai / understanding_mcqa
View on GitHub
Code for the arXiv preprint "Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions"
☆15Aug 2, 2025Updated 11 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Dreamyao516 / DialogueLLM
View on GitHub
☆10Jan 18, 2024Updated 2 years ago
IceWhaleTech / CasaOS-LocalStorage
View on GitHub
Local Storage service provides local storage and disk management functionalities to CasaOS
☆15Apr 17, 2025Updated last year
WingSingFung / TISDiSS
View on GitHub
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16Oct 19, 2025Updated 9 months ago
rrphys / KernelHerding
View on GitHub
Kernel Herding for probability density estimation
☆14Feb 23, 2016Updated 10 years ago
xyease / Dialog-PrLM
View on GitHub
☆12Mar 12, 2022Updated 4 years ago
ian-k-1217 / Fully-Generalized-Non-Local-Network
View on GitHub
☆10Jun 2, 2021Updated 5 years ago
TwidereProject / MetaTextKit
View on GitHub
☆15Dec 12, 2025Updated 7 months ago
zqs01 / data2vecnoisy
View on GitHub
☆11Oct 20, 2022Updated 3 years ago
r-three / AttriBoT
View on GitHub
Code for AttriBoT from "AttriBoT: A Bag of Tricks for Efficiently Approximating Leave-One-Out Context Attribution"
☆15Apr 21, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tywings / BUPT_IS_Bank
View on GitHub
☆13May 25, 2018Updated 8 years ago
eastonYi / Unsupervised-ASR
View on GitHub
unsupervised ASR (mainly phone classifier) using EODM and GAN
☆12Oct 22, 2020Updated 5 years ago
felix-schmitt / MathNet
View on GitHub
MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition
☆10Mar 19, 2025Updated last year
itsnotacie / AAAI-26_SepPrune
View on GitHub
SepPrune: Structured Pruning for Efficient Deep Speech Separation-AAAI'26
☆15May 31, 2025Updated last year
ncherel / infusion
View on GitHub
Internal diffusion for video inpainting
☆17May 19, 2025Updated last year
brendel-group / compositional-ood-generalization
View on GitHub
Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)
☆15Jul 25, 2023Updated 2 years ago
NTIA / alignnet
View on GitHub
Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.
☆18Aug 1, 2025Updated 11 months ago