MadryLab / context-citeLinks

Attribute (or cite) statements generated by LLMs back to in-context information.

☆268

Alternatives and similar repositories for context-cite

Users that are interested in context-cite are comparing it to the libraries listed below

Sorting:

davanstrien / awesome-synthetic-datasets
awesome synthetic (text) datasets
☆291Updated 3 weeks ago
wang-research-lab / agentinstruct
Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"
☆115Updated 10 months ago
Liyan06 / MiniCheck
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024]
☆174Updated 7 months ago
RulinShao / retrieval-scaling
Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".
☆207Updated 2 months ago
msclar / formatspread
Code accompanying "How I learned to start worrying about prompt formatting".
☆107Updated last month
felipemaiapolo / tinyBenchmarks
Evaluating LLMs with fewer examples
☆160Updated last year
writer / writing-in-the-margins
☆118Updated 11 months ago
Mihaiii / llm_steer
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆240Updated 5 months ago
microsoft / FILM
Official repo for "Make Your LLM Fully Utilize the Context"
☆253Updated last year
chentong0 / factoid-wiki
Dense X Retrieval: What Retrieval Granularity Should We Use?
☆159Updated last year
spcl / MRAG
Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"
☆222Updated last month
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆268Updated last year
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79Updated 10 months ago
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆81Updated 2 months ago
illuin-tech / vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
☆223Updated this week
zeroentropy-ai / legalbenchrag
This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343.
☆118Updated 2 months ago
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆242Updated 8 months ago
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆88Updated 10 months ago
quotient-ai / judges
A small library of LLM judges
☆248Updated this week
WildEval / ZeroEval
A simple unified framework for evaluating LLMs
☆235Updated 3 months ago
SALT-NLP / demonstrated-feedback
☆125Updated 10 months ago
haizelabs / verdict
Inference-time scaling for LLMs-as-a-judge.
☆267Updated 3 weeks ago
jakespringer / echo-embeddings
☆152Updated last year
zai-org / ComplexFuncBench
Complex Function Calling Benchmark.
☆123Updated 6 months ago
redotvideo / pluto
Synthetic Data for LLM Fine-Tuning
☆120Updated last year
zbambergerNLP / strategic-debate-tot
A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments
☆87Updated 10 months ago
shengliu66 / ICV
Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering
☆182Updated 5 months ago
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆196Updated 2 months ago
allenai / WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
☆233Updated 9 months ago
facebookresearch / ReasonIR
Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".
☆188Updated last month