ziegler-ingo / CRAFTLinks

Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation"

☆30

Alternatives and similar repositories for CRAFT

Users that are interested in CRAFT are comparing it to the libraries listed below

Sorting:

padas-lab-de / ir-rag-sigir24-persona-rag
☆47Updated 9 months ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆48Updated 5 months ago
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79Updated 9 months ago
pygongnlp / CoSearchAgent
[SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models
☆27Updated last year
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆75Updated 8 months ago
ibm-granite / granite-embedding-models
☆29Updated 2 weeks ago
trapoom555 / Language-Model-STS-CFT
Improving Text Embedding of Language Models Using Contrastive Fine-tuning
☆64Updated 11 months ago
miralab-ai / autoreason
☆40Updated 7 months ago
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆43Updated last year
microsoft / Structured-Entity-Extraction
Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models"
☆42Updated 3 months ago
princeton-nlp / LitSearch
[EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search
☆90Updated 7 months ago
arcee-ai / DAM
☆52Updated 8 months ago
du-nlp-lab / MLR-Copilot
☆66Updated 3 months ago
anyscale / long-context-fine-tuning-blogpost
☆17Updated last year
Pleias / Pleias-RAG-Library
Python library to use Pleias-RAG models
☆58Updated 2 months ago
SalesforceAIResearch / SFR-RAG
☆76Updated 6 months ago
huggingface / wikirace-llms
☆23Updated 2 months ago
DunZhang / Stella
☆62Updated 11 months ago
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22Updated 7 months ago
cxcscmu / RAGViz
Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
☆85Updated 5 months ago
icip-cas / SelfRetrieval
☆33Updated 8 months ago
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆80Updated last year
OSU-NLP-Group / In-Context-Reranking
[ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"
☆25Updated 3 months ago
Knowledgator / FlashDeBERTa
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆61Updated last month
para-lost / ReBase
ReBase: Training Task Experts through Retrieval Based Distillation
☆29Updated 5 months ago
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33Updated last year
codefuse-ai / D2LLM
☆30Updated 11 months ago
Knowledgator / LiqFit
Efficient few-shot learning with cross-encoders.
☆54Updated last year
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆106Updated 7 months ago
Zyphra / Zyda_processing
☆35Updated last year