allenai / papermage
library supporting NLP and CV research on scientific papers
☆698Updated this week
Related projects ⓘ
Alternatives and complementary repositories for papermage
- Generative Representational Instruction Tuning☆562Updated this week
- Easily embed, cluster and semantically label text datasets☆459Updated 7 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆337Updated last week
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.☆692Updated last month
- Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]☆524Updated 7 months ago
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research☆1,293Updated this week
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆667Updated 6 months ago
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆347Updated 6 months ago
- Data and tools for generating and inspecting OLMo pre-training data.☆976Updated this week
- RAGChecker: A Fine-grained Framework For Diagnosing RAG☆528Updated last month
- Evaluation suite for LLMs☆309Updated last week
- Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning☆680Updated last year
- ☆332Updated 11 months ago
- Forward-Looking Active REtrieval-augmented generation (FLARE)☆586Updated 11 months ago
- ☆445Updated last week
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆457Updated last month
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…☆971Updated 8 months ago
- Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard☆484Updated last week
- ☆1,263Updated this week
- All-in-one text de-duplication☆618Updated 5 months ago
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models☆467Updated 4 months ago
- Guideline following Large Language Model for Information Extraction☆309Updated last week
- Train Models Contrastively in Pytorch☆543Updated 2 weeks ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆1,612Updated this week
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆531Updated last week
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆1,822Updated 5 months ago
- ☆1,812Updated 6 months ago
- The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.☆709Updated 6 months ago
- The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".☆1,425Updated 5 months ago
- Framework for enhancing LLMs for RAG tasks using fine-tuning.☆502Updated 3 weeks ago