mangiucugna / json_repair
A python module to repair invalid JSON from LLMs
☆1,793Updated this week
Alternatives and similar repositories for json_repair:
Users that are interested in json_repair are comparing it to the libraries listed below
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,391Updated 3 weeks ago
- A simple, easy-to-hack GraphRAG implementation☆2,834Updated 2 weeks ago
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy☆1,120Updated last week
- The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval☆1,191Updated 7 months ago
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,776Updated last month
- RAGChecker: A Fine-grained Framework For Diagnosing RAG☆850Updated 4 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,659Updated this week
- High-performance retrieval engine for unstructured data☆1,348Updated last week
- [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge a…☆2,254Updated 2 weeks ago
- Code for explaining and evaluating late chunking (chunked pooling)☆375Updated 4 months ago
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.☆2,461Updated 3 weeks ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆1,768Updated this week
- Empowering RAG with a memory-based data interface for all-purpose applications!☆1,747Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,552Updated this week
- RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDF☆877Updated 2 weeks ago
- Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'☆1,493Updated 3 months ago
- Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cro…☆786Updated 4 months ago
- Fast State-of-the-Art Static Embeddings☆1,359Updated this week
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…☆1,097Updated 11 months ago
- ☆1,122Updated 10 months ago
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.☆769Updated last month
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)☆2,201Updated this week
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.☆293Updated 3 weeks ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,080Updated last week
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆487Updated 3 months ago
- Task-Aware Agent-driven Prompt Optimization Framework☆3,175Updated last month
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆2,051Updated 11 months ago
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆368Updated 2 weeks ago
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?☆1,066Updated 2 months ago
- Prompt optimization scratch☆706Updated last week