Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
☆89Jan 18, 2025Updated last year
Alternatives and similar repositories for RAGViz
Users that are interested in RAGViz are comparing it to the libraries listed below.
- ☆17Dec 11, 2024Updated last year
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆43Mar 31, 2025Updated last year
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- Code for explaining and evaluating late chunking (chunked pooling)☆516Dec 23, 2024Updated last year
- ☆59Oct 18, 2024Updated last year
- This is the official repository for Auto-RAG.☆234Jul 18, 2025Updated 10 months ago
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆308Oct 18, 2024Updated last year
- text2sql with modern LLMs (duckdb-nsql, SQLCoder etc ...)☆18Apr 13, 2024Updated 2 years ago
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆28Sep 25, 2024Updated last year
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆46Aug 25, 2025Updated 9 months ago
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated 11 months ago
- AutoRAG example about benchmarking Korean embeddings.☆45Oct 2, 2024Updated last year
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆45Mar 6, 2024Updated 2 years ago
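- This project aims to build a reusable decision-support intelligent Agent system for multiple scenarios. By extracting key information from user input and intelligently matching it against historical data, the system can provide personalized decision recommendations in domains including education pathways, legal consultation, financial investment, mental health, business operations, supply chain optimization, crisis response, and intelligent customer service. The system adopts a unified decision workflow design and offers high…☆26Mar 7, 2026Updated 2 months ago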
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- Fine Tune Multimodal LLM "Idefics 2" using QLoRA.☆11Apr 20, 2024Updated 2 years ago
- ☆16Sep 10, 2024Updated last year
- Repo for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆70Jul 24, 2024Updated last year
- ☆23Oct 30, 2023Updated 2 years ago
- Official repo for "Make Your LLM Fully Utilize the Context"☆272May 15, 2024Updated 2 years ago
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"☆19May 15, 2025Updated last year
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆13Mar 19, 2024Updated 2 years ago
- This repository contains my research work on building the state of the art next basket recommendations using techniques such as Autoencod…☆11Mar 10, 2021Updated 5 years ago
- Parsing-free RAG supported by VLMs☆956Dec 7, 2025Updated 5 months ago
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆346Dec 21, 2024Updated last year
- This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".☆24Oct 28, 2024Updated last year
- Multimodal RAG with PyMuPDF☆45Oct 4, 2024Updated last year
- ☆15Jun 8, 2023Updated 2 years ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆242Feb 26, 2026Updated 3 months ago
- ☆14Jul 7, 2024Updated last year
- [NAACL 2024] A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- bb25 is a fast, self-contained BM25 + Bayesian calibration implementation with a minimal Python API.☆147Mar 17, 2026Updated 2 months ago
- ☆30Mar 18, 2024Updated 2 years ago
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆21Dec 14, 2024Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆19May 23, 2025Updated last year
- Source code of DRAGIN, ACL 2024 main conference Long Paper (Oral)☆191Dec 5, 2025Updated 5 months ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 6 months ago
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 9 months ago