lumina-ai-inc / benchmark
☆65 · Updated last year
Alternatives and similar repositories for benchmark
Users that are interested in benchmark are comparing it to the libraries listed below
- Repo housing the open-sourced code for the ai2 scholar qa app and also the corresponding library ☆233 · Updated last week
- LLM reads a paper and produces a working prototype ☆57 · Updated 7 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆91 · Updated 2 months ago
- An AI regulatory assistant to pre-check your documentation before FDA or MDR submission. ☆12 · Updated last year
- Score LLM pretraining data with classifiers ☆54 · Updated 2 years ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j. ☆196 · Updated last year
- A collection of resources around LLMs, aggregated for the workshop "Mastering LLMs: End-to-End Fine-Tuning and Deployment" by Dan Becker … ☆110 · Updated last year
- Measuring RAG solutions throughput and latency ☆18 · Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024] ☆87 · Updated 9 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆101 · Updated 11 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀 ☆102 · Updated 3 months ago
- A prompting library ☆185 · Updated 4 months ago
- [ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping". ☆230 · Updated last year
- TF-ID: Table/Figure IDentifier for academic papers ☆242 · Updated last year
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆114 · Updated last year
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ☆149 · Updated last year
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot ☆46 · Updated 5 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆44 · Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆109 · Updated 11 months ago
- 🪐 ✨ AI Agents for JupyterLab with 🛠️ MCP tools - Chat interface for intelligent notebook interaction, code execution, and workspace man… ☆120 · Updated last week
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆46 · Updated last year
- I have explained how to create a superior RAG pipeline for complex PDFs using LlamaParse. We can extract text and tables from PDFs and QA on… ☆48 · Updated last year
- Reasoning by Communicating with Agents ☆29 · Updated 6 months ago
- Using LlamaIndex, Redis, and OpenAI to chat with PDF documents. Supplementary material for blog post on Microsoft Developer Blog ☆115 · Updated 2 years ago
- Uses the latest graphrag interface, with a local ollama instance providing the LLM interface. Supports installation via pip ☆155 · Updated last year
- Social and customizable AI writing assistant! ✍️ ☆254 · Updated 4 months ago
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆122 · Updated 2 weeks ago
- ☆66 · Updated last year
- ☆89 · Updated 9 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆92 · Updated last month