lumina-ai-inc / benchmarkLinks
☆64Updated last year
Alternatives and similar repositories for benchmark
Users that are interested in benchmark are comparing it to the libraries listed below
Sorting:
- Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library☆249Updated this week
- LLM reads a paper and produce a working prototype☆60Updated 9 months ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.☆199Updated last year
- Score LLM pretraining data with classifiers☆55Updated 2 years ago
- Social and customizable AI writing assistant! ✍️☆259Updated 6 months ago
- ☆67Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆90Updated 4 months ago
- TF-ID: Table/Figure IDentifier for academic papers☆245Updated last year
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆103Updated 5 months ago
- Langchain implementation of HuggingGPT☆134Updated 2 years ago
- This repository contains ScholarQABench data and evaluation pipeline.☆93Updated 5 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆97Updated 3 months ago
- An AI regulatory assistant to pre-check your documentation before FDA or MDR submission.☆13Updated last year
- [ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".☆231Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆112Updated last year
- ☆20Updated last year
- A prompting library☆190Updated 6 months ago
- Official code repository for: DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning (COLM 2024)☆154Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆102Updated last year
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆152Updated last year
- Query Expension for Better Query Embedding using LLMs☆64Updated 11 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆45Updated last year
- Very minimal (and stateless) agent framework☆44Updated last year
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on…☆49Updated last year
- The code powering searchthearxiv.com, a simple semantic search engine for more than 300,000 ML papers on arXiv.☆165Updated 9 months ago
- a collection of resources around LLMs, aggregated for the workshop "Mastering LLMs: End-to-End Fine-Tuning and Deployment" by Dan Becker …☆110Updated last year
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot☆46Updated 7 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated last year
- ☆365Updated 5 months ago
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆156Updated last year