lumina-ai-inc / benchmark
☆64 Updated last year
Alternatives and similar repositories for benchmark
Users who are interested in benchmark are comparing it to the repositories listed below
- Repo housing the open-sourced code for the ai2 scholar qa app and also the corresponding library ☆225 Updated last week
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆90 Updated this week
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀 ☆101 Updated last month
- ☆66 Updated last year
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j. ☆190 Updated last year
- TF-ID: Table/Figure IDentifier for academic papers ☆239 Updated last year
- [ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping". ☆228 Updated last year
- An AI regulatory assistant to pre-check your documentation before FDA or MDR submission. ☆12 Updated last year
- LLM reads a paper and produces a working prototype ☆57 Updated 5 months ago
- Social and customizable AI writing assistant! ✍️ ☆253 Updated 2 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models" ☆97 Updated last year
- Score LLM pretraining data with classifiers ☆55 Updated last year
- Official code repository for: DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning (COLM 2024) ☆146 Updated last year
- Official implementation of paper "Meta Prompting for AI Systems" (https://arxiv.org/abs/2311.11482) ☆221 Updated 3 weeks ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ☆145 Updated last year
- The code powering searchthearxiv.com, a simple semantic search engine for more than 300,000 ML papers on arXiv. ☆160 Updated 4 months ago
- This repository contains ScholarQABench data and evaluation pipeline. ☆84 Updated last month
- Code repo for MathAgent ☆17 Updated last year
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot ☆46 Updated 3 months ago
- ☆59 Updated 9 months ago
- ☆185 Updated last year
- ☆20 Updated last year
- Query Expansion for Better Query Embedding using LLMs ☆56 Updated 6 months ago
- a collection of resources around LLMs, aggregated for the workshop "Mastering LLMs: End-to-End Fine-Tuning and Deployment" by Dan Becker … ☆110 Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆96 Updated 9 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024] ☆85 Updated 7 months ago
- LangChain implementation of HuggingGPT ☆133 Updated 2 years ago
- Google DeepMind's PromptBreeder for automated prompt engineering implemented in LangChain Expression Language. ☆144 Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆110 Updated 9 months ago
- ☆89 Updated 7 months ago