lumina-ai-inc / benchmarkLinks
☆64Updated last year
Alternatives and similar repositories for benchmark
Users that are interested in benchmark are comparing it to the libraries listed below
Sorting:
- Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library☆242Updated last month
- LLM reads a paper and produce a working prototype☆60Updated 8 months ago
- [ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".☆230Updated last year
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.☆199Updated last year
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot☆46Updated 6 months ago
- Google Deepmind's PromptBreeder for automated prompt engineering implemented in langchain expression language.☆161Updated last year
- ☆20Updated last year
- ☆361Updated 5 months ago
- Social and customizable AI writing assistant! ✍️☆258Updated 6 months ago
- This repository contains ScholarQABench data and evaluation pipeline.☆91Updated 4 months ago
- Langchain implementation of HuggingGPT☆134Updated 2 years ago
- The code powering searchthearxiv.com, a simple semantic search engine for more than 300,000 ML papers on arXiv.☆164Updated 8 months ago
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.☆99Updated last year
- Official code repository for: DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning (COLM 2024)☆152Updated last year
- a collection of resources around LLMs, aggregated for the workshop "Mastering LLMs: End-to-End Fine-Tuning and Deployment" by Dan Becker …☆110Updated last year
- OpenResearcher, an advanced Scientific Research Assistant☆475Updated last year
- ☆63Updated last year
- ☆67Updated last year
- TF-ID: Table/Figure IDentifier for academic papers☆245Updated last year
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆99Updated 2 years ago
- Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions.☆255Updated 11 months ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆151Updated last year
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆219Updated 6 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆111Updated last year
- Official implementation of paper "Meta Prompting for AI Systems" (https://arxiv.org/abs/2311.11482)☆262Updated 2 weeks ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆102Updated 5 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆347Updated last year
- Official repo of Respond-and-Respond: data, code, and evaluation☆104Updated last year
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆155Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated 11 months ago