lumina-ai-inc / benchmark
☆64 Updated last year
Alternatives and similar repositories for benchmark
Users who are interested in benchmark are comparing it to the repositories listed below
- Repo housing the open-sourced code for the ai2 scholar qa app and also the corresponding library ☆226 Updated last month
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆90 Updated 3 weeks ago
- Social and customizable AI writing assistant! ✍️ ☆254 Updated 3 months ago
- To automate the SLR process and write papers quickly using multiple AI agents ☆48 Updated last year
- LitLLM: A Toolkit for Scientific Literature Review ☆71 Updated 5 months ago
- ☆20 Updated last year
- ☆60 Updated 10 months ago
- The code powering searchthearxiv.com, a simple semantic search engine for more than 300,000 ML papers on arXiv. ☆161 Updated 5 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀 ☆102 Updated 2 months ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j. ☆191 Updated last year
- Official implementation of paper "Meta Prompting for AI Systems" (https://arxiv.org/abs/2311.11482) ☆232 Updated 2 weeks ago
- This repository contains ScholarQABench data and evaluation pipeline. ☆85 Updated last month
- a collection of resources around LLMs, aggregated for the workshop "Mastering LLMs: End-to-End Fine-Tuning and Deployment" by Dan Becker … ☆110 Updated last year
- LLM reads a paper and produces a working prototype ☆56 Updated 5 months ago
- ☆132 Updated last year
- Langchain implementation of HuggingGPT ☆133 Updated 2 years ago
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025] ☆232 Updated 2 months ago
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot ☆46 Updated 3 months ago
- Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions. ☆242 Updated 8 months ago
- ☆352 Updated last month
- 🪐 ✨ Jupyter AI Agents are agents equipped with tools like 'execute', 'insert_cell', and more, to transform your Jupyter Notebooks into a… ☆107 Updated last month
- InterfaceAgent: a versatile framework designed to create system and interface agents capable of managing mobile and desktop applications … ☆114 Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆44 Updated last year
- [ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping". ☆229 Updated last year
- Code repo for MathAgent ☆17 Updated last year
- Official code repository for: DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning (COLM 2024) ☆147 Updated last year
- TF-ID: Table/Figure IDentifier for academic papers ☆240 Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆110 Updated 9 months ago
- AlphaXIV open-source alternative: Chat with any arXiv paper. ☆82 Updated 4 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results ☆178 Updated last month