lumina-ai-inc / benchmarkLinks
☆62Updated 9 months ago
Alternatives and similar repositories for benchmark
Users that are interested in benchmark are comparing it to the libraries listed below
Sorting:
- Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library☆176Updated last week
- Query Expension for Better Query Embedding using LLMs☆51Updated 3 months ago
- An AI regulatory assistant to pre-check your documentation before FDA or MDR submission.☆11Updated 10 months ago
- Official implementation of paper "Meta Prompting for AI Systems" (https://arxiv.org/abs/2311.11482)☆172Updated 3 weeks ago
- ☆51Updated 10 months ago
- This repository contains ScholarQABench data and evaluation pipeline.☆72Updated last month
- ☆55Updated 3 weeks ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆231Updated 9 months ago
- autonomous agent with access to a tool library☆38Updated 2 months ago
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆100Updated this week
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations☆204Updated last month
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆150Updated 7 months ago
- Fetch arxiv data to LLM-friendly text☆118Updated 3 months ago
- LLM reads a paper and produce a working prototype☆57Updated last month
- ☆53Updated last year
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.☆182Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆83Updated 4 months ago
- InterfaceAgent: a versatile framework designed to create system and interface agents capable of managing mobile and desktop applications …☆112Updated last year
- LangCode - Improving alignment and reasoning of large language models (LLMs) with natural language embedded program (NLEP).☆42Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆24Updated 2 months ago
- A gradio webui for Andrewyng translation-agent☆29Updated 6 months ago
- ☆41Updated 5 months ago
- A prompting library☆165Updated 8 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 3 months ago
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆22Updated 6 months ago
- A curated list of autonomous agents and developer tools powered by LLM.☆40Updated last year
- TF-ID: Table/Figure IDentifier for academic papers☆236Updated 10 months ago
- ☆20Updated last year
- Code repo for MathAgent☆16Updated last year