lumina-ai-inc / benchmarkLinks

☆64

Alternatives and similar repositories for benchmark

Users that are interested in benchmark are comparing it to the libraries listed below

Sorting:

allenai / ai2-scholarqa-lib
Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library
☆200Updated last week
phunterlau / paper_without_code
LLM reads a paper and produce a working prototype
☆58Updated 3 months ago
VikParuchuri / classified
Score LLM pretraining data with classifiers
☆55Updated last year
cxcscmu / RAGViz
Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
☆86Updated 6 months ago
read-agent / read-agent.github.io
☆62Updated last year
google-deepmind / llms_can_learn_rules
☆57Updated 7 months ago
jina-ai / submodular-optimization
Submodular optimization for context engineering: query fan-out, text selection, passage reranking
☆56Updated last week
wade1010 / graphrag-ui
The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation
☆154Updated 9 months ago
ai8hyf / TF-ID
TF-ID: Table/Figure IDentifier for academic papers
☆238Updated last year
aymeric-roucher / agent_reasoning_benchmark
🔧 Compare how Agent systems perform on several benchmarks. 📊🚀
☆98Updated 9 months ago
chrisammon3000 / dspy-neo4j-knowledge-graph
LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.
☆187Updated last year
OpenMatch / NeuScraper
[ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".
☆226Updated 10 months ago
marco-jeffrey / awesome-llm-resources
a collection of resources around LLMs, aggregated for the workshop "Mastering LLMs: End-to-End Fine-Tuning and Deployment" by Dan Becker …
☆110Updated last year
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆43Updated last year
miralab-ai / autoreason
☆40Updated 7 months ago
datalayer / jupyter-ai-agents
🪐 ✨ Jupyter AI Agents are agents equipped with tools like 'execute', 'insert_cell', and more, to transform your Jupyter Notebooks into a…
☆88Updated 3 weeks ago
jina-ai / llm-query-expansion
Query Expension for Better Query Embedding using LLMs
☆54Updated 5 months ago
EasyShopAI / rag-lab
Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot
☆47Updated last month
ShayekhBinIslam / openrag
Official Code for Oᴘᴇɴ-RAG: Enhanced Retrieval Augmented Reasoning with Open-Source Large Language Models (EMNLP Findings 2024)
☆130Updated 5 months ago
LLM360 / Analysis360
Open Implementations of LLM Analyses
☆105Updated 9 months ago
h2oai / enterprise-h2ogpte
Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform
☆87Updated last month
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆106Updated 7 months ago
sujitpal / llm-rag-eval
Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.
☆30Updated last year
camille-vanhoffelen / langchain-huggingGPT
Langchain implementation of HuggingGPT
☆132Updated 2 years ago
BalianWang / OSLUI
Natural Language User Interface for Operating Systems
☆8Updated last year
augustwester / searchthearxiv
The code powering searchthearxiv.com, a simple semantic search engine for more than 300,000 ML papers on arXiv.
☆157Updated 3 months ago
charlesjin / emergent-semantics
☆41Updated last year
GPT-Laboratory / SLR-automation
To automate the SLR process and write paper quickly using multi agents of AI
☆45Updated last year
meta-prompting / meta-prompting
Official implementation of paper "Meta Prompting for AI Systems" (https://arxiv.org/abs/2311.11482)
☆199Updated 2 months ago
kyegomez / Algorithm-Of-Thoughts
My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"
☆97Updated last year