machine-theory / lm-council
LLMs sitting on a council together to decide, by consensus, who among them is the best.
☆21 Updated 3 months ago
Alternatives and similar repositories for lm-council
Users that are interested in lm-council are comparing it to the libraries listed below
- Evaluation framework for document processing models and services. ☆53 Updated this week
- ☆14 Updated last month
- Streamlit app for recommending eval functions using prompt diffs ☆29 Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆48 Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions ☆27 Updated 2 years ago
- ☆10 Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆44 Updated last year
- code for training and using chess embeddings models ☆12 Updated last year
- Benchmarks for Business Document Foundation Models ☆10 Updated last year
- ☆45 Updated 3 months ago
- Track the progress of LLM context utilisation ☆54 Updated 6 months ago
- ☆40 Updated 10 months ago
- LLM reads a paper and produces a working prototype ☆57 Updated 6 months ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆109 Updated 10 months ago
- Experimenting with LLMs to Research, Reflect, and Plan (LLM assistants, retrieval, and Discord integration) ☆34 Updated last year
- Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths ☆35 Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆23 Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an… ☆22 Updated 6 months ago
- KITE (Knowledge-Intensive Task Evaluation) is an end-to-end benchmark for RAG pipelines ☆21 Updated last year
- Very minimal (and stateless) agent framework ☆45 Updated 9 months ago
- Median is an open-source flashcard application that leverages the power of spaced repetition and artificial intelligence to transform the… ☆23 Updated last year
- create workflows with LLMs ☆54 Updated last year
- Solve Geometric & Graph Problems with Large Language Models ☆33 Updated 2 years ago
- Advanced Reasoning Benchmark Dataset for LLMs ☆46 Updated last year
- Small python package to measure OCR quality and other related metrics. ☆25 Updated last year
- The repository contains generative AI analytics platform application code. ☆28 Updated last month
- Self Organizing Maps (SOM) ML model can be used to conduct semantic search to populate context required for Retrieval Augmented Generatio… ☆15 Updated last year
- ☆73 Updated last year
- Based on the tree of thoughts paper ☆48 Updated 2 years ago
- Python library to use Pleias-RAG models ☆64 Updated 6 months ago