machine-theory / lm-councilLinks

LLMs sitting on a council together to decide, by consensus, who among them is the best.

☆18

Alternatives and similar repositories for lm-council

Users that are interested in lm-council are comparing it to the libraries listed below

Sorting:

darrow-labs / LegalLens
☆8Updated last year
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆43Updated last year
kumar-shridhar / Screws
SCREWS: A Modular Framework for Reasoning with Revisions
☆27Updated last year
IBM / InspectorRAGet
The repository contains generative AI analytics platform application code.
☆26Updated 3 months ago
lamini-ai / lamini-earnings-calls
☆41Updated last year
phunterlau / paper_without_code
LLM reads a paper and produce a working prototype
☆58Updated 3 months ago
langchain-ai / prompt-eval-recommendation
Streamlit app for recommending eval functions using prompt diffs
☆29Updated last year
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆75Updated 9 months ago
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105Updated 7 months ago
shane-kercheval / llm-workflow
create workflows with LLMs
☆54Updated last year
pygongnlp / CoSearchAgent
[SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models
☆28Updated last year
luohongyin / LangCode
LangCode - Improving alignment and reasoning of large language models (LLMs) with natural language embedded program (NLEP).
☆43Updated last year
uiuc-kang-lab / agentic-benchmarks
☆33Updated last week
SalesforceAIResearch / text2data
☆21Updated 5 months ago
padas-lab-de / ir-rag-sigir24-persona-rag
☆48Updated 10 months ago
S1M0N38 / dspy-arxiv
Explore the use of DSPy for extracting features from PDFs 🔎
☆45Updated last year
microsoft / Structured-Entity-Extraction
Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models"
☆42Updated 4 months ago
iulia-b10 / multilingual-embedding-models
☆20Updated last year
coastalcph / lexlms
LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development
☆20Updated 2 years ago
BerriAI / bettertest
☆75Updated last year
epoch-research / training-cost-trends
☆18Updated this week
Knowledgator / unlimited_classifier
Universal text classifier for generative models
☆24Updated last year
benediktstroebl / agent-evals
☆23Updated 2 months ago
stanford-oval / ovalchat
OVALChat is a customizable Web app aimed at conducting user studies with chatbots
☆29Updated last year
aymeric-roucher / LongContext_vs_RAG_NeedleInAHaystack
Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths
☆35Updated last year
stunningpixels / lou-eval
Track the progress of LLM context utilisation
☆55Updated 3 months ago
gilfernandes / chat_functions
Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.
☆18Updated 2 years ago
docugami / DFM-benchmarks
Benchmarks for Business Document Foundation Models
☆10Updated last year
Tebmer / Rereading-LLM-Reasoning
EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…
☆26Updated 8 months ago
allenai / recoma
Reasoning by Communicating with Agents
☆29Updated 3 months ago