machine-theory / lm-council
LLMs sitting on a council together to decide, by consensus, who among them is the best.
☆18Updated 3 weeks ago
Alternatives and similar repositories for lm-council
Users that are interested in lm-council are comparing it to the libraries listed below
- ☆8Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- The repository contains generative AI analytics platform application code.☆26Updated 3 months ago
- ☆41Updated last year
- LLM reads a paper and produces a working prototype☆58Updated 3 months ago
- Streamlit app for recommending eval functions using prompt diffs☆29Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 9 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆105Updated 7 months ago
- create workflows with LLMs☆54Updated last year
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models☆28Updated last year
- LangCode - Improving alignment and reasoning of large language models (LLMs) with natural language embedded program (NLEP).☆43Updated last year
- ☆33Updated last week
- ☆21Updated 5 months ago
- ☆48Updated 10 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆45Updated last year
- Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models"☆42Updated 4 months ago
- ☆20Updated last year
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development☆20Updated 2 years ago
- ☆75Updated last year
- ☆18Updated this week
- Universal text classifier for generative models☆24Updated last year
- ☆23Updated 2 months ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆29Updated last year
- Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths☆35Updated last year
- Track the progress of LLM context utilisation☆55Updated 3 months ago
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.☆18Updated 2 years ago
- Benchmarks for Business Document Foundation Models☆10Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆26Updated 8 months ago
- Reasoning by Communicating with Agents☆29Updated 3 months ago