machine-theory / lm-council
LLMs sitting on a council together to decide, by consensus, who among them is the best.
☆284 · Jul 20, 2025 · Updated 6 months ago
Alternatives and similar repositories for lm-council
Users who are interested in lm-council are comparing it to the libraries listed below.
- Landing page + leaderboard for SWE-Bench benchmark ☆11 · Jan 26, 2026 · Updated 2 weeks ago
- Datasette plugin providing a UI for executing SQL writes against the database ☆12 · Nov 11, 2025 · Updated 3 months ago
- The goal is to pilot Microsoft Cognitive Services to unlock the strategic value of UN unstructured content by building on AI and semantic… ☆16 · Jul 6, 2023 · Updated 2 years ago
- A website to store all my tests for ease of access. ☆23 · Feb 28, 2025 · Updated 11 months ago
- Official repository for Decentralized Arena via Collective LLM Intelligence ☆17 · May 19, 2025 · Updated 8 months ago
- SQL functions for calling OpenAI APIs ☆22 · Jan 14, 2023 · Updated 3 years ago
- ☆34 · Jan 21, 2026 · Updated 3 weeks ago
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods. ☆23 · Jul 26, 2023 · Updated 2 years ago
- Datasette plugin for uploading CSV files and converting them to database tables ☆27 · Nov 10, 2025 · Updated 3 months ago
- Datasette plugin for rendering Markdown ☆31 · Aug 15, 2023 · Updated 2 years ago
- Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark. ☆26 · Jul 6, 2023 · Updated 2 years ago
- Repository for R for Data Mining Class at Chulalongkorn University ☆10 · Apr 18, 2018 · Updated 7 years ago
- This framework aims to assist in the documentation of datasets to promote transparency and help dataset creators and consumers make info… ☆37 · Jun 23, 2024 · Updated last year
- ☆31 · Nov 14, 2024 · Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 · Aug 25, 2023 · Updated 2 years ago
- ☆13 · Sep 9, 2022 · Updated 3 years ago
- ComfyUI custom node to extend Wan videos in loops with overlap consistency, per loop prompts, and optional LoRA control. ☆25 · Nov 29, 2025 · Updated 2 months ago
- An AI-powered equity research analyst demo using Large Language Models to analyze 10-K filings of renowned NYSE listed companies. ☆36 · Nov 28, 2023 · Updated 2 years ago
- A step by step guide on how you can crack the AWS Certified Cloud Practitioner Exam ☆10 · Jan 21, 2023 · Updated 3 years ago
- Build queries with an elegant WordPress-oriented query builder ☆13 · Jan 23, 2025 · Updated last year
- TAXII 2.0 Server implemented in Node JS with MongoDB backend ☆12 · Jan 3, 2023 · Updated 3 years ago
- ☆10 · Jul 6, 2023 · Updated 2 years ago
- Quickstart template to build real-time voice AI solutions using the LiveKit Agent framework with WebRTC, open-source and local or 3rd par… ☆17 · Dec 22, 2025 · Updated last month
- Basic ROS Control of GoPiGo Robot ☆14 · Aug 23, 2017 · Updated 8 years ago
- Evaluating LLMs with CommonGen-Lite ☆94 · Mar 21, 2024 · Updated last year
- Base mech ☆39 · Jan 8, 2026 · Updated last month
- Social Watcher on Facebook Marketing API ☆10 · Jul 20, 2022 · Updated 3 years ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects. ☆22 · Sep 20, 2025 · Updated 4 months ago
- DoctorRAG is a medical AI that mimics doctor-like reasoning by combining textbook knowledge with insights from similar patient cases, usi… ☆15 · May 21, 2025 · Updated 8 months ago
- ☆11 · Jan 27, 2026 · Updated 2 weeks ago
- ERT webviz plugins ☆13 · Jan 9, 2026 · Updated last month
- ROS package to send a sequence of navigation goals read from a YAML file to move_base (C++) ☆11 · Feb 19, 2020 · Updated 5 years ago
- ☆12 · Jan 11, 2026 · Updated last month
- LiteLLM model integration for Pydantic AI framework - access 100+ LLM providers through a unified interface ☆19 · Nov 19, 2025 · Updated 2 months ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha… ☆11 · Nov 26, 2024 · Updated last year
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference … ☆14 · Dec 12, 2024 · Updated last year
- PETSc Interface for Octave and MATLAB (Deprecated) ☆10 · Nov 10, 2022 · Updated 3 years ago
- Official Code for the NeurIPS'25 paper: Selective Learning for Deep Time Series Forecasting ☆33 · Nov 7, 2025 · Updated 3 months ago
- A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gp… ☆16 · Mar 11, 2025 · Updated 11 months ago