automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆97 Updated last year
Related projects
Alternatives and complementary repositories for automix
- Just a bunch of benchmark logs for different LLMs ☆116 Updated 3 months ago
- Functional Benchmarks and the Reasoning Gap ☆78 Updated last month
- ☆48 Updated last year
- Evaluating LLMs with CommonGen-Lite ☆85 Updated 8 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 4 months ago
- ☆41 Updated 2 weeks ago
- ☆87 Updated 9 months ago
- ☆37 Updated this week
- Comprehensive analysis of the difference in performance of QLoRA, LoRA, and full fine-tunes. ☆81 Updated last year
- ☆74 Updated 3 weeks ago
- An implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated 10 months ago
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated 9 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆93 Updated 5 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆72 Updated 2 months ago
- Code repository for the c-BTM paper ☆105 Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆203 Updated 6 months ago
- ☆72 Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆97 Updated 7 months ago
- ☆38 Updated 4 months ago
- Camel-Coder: Collaborative task completion with multiple agents. Role-based prompts, intervention mechanism, and thoughtful suggestions ☆33 Updated last year
- ☆24 Updated last year
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆130 Updated this week
- ☆78 Updated 11 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆128 Updated last month
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆113 Updated 3 weeks ago
- Track the progress of LLM context utilisation ☆53 Updated 4 months ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆93 Updated 3 months ago
- ☆94 Updated 2 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆82 Updated 2 months ago
- The Next Generation Multi-Modality Superintelligence ☆70 Updated 2 months ago