This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REASONING".
☆31Jun 1, 2024Updated 2 years ago
Alternatives and similar repositories for LLM_MoT_cascade
Users that are interested in LLM_MoT_cascade are comparing it to the libraries listed below.
- Framework for Cost-Effective Language Model Choice☆16Dec 12, 2023Updated 2 years ago
- ☆13Nov 17, 2024Updated last year
- ☆16Jul 17, 2025Updated 11 months ago
- ☆11Feb 5, 2026Updated 4 months ago
- The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-ef…☆14Feb 12, 2026Updated 4 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆30Dec 10, 2024Updated last year
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Sep 15, 2023Updated 2 years ago
- English and Chinese LaTeX template for reports/projects/proposal at Beijing Institute of Technology☆10Nov 19, 2020Updated 5 years ago
- ☆17Mar 20, 2025Updated last year
- Supporting code for ReCEval paper☆32Sep 14, 2024Updated last year
- A trainable user simulator☆34Jun 30, 2025Updated 11 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆114Dec 12, 2024Updated last year
- ☆16Apr 8, 2026Updated 2 months ago
- ☆26Sep 3, 2025Updated 9 months ago
- ☆11Mar 24, 2023Updated 3 years ago
- K-means algorithm implementation in Javascript.☆20Mar 5, 2026Updated 3 months ago
- Code for the ICLR 2019 paper "Learning to Represent Edits"☆13Dec 8, 2022Updated 3 years ago
- Concurrent inverse Bloom filter.☆15Feb 3, 2015Updated 11 years ago
- SKT A.X LLM 3.1☆13Jul 24, 2025Updated 10 months ago
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)☆117Mar 20, 2025Updated last year
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆17Nov 11, 2024Updated last year
- Hierarchical Attention Network based Explainable Knowledge Tracing☆10May 18, 2022Updated 4 years ago
- A collection of research papers related to Natural Language Reasoning☆10May 27, 2022Updated 4 years ago
- ☆13Dec 18, 2024Updated last year
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Dec 13, 2024Updated last year
- ☆78Feb 28, 2026Updated 3 months ago
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆120Jun 3, 2025Updated last year
- ☆10Aug 24, 2023Updated 2 years ago
- ☆10Sep 6, 2024Updated last year
- Implementation of Bitune: Bidirectional Instruction-Tuning☆27Jun 19, 2025Updated 11 months ago
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"☆12Apr 27, 2022Updated 4 years ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆15Jun 26, 2025Updated 11 months ago
- Exposure-slot: Exposure-centric representations learning with Slot-in-Slot Attention for Region-aware Exposure Correction, Computer Visi…☆26Sep 2, 2025Updated 9 months ago
- [NeurIPS'24] Grammar-Aligned Decoding: An algorithm to constrain LLMs' outputs without distorting its original distribution☆28Feb 10, 2025Updated last year
- [ICCV2023] Borrowing Knowledge From Pre-trained Language Model: A New Data-efficient Visual Learning Paradigm☆18Sep 28, 2023Updated 2 years ago
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 3 years ago
- [ICCV 2023 Oral] Official PyTorch implementation of our paper for semi-supervised continual learning "A soft nearest-neighbor framework f…☆25Dec 17, 2024Updated last year
- Implementation of self-certainty as an extension of ZeroEval Project☆36May 31, 2025Updated last year
- Efficient LLM query routing via multi-sampling. BEST-Route selects both model and number of responses based on query difficulty, cutting …☆58Apr 8, 2026Updated 2 months ago