MurongYue/LLM_MoT_cascade

This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REASONING".

☆32

Alternatives and similar repositories for LLM_MoT_cascade

Users interested in LLM_MoT_cascade are comparing it to the repositories listed below.

sparkle-reasoning / sparkle
[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning
☆16 · Dec 12, 2025 · Updated 7 months ago
nowazrabbani / pMoE_CNN
The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-ef…
☆14 · Feb 12, 2026 · Updated 5 months ago
LukasHedegaard / structured-pruning-adapters
Structured Pruning Adapters in PyTorch
☆19 · Aug 30, 2023 · Updated 2 years ago
Tebmer / Rereading-LLM-Reasoning
EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…
☆30 · Dec 10, 2024 · Updated last year
UCSC-VLAA / Sight-Beyond-Text
[TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"
☆20 · Sep 15, 2023 · Updated 2 years ago
CharlieLeee / BIT-Report-LaTeX
English and Chinese LaTeX template for reports/projects/proposal at Beijing Institute of Technology
☆10 · Nov 19, 2020 · Updated 5 years ago
princeton-pli / STAT
Skill-Targeted Adaptive Training
☆24 · Mar 12, 2026 · Updated 4 months ago
microsoft / iclr2019-learning-to-represent-edits
Code for the ICLR 2019 paper "Learning to Represent Edits"
☆13 · Dec 8, 2022 · Updated 3 years ago
TIGER-AI-Lab / Program-of-Thoughts
Data and Code for Program of Thoughts [TMLR 2023]
☆317 · May 15, 2024 · Updated 2 years ago
brian-lou / Training-Data-Extraction-Attack-on-LLMs
This project explores training data extraction attacks on the LLaMa 7B, GPT-2XL, and GPT-2-IMDB models to discover memorized content usin…
☆15 · Jun 15, 2023 · Updated 3 years ago
tic-top / LoraCSE
😜 Contrastive Learning of Sentence Embedding using LoRA (EECS487 final project)
☆13 · Apr 19, 2023 · Updated 3 years ago
joey-wang123 / DRO-Task-free
Code for Improving Task-free Continual Learning by Distributionally Robust Memory Evolution (ICML 2022)
☆11 · Aug 20, 2022 · Updated 3 years ago
leondelee / PointGCN
☆11 · Mar 24, 2023 · Updated 3 years ago
guardrails-ai / detect_pii
Guardrails AI: PII Filter - Validates that any text does not contain any PII
☆17 · Updated this week
tylertreat / InverseBloomFilter
Concurrent inverse Bloom filter.
☆15 · Feb 3, 2015 · Updated 11 years ago
DeqingYang / ACAM-model
A deep recommendation model with attribute-level co-attention, accepted as a short paper at SIGIR 2020.
☆10 · Aug 13, 2020 · Updated 5 years ago
thunlp / Ouroboros
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)
☆117 · Mar 20, 2025 · Updated last year
microsoft / compositional-generalization-span-level-attention
Code for the NAACL 2021 paper "Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention" by Microsoft S…
☆12 · Apr 21, 2023 · Updated 3 years ago
ppetraki / meson-android-helloworld
Meson Android build PoC
☆11 · Oct 29, 2019 · Updated 6 years ago
kyegomez / PaLM2-VAdapter
Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…
☆17 · Nov 11, 2024 · Updated last year
sroy9 / mawps
Code for MAWPS: A Math Word Problem Repository
☆41 · Mar 23, 2023 · Updated 3 years ago
jacky121298 / WLST
[ICRA 2024] WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection
☆12 · Feb 6, 2024 · Updated 2 years ago
SKT-AI / A.X-3
SKT A.X LLM 3.1
☆13 · Jul 24, 2025 · Updated last year
WeiminXiong / RationaleCL
Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)
☆12 · Oct 11, 2023 · Updated 2 years ago
dreamingfish2011 / ml_dl
Machine learning and deep learning exercises
☆13 · Apr 24, 2019 · Updated 7 years ago
cyzus / thoughtsculpt
THOUGHTSCULPT, a general reasoning and search method for complex tasks
☆13 · Dec 13, 2024 · Updated last year
ibm-self-serve-assets / MetaGen-Blended-RAG
☆17 · Aug 5, 2025 · Updated 11 months ago
boostcampaitech3 / final-project-level3-cv-17
[2022.05.16 ~ 2022.06.10] 🌤️ Clear photos free of fine dust 📷 - Boostcamp AI Tech 3rd cohort final project
☆14 · Jun 11, 2022 · Updated 4 years ago
karlchahine / Neural-Cover-Selection-for-Image-Steganography
☆13 · Dec 18, 2024 · Updated last year
skaiworldwide-oss / postgres-xl-ha
☆11 · Jun 26, 2017 · Updated 9 years ago
dkopi / Bitune
Implementation of Bitune: Bidirectional Instruction-Tuning
☆27 · Jun 19, 2025 · Updated last year
ShiyuNee / Awesome-Calibration-Papers
A curated list of awesome papers about calibration
☆15 · May 6, 2024 · Updated 2 years ago
polo5 / FDS
Gradient-based Hyperparameter Optimization Over Long Horizons
☆14 · Sep 29, 2021 · Updated 4 years ago
z3dm4n / k8s-dev-cluster-hcloud
Minimal, highly available (HA) Kubernetes cluster on Hetzner Cloud, up and running in under 10 minutes.
☆11 · Apr 23, 2026 · Updated 3 months ago
adityagilra / archibrain
Synthesize bio-plausible neural networks for cognitive tasks, mimicking brain architecture
☆11 · Apr 14, 2021 · Updated 5 years ago
LINC-BIT / FedKNOW
☆16 · Sep 30, 2024 · Updated last year
allenai / feb
Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"
☆12 · Apr 27, 2022 · Updated 4 years ago
BIT-DA / BorLan
[ICCV2023] Borrowing Knowledge From Pre-trained Language Model: A New Data-efficient Visual Learning Paradigm
☆18 · Sep 28, 2023 · Updated 2 years ago
VITA-Group / Robust_Weight_Signatures
[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
☆16 · May 4, 2023 · Updated 3 years ago