zwxandy / Awesome-Efficient-CoT-Reasoning-Summary
🔥 How to efficiently and effectively compress CoTs, or directly generate concise CoTs during inference, while maintaining reasoning performance is an important topic!
☆64 · May 22, 2025 · Updated 8 months ago
Alternatives and similar repositories for Awesome-Efficient-CoT-Reasoning-Summary
Users who are interested in Awesome-Efficient-CoT-Reasoning-Summary are comparing it to the repositories listed below.
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models. ☆16 · Apr 8, 2025 · Updated 10 months ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert… ☆14 · Feb 4, 2025 · Updated last year
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs ☆15 · Jul 18, 2024 · Updated last year
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer ☆30 · Dec 6, 2023 · Updated 2 years ago
- Awesome-Parallel-Reasoning: Unlocking the reasoning potential of LLMs. Papers, Code, Resources & Survey. ☆48 · Jan 6, 2026 · Updated last month
- Analyzing LLM Alignment via Token distribution shift ☆17 · Jan 26, 2024 · Updated 2 years ago
- Paper list for Efficient Reasoning. ☆822 · Feb 11, 2026 · Updated last week
- [ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models". ☆23 · Mar 16, 2025 · Updated 11 months ago
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities ☆27 · Apr 2, 2025 · Updated 10 months ago
- Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs ☆23 · Nov 11, 2025 · Updated 3 months ago
- A website for accessing many models through an API (DeepSeek, Qwen, Hunyuan, etc.) ☆17 · Jul 12, 2025 · Updated 7 months ago
- ☆31 · Nov 11, 2024 · Updated last year
- Advanced Machine Learning Course ☆12 · Nov 16, 2024 · Updated last year
- ☆63 · Jul 14, 2025 · Updated 7 months ago
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free ☆52 · Apr 6, 2025 · Updated 10 months ago
- ☆92 · Mar 28, 2025 · Updated 10 months ago
- Official PyTorch implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity" ☆82 · Jul 7, 2025 · Updated 7 months ago
- This repo summarizes papers for efficient PPML across protocol, model, and system levels. ☆67 · Jan 18, 2026 · Updated last month
- ☆39 · Aug 27, 2024 · Updated last year
- Official implementation of the ICLR paper "Streamlining Redundant Layers to Compress Large Language Models" ☆40 · May 1, 2025 · Updated 9 months ago
- A Mechanistic‑Interpretability study that finds the structural dynamics of Large Language Models under fine‑tuning. ☆16 · May 30, 2025 · Updated 8 months ago
- Papers about training data quality management for ML models. ☆109 · Oct 15, 2025 · Updated 4 months ago
- [NeurIPS'24] Official implementation of "PrivCirNet: Efficient Private Inference via Block Circulant Transformation" ☆15 · May 28, 2025 · Updated 8 months ago
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2… ☆14 · Dec 7, 2024 · Updated last year
- Chameleon: A MatMul-Free TCN Accelerator for End-to-End Few-Shot and Continual Learning from Sequential Data ☆25 · Jun 6, 2025 · Updated 8 months ago
- ☆12 · Apr 22, 2024 · Updated last year
- ☆16 · Feb 23, 2025 · Updated 11 months ago
- 🔥 [NeurIPS 2024] A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embed… ☆13 · Jun 21, 2025 · Updated 7 months ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion" ☆14 · Mar 28, 2024 · Updated last year
- KAF: Kolmogorov-Arnold Fourier Networks ☆20 · Feb 19, 2025 · Updated 11 months ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024) ☆16 · Sep 28, 2024 · Updated last year
- ☆18 · Oct 14, 2025 · Updated 4 months ago
- ☆10 · May 26, 2022 · Updated 3 years ago
- The implementation of Graph Image Prior (GIP) for Unsupervised Dynamic MRI Reconstruction ☆12 · Apr 27, 2024 · Updated last year
- This is a sample implementation of "Robust Graph Convolutional Networks Against Adversarial Attacks", KDD 2019. ☆10 · Dec 8, 2020 · Updated 5 years ago
- Integration test of Verilog AXI modules (https://github.com/alexforencich/verilog-axi) with LiteX. ☆17 · Dec 19, 2022 · Updated 3 years ago
- ☆10 · Feb 12, 2024 · Updated 2 years ago
- Decompresses the data from the <时光印记> software ☆17 · Sep 24, 2021 · Updated 4 years ago
- ☆13 · Oct 3, 2024 · Updated last year