Relaxed-System-Lab / COMP6211J_Course_HKUSTLinks

☆42

Alternatives and similar repositories for COMP6211J_Course_HKUST

Users that are interested in COMP6211J_Course_HKUST are comparing it to the libraries listed below

Sorting:

hemingkx / Awesome-Efficient-Reasoning
Paper list for Efficient Reasoning.
☆548Updated 3 weeks ago
hemingkx / SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️
☆828Updated 3 weeks ago
withinmiaov / A-Survey-on-Mixture-of-Experts-in-LLMs
[TKDE'25] The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".
☆390Updated 3 weeks ago
October2001 / Awesome-KV-Cache-Compression
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
☆484Updated 3 weeks ago
Eclipsess / Awesome-Efficient-Reasoning-LLMs
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
☆519Updated 2 weeks ago
Zefan-Cai / Awesome-LLM-KV-Cache
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
☆330Updated 4 months ago
UbiquitousLearning / Efficient_Foundation_Model_Survey
Survey Paper List - Efficient LLM and Foundation Models
☆252Updated 9 months ago
feifeibear / LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
☆779Updated 10 months ago
zepingyu0512 / awesome-llm-understanding-mechanism
awesome papers in LLM interpretability
☆522Updated 3 weeks ago
zwxandy / Awesome-Efficient-CoT-Reasoning-Summary
🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasonin…
☆55Updated last month
Hongcheng-Gao / Awesome-Long2short-on-LRMs
Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…
☆235Updated last month
HArmonizedSS / HASS
Official Implementation of "Learning Harmonized Representations for Speculative Sampling" (HASS)
☆43Updated 4 months ago
TsinghuaC3I / Awesome-RL-Reasoning-Recipes
Awesome RL Reasoning Recipes ("Triple R")
☆745Updated last month
hemingkx / Spec-Bench
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
☆285Updated 2 months ago
XueFuzhao / awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
☆1,161Updated 7 months ago
codecaution / Awesome-Mixture-of-Experts-Papers
A curated reading list of research in Mixture-of-Experts(MoE).
☆638Updated 8 months ago
MoE-Inf / awesome-moe-inference
Curated collection of papers in MoE model inference
☆210Updated 4 months ago
openpsi-project / ReaLHF
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
☆306Updated 2 months ago
THUDM / ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
☆652Updated 5 months ago
FMInference / H2O
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
☆459Updated 11 months ago
ZO-Bench / ZO-LLM
[ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".
☆105Updated last week
THUDM / slime
slime is a LLM post-training framework aiming for RL Scaling.
☆596Updated this week
TreeAI-Lab / Awesome-KV-Cache-Management
This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co…
☆164Updated 3 weeks ago
dilab-zju / self-speculative-decoding
Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**
☆194Updated 5 months ago
AmadeusChan / Awesome-LLM-System-Papers
☆601Updated 2 months ago
zzli2022 / Awesome-System2-Reasoning-LLM
Latest Advances on System-2 Reasoning
☆1,180Updated last month
EIT-NLP / Awesome-Latent-CoT
This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.
☆134Updated this week
mit-han-lab / Quest
[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
☆305Updated last week
gta0804 / MASS
Official implementation of MASS: Multi-Agent Simulation Scaling for Portfolio Construction
☆139Updated last month
Guangxuan-Xiao / GSM8K-eval
☆44Updated last year