kvignesh1420 / cot-icl-labLinks
[ACL 2025] CoT-ICL Lab: A Synthetic Framework for Studying Chain-of-Thought Learning from In-Context Demonstrations
β11Updated last month
Alternatives and similar repositories for cot-icl-lab
Users that are interested in cot-icl-lab are comparing it to the libraries listed below
Sorting:
- Building blocks for foundation models.β516Updated last year
- π° Must-read papers on KV Cache Compression (constantly updating π€).β484Updated 3 weeks ago
- β512Updated last year
- Minimalistic 4D-parallelism distributed training framework for education purposeβ1,598Updated last week
- π° Must-read papers and blogs on Speculative Decoding β‘οΈβ828Updated 3 weeks ago
- A bibliography and survey of the papers surrounding o1β1,206Updated 8 months ago
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problemsβ475Updated this week
- LLM KV cache compression made easyβ538Updated this week
- Systems for GenAIβ142Updated 2 months ago
- Scalable toolkit for efficient model reinforcementβ521Updated this week
- Perplexity GPU Kernelsβ405Updated this week
- Puzzles for learning Tritonβ1,760Updated 8 months ago
- Awesome-LLM-KV-Cache: A curated list of πAwesome LLM KV Cache Papers with Codes.β330Updated 4 months ago
- Zero Bubble Pipeline Parallelismβ406Updated 2 months ago
- β601Updated 2 months ago
- Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline modβ¦β509Updated 10 months ago
- Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)β285Updated 2 months ago
- Minimalistic large language model 3D-parallelism trainingβ2,034Updated last week
- Curated collection of papers in machine learning systemsβ384Updated last month
- β11Updated 8 months ago
- Ring attention implementation with flash attentionβ802Updated 2 weeks ago
- Large Language Model (LLM) Systems Paper Listβ1,362Updated last week
- Distributed Compiler based on Triton for Parallel Systemsβ891Updated this week
- A PyTorch Native LLM Training Frameworkβ831Updated last week
- π³ Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"β725Updated 4 months ago
- My learning notes/codes for ML SYS.β2,928Updated this week
- Disaggregated serving system for Large Language Models (LLMs).β642Updated 3 months ago
- ByteCheckpoint: An Unified Checkpointing Library for LFMsβ226Updated last week
- [TMLR 2024] Efficient Large Language Models: A Surveyβ1,186Updated 3 weeks ago
- Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3.β1,402Updated last week