kvignesh1420 / cot-icl-labLinks
[ACL 2025] CoT-ICL Lab: A Synthetic Framework for Studying Chain-of-Thought Learning from In-Context Demonstrations
☆10Updated 2 weeks ago
Alternatives and similar repositories for cot-icl-lab
Users that are interested in cot-icl-lab are comparing it to the libraries listed below
Sorting:
- Building blocks for foundation models.☆502Updated last year
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems☆374Updated this week
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels☆1,250Updated this week
- 📰 Must-read papers and blogs on Speculative Decoding ⚡️☆763Updated this week
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,518Updated this week
- Scalable toolkit for efficient model reinforcement☆385Updated this week
- LLM KV cache compression made easy☆497Updated this week
- 📰 Must-read papers on KV Cache Compression (constantly updating 🤗).☆431Updated last week
- ☆293Updated 5 months ago
- Puzzles for learning Triton☆1,671Updated 6 months ago
- Scalable toolkit for efficient model alignment☆807Updated this week
- ☆10Updated 7 months ago
- Minimalistic large language model 3D-parallelism training☆1,898Updated last week
- Distributed Triton for Parallel Systems☆775Updated last week
- A PyTorch Native LLM Training Framework☆816Updated 5 months ago
- Zero Bubble Pipeline Parallelism☆396Updated last month
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆173Updated this week
- FlashInfer: Kernel Library for LLM Serving☆3,088Updated this week
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS☆183Updated last month
- Official implementation for Yuan & Liu & Zhong et al., KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark o…☆78Updated 3 months ago
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM☆1,441Updated this week
- ☆487Updated 10 months ago
- An ML Systems Onboarding list☆794Updated 4 months ago
- Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.☆307Updated 3 months ago
- A bibliography and survey of the papers surrounding o1☆1,194Updated 6 months ago
- ☆591Updated 3 weeks ago
- Ring attention implementation with flash attention☆774Updated 2 weeks ago
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Updated 2 years ago
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash…☆249Updated this week
- Large Language Model (LLM) Systems Paper List☆1,254Updated this week