yifanlu0227 / MIT-6.5940
All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai
☆191Dec 2, 2023Updated 2 years ago
Alternatives and similar repositories for MIT-6.5940
Users who are interested in MIT-6.5940 are comparing it to the repositories listed below.
- Lab 5 project of MIT-6.5940, deploying LLaMA2-7B-chat on one's laptop with TinyChatEngine.☆18Dec 1, 2023Updated 2 years ago
- TinyML and Efficient Deep Learning Computing | MIT 6.S965/6.5940☆20Jan 16, 2026Updated last month
- Model acceleration / model compression (all labs completed)☆11Dec 24, 2023Updated 2 years ago
- TinyML and Efficient Deep Learning Computing☆19Apr 26, 2024Updated last year
- TinyChatEngine: On-Device LLM Inference Library☆940Jul 4, 2024Updated last year
- Learning material for CMU10-714: Deep Learning System☆301May 12, 2024Updated last year
- This repo contains the Assignments from Cornell Tech's ECE 5545 - Machine Learning Hardware and Systems offered in Spring 2023☆42May 31, 2023Updated 2 years ago
- papers of llm compression☆13Mar 6, 2024Updated last year
- UCB EECS126 : probability theory and random processes.☆18Sep 26, 2024Updated last year
- TensorRT-in-Action is a GitHub repository that provides code examples for using TensorRT, with accompanying Jupyter Notebooks.☆15Jun 1, 2023Updated 2 years ago
- Material for gpu-mode lectures☆5,726Feb 1, 2026Updated 2 weeks ago
- Advanced Programming - HW4☆16Apr 23, 2022Updated 3 years ago
- List of papers related to neural network quantization in recent AI conferences and journals.☆800Mar 27, 2025Updated 10 months ago
- Student files for CS245 Programming Assignment 1: In-memory data layout☆13Nov 16, 2022Updated 3 years ago
- code for ml2023-spring-hung-yi-lee(李宏毅)☆13Apr 10, 2023Updated 2 years ago
- This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several…☆1,239Jul 29, 2023Updated 2 years ago
- A curated list for Efficient Large Language Models☆1,950Jun 17, 2025Updated 7 months ago
- ☆31Apr 2, 2025Updated 10 months ago
- ☆291Aug 20, 2024Updated last year
- ☆41Nov 1, 2025Updated 3 months ago
- A block pruning framework for LLMs.☆27May 17, 2025Updated 8 months ago
- learning how CUDA works☆375Mar 3, 2025Updated 11 months ago
- flash attention tutorial written in python, triton, cuda, cutlass☆486Jan 20, 2026Updated 3 weeks ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆1,607Jul 12, 2024Updated last year
- Official implementation of the ICLR 2024 paper AffineQuant☆28Mar 30, 2024Updated last year
- tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF)☆29Dec 12, 2023Updated 2 years ago
- [ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.☆134May 16, 2024Updated last year
- 📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉☆9,666Updated this week
- [ICCV 2025] InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models☆113Jan 23, 2026Updated 3 weeks ago
- Puzzles for learning Triton, play it with minimal environment configuration!☆624Dec 28, 2025Updated last month
- Research in compressing convolutional layers of CNN using low-rank Tucker tensor decomposition☆11Nov 1, 2023Updated 2 years ago
- Code for "DittoGym: Learning to Control Soft Shape-Shifting Robots" by Suning Huang, Boyuan Chen, Huazhe Xu, and Vincent Sitzmann.☆30May 1, 2025Updated 9 months ago
- Complete self-learning materials of CS106L☆89Mar 31, 2024Updated last year
- Awesome LLM compression research papers and tools.☆1,776Nov 10, 2025Updated 3 months ago
- ☆152Jul 4, 2025Updated 7 months ago
- Awesome machine learning model compression research papers, quantization, tools, and learning material.☆540Sep 21, 2024Updated last year
- Embedded control system (ECS) software controls the overall behavior of ScanBot3D, an autonomous 3D reconstruction robot☆11Nov 1, 2018Updated 7 years ago
- ☆21Updated this week
- ☆10Feb 7, 2022Updated 4 years ago