Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing
☆79Jan 8, 2025Updated last year
Alternatives and similar repositories for MIT6.5940_TinyML
Users that are interested in MIT6.5940_TinyML are comparing it to the libraries listed below.
- ☆178Aug 9, 2023Updated 2 years ago
- Cute layout visualization☆40Jan 18, 2026Updated 5 months ago
- A simplified flash-attention implementation using cutlass, intended for teaching☆59Aug 12, 2024Updated last year
- Boosted E-Graph Extraction with Adaptive Heuristics and Exact Solving☆30Jan 7, 2026Updated 5 months ago
- This repo contains the code developed for my master thesis☆13Mar 24, 2022Updated 4 years ago
- The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221☆31Apr 22, 2025Updated last year
- A template-based, layer-oriented High Level Synthesis Tool for AI algorithms☆15Apr 28, 2026Updated last month
- Automatically pushes daily summaries of the latest ArXiv papers. Inspired by https://github.com/curryfromuestc/arxiv_paper_tracker☆18Nov 1, 2025Updated 7 months ago
- ☆16Mar 26, 2025Updated last year
- [MLSys 2026] AccelOpt: Self-improving Agents for AI Accelerator Kernel Optimization☆55Jun 7, 2026Updated last week
- MICRO 2024 Evaluation Artifact for FuseMax☆17Aug 26, 2024Updated last year
- Proposal for the next generation of course-oriented IR.☆10Dec 24, 2021Updated 4 years ago
- ☆12Oct 30, 2024Updated last year
- MPI Code Generation through Domain-Specific Language Models☆16Nov 19, 2024Updated last year
- A disk-based HashMap implementation allowing persistence of data across sessions.☆15May 7, 2014Updated 12 years ago
- Model acceleration / model compression (all labs completed)☆11Dec 24, 2023Updated 2 years ago
- Vector search with bounded performance.☆35Jan 26, 2024Updated 2 years ago
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆341Jan 5, 2025Updated last year
- A CUDA runtime environment based on the CUDA Driver API☆16Jul 30, 2025Updated 10 months ago
- Summary of the Specs of Commonly Used GPUs for Training and Inference of LLM☆78Aug 12, 2025Updated 10 months ago
- ☆16Nov 28, 2024Updated last year
- Asynchronous pipeline parallel optimization☆22Feb 2, 2026Updated 4 months ago
- Experiment: llama2 inference implemented in Rust☆17Feb 23, 2024Updated 2 years ago
- Operator library (Rust)☆15Jul 24, 2025Updated 10 months ago
- ☆54Apr 30, 2025Updated last year
- Memory Tagging ISA extension that can be used by software to enforce memory tag checks on memory loads and stores☆35May 20, 2026Updated 3 weeks ago
- Collected materials for the ZJU Introduction to Mao Zedong Thought (毛概) course☆11Mar 16, 2024Updated 2 years ago
- Cryptocurrency Design and Engineering class Fall 2025☆60Feb 19, 2026Updated 3 months ago
- Vocabulary-memorization mini program for the GRE word list 再要你命3K☆24Jul 8, 2018Updated 7 years ago
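- An open-source math LLM project that aims to explore whether large models are capable of mathematical creativity, and what potential they have in frontier mathematical research.☆20Mar 19, 2026Updated 3 months ago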
- ☆41May 17, 2026Updated last month
- 《自己动手写AI编译器》 (Write Your Own AI Compiler)☆40Oct 19, 2024Updated last year
- Try HopWeaver: The first automatic synthesis framework based on any corpora, with quality approaching manual annotation.☆27Apr 7, 2026Updated 2 months ago
- Final project code for the graduate course "Theory and Applications of Network Big Data Management"☆13Dec 31, 2022Updated 3 years ago
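- 2026 tutorial on circumventing internet censorship: an illustrated guide to Clash proxy providers + VPN + VPS, continuously updated, taking you from 0 to 1 in getting past network blocking☆152Jun 8, 2026Updated last week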
- AAAI2025☆13Apr 18, 2025Updated last year
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆111Jun 28, 2025Updated 11 months ago
- This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.☆14May 2, 2024Updated 2 years ago
- Empowering LLM Agents for Real-World Computer System Optimization☆18Sep 10, 2025Updated 9 months ago