TinyML and Efficient Deep Learning Computing
☆20Apr 26, 2024Updated 2 years ago
Alternatives and similar repositories for mit6.5940-2023
Users that are interested in mit6.5940-2023 are comparing it to the libraries listed below.
- All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai☆197Dec 2, 2023Updated 2 years ago
- papers of llm compression☆13Mar 6, 2024Updated 2 years ago
- [CVPR2026] BinaryAttention: One-Bit QK-Attention for Vision and Diffusion Transformers☆33Mar 17, 2026Updated 2 months ago
- The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication."☆15Jan 27, 2023Updated 3 years ago
- 📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).☆80Apr 26, 2025Updated last year
- Wan: Open and Advanced Large-Scale Video Generative Models☆29Jul 28, 2025Updated 10 months ago
- ☆18Dec 2, 2025Updated 5 months ago
- [ICML2024 Spotlight] Fine-Tuning Pre-trained Large Language Models Sparsely☆24Jun 26, 2024Updated last year
- The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv …☆51Jun 6, 2025Updated 11 months ago
- Lab 5 project of MIT-6.5940, deploying LLaMA2-7B-chat on one's laptop with TinyChatEngine.☆18Dec 1, 2023Updated 2 years ago
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"☆65Jul 1, 2025Updated 10 months ago
- A comprehensive repository for Compute Express Link (CXL) resources: covering research papers, specifications, simulation/emulation tools…☆25Feb 24, 2026Updated 3 months ago
- UNIST blackboard web extension program☆12Apr 20, 2023Updated 3 years ago
- ☆19Dec 3, 2019Updated 6 years ago
- The official implementation of the AAAI 2024 paper Bi-ViT.☆13Dec 18, 2023Updated 2 years ago
- flex-block-attn: an efficient block sparse attention computation library☆131Dec 26, 2025Updated 5 months ago
- Streaming Video Diffusion: Online Video Editing with Diffusion Models☆18Jun 3, 2024Updated last year
- MNIST accelerator using binary quantization on Xilinx pynq-z2☆14Sep 4, 2024Updated last year
- su su su supernova☆25Jan 9, 2025Updated last year
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆39Feb 27, 2024Updated 2 years ago
- ☆49Apr 15, 2024Updated 2 years ago
- ☆15Apr 28, 2023Updated 3 years ago
- An open-source simulator framework for neural processing units☆39Mar 23, 2026Updated 2 months ago
- Tool to create and import diff of docker images (to transfer only new layers)☆15Oct 25, 2019Updated 6 years ago
- Sample files for fuzzing ImageMagick☆19May 10, 2017Updated 9 years ago
- LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model☆117Apr 28, 2026Updated last month
- The code of Zero To Production In Rust for exercise☆22Jun 18, 2023Updated 2 years ago
- NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing☆119Jun 19, 2024Updated last year
- A command line utility effectively replicating `docker save` except that it will only save the LAST layer of the image in the output arch…☆25Apr 5, 2019Updated 7 years ago
- Nsight Systems In Docker☆21Dec 21, 2023Updated 2 years ago
- OSS-Fuzz Public Corpora Crawler☆30Feb 23, 2023Updated 3 years ago
- A curated list of sanitizers to detect bugs☆29Feb 2, 2026Updated 3 months ago
- FPN modified from https://github.com/matterport/Mask_RCNN☆28Mar 2, 2018Updated 8 years ago
- This repo contains the Assignments from Cornell Tech's ECE 5545 - Machine Learning Hardware and Systems offered in Spring 2023☆45May 31, 2023Updated 2 years ago
- ☆52Aug 6, 2024Updated last year
- ☆179Aug 9, 2023Updated 2 years ago
- Download Docker images manually☆29Dec 15, 2023Updated 2 years ago
- ☆43Jan 18, 2025Updated last year
- A Docker Compose to run a local ChatGPT-like application using Ollama, Ollama Web UI, Mistral NeMo & DeepSeek R1.☆28Aug 25, 2025Updated 9 months ago