Examples for MS-AMP package.
☆30Jul 17, 2025Updated 7 months ago
Alternatives and similar repositories for MS-AMP-Examples
Users that are interested in MS-AMP-Examples are comparing it to the libraries listed below
- Microsoft Automatic Mixed Precision Library☆636Dec 1, 2025Updated 2 months ago
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- Open deep learning compiler stack for CPU, GPU, and specialized accelerators☆19Updated this week
- ☆38Aug 7, 2025Updated 6 months ago
- Source code for the paper "LongGenBench: Long-context Generation Benchmark"☆24Oct 8, 2024Updated last year
- End to End steps for adding custom ops in PyTorch.☆24Aug 20, 2020Updated 5 years ago
- A hackable, simple, and research-friendly GRPO Training Framework with high-speed weight synchronization in a multinode environment.☆37Aug 27, 2025Updated 6 months ago
- Compression for Foundation Models☆35Jul 21, 2025Updated 7 months ago
- ☆42Nov 1, 2025Updated 4 months ago
- ☆25Jun 24, 2021Updated 4 years ago
- DeeperGEMM: crazy optimized version☆74May 5, 2025Updated 9 months ago
- Hackable and optimized Transformers building blocks, supporting a composable construction.☆34Updated this week
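- Ball-and-plate control system / rolling-ball system / BallPlate: Problem B of the 2017 National Undergraduate Electronic Design Contest, national second-prize entry☆10May 27, 2024Updated last year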
- [WIP] Better (FP8) attention for Hopper☆32Feb 24, 2025Updated last year
- Transformers components but in Triton☆34May 9, 2025Updated 9 months ago
- ☆87Updated this week
- A Simple Adaptive Unfolding Network for Hyperspectral Image Reconstruction☆32Feb 1, 2023Updated 3 years ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆15Nov 22, 2025Updated 3 months ago
- This repository contains the experimental PyTorch native float8 training UX☆226Aug 1, 2024Updated last year
- ☆40Feb 28, 2020Updated 6 years ago
- An artificial matrix generator in C☆12Feb 16, 2023Updated 3 years ago
- MATLAB function to fill an area with hatching ~~or speckling~~☆11Mar 4, 2018Updated 7 years ago
- ☆14Apr 14, 2025Updated 10 months ago
- A CSS3 Overlay system for modal dialogs.☆66Dec 16, 2010Updated 15 years ago
- ☆12Apr 14, 2025Updated 10 months ago
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆32Nov 11, 2025Updated 3 months ago
- BERT Sentiment Classification on the IMDb Large Movie Review Dataset.☆16Sep 8, 2022Updated 3 years ago
- Code for the paper "Faster Neural Network Training with Approximate Tensor Operations"☆10Oct 23, 2021Updated 4 years ago
- QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning☆168Nov 11, 2025Updated 3 months ago
- GPTQ inference TVM kernel☆40Apr 25, 2024Updated last year
- Causal Reasoning for Membership Inference Attacks☆11Oct 21, 2022Updated 3 years ago
- A python implementation of the neural network joint language model and an extension of it using global source context.☆11May 17, 2017Updated 8 years ago
- PolyLib official git.☆11Jan 27, 2026Updated last month
- Advanced Formal Language Theory (263-5352-00L; Frühjahr 2023)☆10Feb 21, 2023Updated 3 years ago
- This simulator models multi core systems, intended primarily for studies on main memory management techniques. It models a trace-based ou…☆12Jan 18, 2016Updated 10 years ago
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 6 months ago
- ☆11Apr 3, 2023Updated 2 years ago
- This repository is outdated and the related functionality has been migrated to https://github.com/easysoc/easysoc-firrtl☆11Nov 3, 2021Updated 4 years ago
- Musings in GEMM (General Matrix Multiplication)☆14Dec 14, 2025Updated 2 months ago