udaymallappa / ECE277-GPU-WI21Links
UCSD ECE277 GPU Programming coursework: GPU-accelerated reinforcement learning on CUDA C with Nsight System
☆11Updated 4 years ago
Alternatives and similar repositories for ECE277-GPU-WI21
Users that are interested in ECE277-GPU-WI21 are comparing it to the libraries listed below
Sorting:
- ☆17Updated 3 months ago
- MLIR+EqSat☆21Updated 3 months ago
- SmoothE: Differentiable E-Graph Extraction (ASPLOS'25 Best Paper)☆24Updated this week
- ☆22Updated last year
- ☆126Updated last month
- Benchmark Framework for Buddy Projects☆55Updated last month
- Driving Snax with MLIR☆16Updated this week
- tutorials about polyhedral compilation.☆56Updated last month
- An out-of-tree MLIR dialect template.☆110Updated last year
- ☆17Updated last month
- ☆13Updated 2 years ago
- An MLIR dialect to enable the efficient acceleration of ML model on CGRAs.☆64Updated last year
- Tutorial on building a gpu compiler backend in LLVM☆49Updated 10 months ago
- A curated list of research papers, datasets, and tools for applying machine learning/Deep learning techniques to compilers and program op…☆116Updated 2 years ago
- Simulator code of the paper "Dissecting and Modeling the Architecture of Modern GPU Cores"☆41Updated last month
- Optimizing scheduler. Combinatorial instruction scheduling project.☆28Updated 3 weeks ago
- ☆33Updated 3 years ago
- Heron: Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators☆23Updated last year
- ☆11Updated 2 years ago
- ☆40Updated last month
- TileFlow is a performance analysis tool based on Timeloop for fusion dataflows☆63Updated last year
- This repo contains the Assignments from Cornell Tech's ECE 5545 - Machine Learning Hardware and Systems offered in Spring 2023☆40Updated 2 years ago
- This repository contains companion software for the Colfax Research paper "Categorical Foundations for CuTe Layouts".☆80Updated 2 months ago
- [DAC2024] A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning☆15Updated last year
- ARIES: An Agile MLIR-Based Compilation Flow for Reconfigurable Devices with AI Engines (FPGA 2025 Best Paper Nominee)☆51Updated this week
- ☆118Updated last week
- Asynchronous semantics for architectural simulation and synthesis.☆56Updated this week
- MLIR Sample dialect☆131Updated 9 months ago
- Data-Centric MLIR dialect☆43Updated 2 years ago
- ☆23Updated 7 months ago