cornell-zhang/llm-datatypes

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cornell-zhang/llm-datatypes)

cornell-zhang / llm-datatypes

Codebase for ICML'24 paper: Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs

☆27

Alternatives and similar repositories for llm-datatypes

Users that are interested in llm-datatypes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cornell-zhang / preprop-gnn
View on GitHub
Graph Learning at Scale: Characterizing and Optimizing Pre-Propagation GNNs (MLSys'25)
☆19Apr 4, 2025Updated last year
cornell-zhang / UniSparse
View on GitHub
UniSparse: An Intermediate Language for General Sparse Format Customization (OOPSLA'24)
☆34Nov 12, 2024Updated last year
cornell-zhang / SmoothE
View on GitHub
SmoothE: Differentiable E-Graph Extraction (ASPLOS'25 Best Paper)
☆34Jan 15, 2026Updated 6 months ago
cornell-zhang / HOGA
View on GitHub
Hop-Wise Graph Attention for Scalable and Generalizable Learning on Circuits
☆35Aug 25, 2024Updated last year
cornell-zhang / GARNET
View on GitHub
GARNET: Reduced-Rank Topology Learning for Robust and Scalable Graph Neural Networks
☆36Oct 1, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
cornell-zhang / GLAIVE
View on GitHub
Graph-learning assisted instruction vulnerability estimation published in DATE 2020
☆14Dec 6, 2020Updated 5 years ago
cornell-zhang / eqmap
View on GitHub
Using e-graphs for logic synthesis (ICCAD'25)
☆35Jul 13, 2026Updated last week
cornell-zhang / uptune
View on GitHub
A Generic Distributed Auto-Tuning Infrastructure
☆24Jul 29, 2021Updated 4 years ago
cornell-zhang / SPADE
View on GitHub
☆11Oct 28, 2021Updated 4 years ago
cornell-zhang / Polynormer
View on GitHub
Polynormer: Polynomial-Expressive Graph Transformer in Linear Time (ICLR'24)
☆44Apr 6, 2024Updated 2 years ago
cornell-zhang / quickest
View on GitHub
QuickEst repository: Quick Estimation of Quality of Results
☆27Oct 23, 2018Updated 7 years ago
cornell-zhang / dnn-gating
View on GitHub
Conditional channel- and precision-pruning on neural networks
☆71Mar 4, 2020Updated 6 years ago
cornell-zhang / dnn-quant-ocs
View on GitHub
DNN quantization with outlier channel splitting (ICML'19)
☆114Mar 21, 2020Updated 6 years ago
cornell-zhang / GraphZoom
View on GitHub
GraphZoom: A Multi-level Spectral Approach for Accurate and Scalable Graph Embedding (ICLR'20 Oral)
☆113Mar 24, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
abdelfattah-lab / BRAMAC
View on GitHub
☆10Nov 27, 2024Updated last year
cornell-zhang / facedetect-fpga
View on GitHub
☆45Mar 21, 2020Updated 6 years ago
cornell-zhang / heurigym
View on GitHub
Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization (ICLR'26)
☆91Apr 25, 2026Updated 2 months ago
HuangOwen / RoLoRA
View on GitHub
[EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
☆40Sep 24, 2024Updated last year
cornell-zhang / HiSparse
View on GitHub
High-Performance Sparse Linear Algebra on HBM-Equipped FPGAs Using HLS
☆102Sep 27, 2024Updated last year
chhzh123 / ptc-tutorial
View on GitHub
PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo
☆17Mar 13, 2023Updated 3 years ago
cornell-zhang / rosetta
View on GitHub
Rosetta: A Realistic High-level Synthesis Benchmark Suite for Software Programmable FPGAs (FPGA'18)
☆173Nov 7, 2023Updated 2 years ago
icgrp / hipr
View on GitHub
☆17Feb 3, 2023Updated 3 years ago
cornell-zhang / GraphLily
View on GitHub
A graph linear algebra overlay
☆52Apr 26, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
heterosys / mlir-vitis
View on GitHub
📥 🎯 (1,4/4) an MLIR-based toolchain with Vitis HLS LLVM input/output targeting FPGAs.
☆15Nov 15, 2022Updated 3 years ago
cornell-zhang / heterocl
View on GitHub
HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing (FPGA'19 Best Paper)
☆338Apr 20, 2024Updated 2 years ago
AIS-SNU / GraNNDis_Artifact
View on GitHub
[PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…
☆10Aug 13, 2024Updated last year
cornell-zhang / FracBNN
View on GitHub
FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations
☆99Oct 2, 2021Updated 4 years ago
enyac-group / evol-q
View on GitHub
Quantization in the Jagged Loss Landscape of Vision Transformers
☆13Oct 22, 2023Updated 2 years ago
sIncerass / QBERT
View on GitHub
☆15Oct 26, 2022Updated 3 years ago
cornell-zhang / datuner
View on GitHub
DATuner Repository
☆17Sep 9, 2018Updated 7 years ago
ruikangliu / Quantized-Reasoning-Models
View on GitHub
[COLM 2025] Official PyTorch implementation of "Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models"
☆77Jul 8, 2025Updated last year
hongsunjang / pipe-bd
View on GitHub
[DATE 2023] Pipe-BD: Pipelined Parallel Blockwise Distillation
☆12Jul 13, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
UCLA-VAST / heterohalide
View on GitHub
HeteroHalide: From Image Processing DSL to Efficient FPGA Acceleration
☆15Sep 14, 2020Updated 5 years ago
cucapra / pollen
View on GitHub
generating hardware accelerators for pangenomic graph queries
☆45Jul 1, 2026Updated 3 weeks ago
HabanaAI / Megatron-DeepSpeed
View on GitHub
Intel Gaudi's Megatron DeepSpeed Large Language Models for training
☆18Dec 19, 2024Updated last year
Yu-Maryland / Differentiable_Scheduler_ICML24
View on GitHub
Differentiable Combinatorial Scheduling at Scale (ICML'24). Mingju Liu, Yingjie Li, Jiaqi Yin, Zhiru Zhang, Cunxi Yu.
☆22Oct 31, 2024Updated last year
cornell-zhang / allo
View on GitHub
Allo Accelerator Design and Programming Framework (PLDI'24)
☆392Updated this week
ucb-bar / dosa
View on GitHub
DOSA: Differentiable Model-Based One-Loop Search for DNN Accelerators
☆19Oct 10, 2024Updated last year
Zhen-Dong / CoDeNet
View on GitHub
[FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi…
☆28Feb 7, 2023Updated 3 years ago