☆52Mar 14, 2025Updated last year
Alternatives and similar repositories for cse234-w25-PA
Users that are interested in cse234-w25-PA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Website for CSE 234, Winter 2025☆16Mar 24, 2025Updated last year
- Code for "What really matters in matrix-whitening optimizers?"☆24Oct 31, 2025Updated 7 months ago
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆133Jun 24, 2025Updated last year
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp…☆21Mar 7, 2024Updated 2 years ago
- Expert Specialization MoE Solution based on CUTLASS☆27Apr 14, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆13Jan 7, 2025Updated last year
- ☆360Jun 15, 2026Updated 2 weeks ago
- Problems from IOITC'16 (India)☆10Jan 12, 2022Updated 4 years ago
- The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221☆31Apr 22, 2025Updated last year
- Wave: Python Domain-Specific Language for High Performance Machine Learning☆58Jun 8, 2026Updated 3 weeks ago
- cc98爬虫☆15Sep 1, 2013Updated 12 years ago
- C++ library for finding Strongly Connected Components in parallel, based on paper: https://dl.acm.org/citation.cfm?id=2851161☆12May 22, 2018Updated 8 years ago
- ☆36Jan 22, 2024Updated 2 years ago
- ☆20Dec 24, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Point-to-Hyperplane NNS Beyond the Unit Hypersphere (SIGMOD 2021)☆13Sep 5, 2021Updated 4 years ago
- SAC: A Co-Design Cache Algorithm for Emerging SMR-based High-Density Disks☆13Jan 13, 2020Updated 6 years ago
- Sample Codes using NVSHMEM on Multi-GPU☆30Jan 22, 2023Updated 3 years ago
- Asynchronous pipeline parallel optimization☆22Feb 2, 2026Updated 4 months ago
- Awesome code, projects, books, etc. related to CUDA☆38Jun 2, 2026Updated 3 weeks ago
- This is the official implementation for paper "On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond".☆22Nov 17, 2025Updated 7 months ago
- [NeurIPS 23] Characterizing OOD Error via Optimal Transport☆13Nov 19, 2023Updated 2 years ago
- ☆117Feb 25, 2025Updated last year
- The source code for PM-LSH (PVLDB 2020)☆13Nov 30, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- TopoTrans: Optimal Transport meets Topological Data Analysis☆14Apr 20, 2023Updated 3 years ago
- Implementation of Minimum Spanning Trees on Apache Spark.☆10May 25, 2015Updated 11 years ago
- Code and results accompanying our paper titled Leveraging Unlabeled Data to Predict Out-of-Distribution Performance at ICLR 2022☆10Dec 8, 2022Updated 3 years ago
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…☆14May 14, 2024Updated 2 years ago
- LSH Scheme based on Longest Circular Co-Substring (SIGMOD 2020)☆14Jul 8, 2021Updated 4 years ago
- Stick-breaking attention☆63Jul 1, 2025Updated 11 months ago
- ☆18Jun 4, 2025Updated last year
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Feb 25, 2025Updated last year
- [ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning☆150Feb 25, 2026Updated 4 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆80Nov 26, 2024Updated last year
- Tigon: A Distributed Database for a CXL Pod [OSDI '25]☆50Nov 25, 2025Updated 7 months ago
- ☆14Aug 16, 2023Updated 2 years ago
- Nex Venus Communication Library☆75Nov 17, 2025Updated 7 months ago
- ☆17Oct 13, 2022Updated 3 years ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆28Jun 20, 2026Updated last week
- NVIDIA cuTile learn☆168Dec 9, 2025Updated 6 months ago