☆52Mar 14, 2025Updated last year
Alternatives and similar repositories for cse234-w25-PA
Users that are interested in cse234-w25-PA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Website for CSE 234, Winter 2025☆15Mar 24, 2025Updated last year
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 5 months ago
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)☆30Jan 22, 2026Updated 4 months ago
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆134Jun 24, 2025Updated 11 months ago
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp…☆21Mar 7, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Expert Specialization MoE Solution based on CUTLASS☆27Apr 14, 2026Updated last month
- ☆13Jan 7, 2025Updated last year
- ECE408 (Applied Parallel Programming) Fall 2022 MP☆20Mar 24, 2023Updated 3 years ago
- The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221☆31Apr 22, 2025Updated last year
- Open-source toolkit for training, Priming, and serving next generation Hybrid architectures☆71May 9, 2026Updated last month
- Wave: Python Domain-Specific Language for High Performance Machine Learning☆57Jun 2, 2026Updated last week
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.☆125Dec 29, 2025Updated 5 months ago
- ☆36Jan 22, 2024Updated 2 years ago
- ☆20Dec 24, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Point-to-Hyperplane NNS Beyond the Unit Hypersphere (SIGMOD 2021)☆13Sep 5, 2021Updated 4 years ago
- SAC: A Co-Design Cache Algorithm for Emerging SMR-based High-Density Disks☆13Jan 13, 2020Updated 6 years ago
- Asynchronous pipeline parallel optimization☆22Feb 2, 2026Updated 4 months ago
- Awesome code, projects, books, etc. related to CUDA☆36Jun 2, 2026Updated last week
- This is the official implementation for paper "On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond".☆22Nov 17, 2025Updated 6 months ago
- [NeurIPS 23] Characterizing OOD Error via Optimal Transport☆13Nov 19, 2023Updated 2 years ago
- ☆116Feb 25, 2025Updated last year
- TopoTrans: Optimal Transport meets Topological Data Analysis☆14Apr 20, 2023Updated 3 years ago
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…☆14May 14, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- LSH Scheme based on Longest Circular Co-Substring (SIGMOD 2020)☆14Jul 8, 2021Updated 4 years ago
- Learning-based Approximate k-NN Search in Graph Databases☆11Nov 29, 2021Updated 4 years ago
- BC-Tree and Ball-Tree for Point-to-Hyperplane NNS (ICDE 2023)☆17Aug 4, 2023Updated 2 years ago
- A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing☆23Dec 9, 2023Updated 2 years ago
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Feb 25, 2025Updated last year
- ☆80Nov 26, 2024Updated last year
- Game engine for website version avalon card-board game☆16May 30, 2026Updated last week
- ☆22Jul 4, 2025Updated 11 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Aug 16, 2023Updated 2 years ago
- Nex Venus Communication Library☆75Nov 17, 2025Updated 6 months ago
- Exploitability calculation for imperfect-information game benchmarks☆35Apr 5, 2025Updated last year
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆24Updated this week
- NVIDIA cuTile learn☆168Dec 9, 2025Updated 5 months ago
- Share your GPU without MIG or MPS☆50Jan 27, 2026Updated 4 months ago
- ☆10Nov 18, 2024Updated last year