Experimental GPU language with meta-programming
☆27Sep 6, 2024Updated last year
Alternatives and similar repositories for opal_ptx
Users that are interested in opal_ptx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Apr 24, 2023Updated 2 years ago
- FLOPS counter for all your GPU benchmarking needs☆13Aug 8, 2024Updated last year
- ☆21Mar 3, 2025Updated last year
- A basic pure pytorch implementation of flash attention☆16Oct 28, 2024Updated last year
- GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU☆24Mar 27, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 2 months ago
- Website for CSE 234, Winter 2025☆13Mar 24, 2025Updated last year
- ☆24Dec 11, 2024Updated last year
- [WIP] A federated chat thing☆14Mar 24, 2023Updated 3 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- ☆34May 14, 2025Updated 10 months ago
- ☆65Apr 26, 2025Updated 11 months ago
- Curating the best Libra project and resource☆11Jul 8, 2019Updated 6 years ago
- new optimizer☆20Aug 4, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆63Apr 14, 2024Updated last year
- ☆34Sep 10, 2024Updated last year
- Ultra fast 3D reconstruction and novel view synthesis.☆76Oct 30, 2024Updated last year
- Derive macro for generating arrays from struct fields.☆20Oct 6, 2022Updated 3 years ago
- Implementation of various equivariant models in JAX☆12Apr 12, 2024Updated last year
- ☆12Jul 28, 2022Updated 3 years ago
- PaiNN in jax☆11Jan 14, 2025Updated last year
- ☆48Jan 18, 2024Updated 2 years ago
- ☆12Dec 1, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- It's a baby compiler. (Lean btw.)☆16May 19, 2025Updated 10 months ago
- ☆12Jan 4, 2024Updated 2 years ago
- ☆38Jul 16, 2025Updated 8 months ago
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)☆869Mar 9, 2026Updated 2 weeks ago
- Cayley hashing as in "Navigating in the Cayley Graph of SL₂(𝔽ₚ)"☆21Jul 28, 2023Updated 2 years ago
- ☆27Oct 23, 2025Updated 5 months ago
- Training hybrid models for dummies.☆29Nov 1, 2025Updated 4 months ago
- SFC: Shared Feature Calibration in Weakly Supervised Semantic Segmentation (AAAI24)☆25Jul 2, 2024Updated last year
- devector and batch_deque containers for C++. See more at: http://erenon.hu/double_ended☆15Oct 7, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Simple starter CMake project that uses NVBench.☆16May 6, 2025Updated 10 months ago
- Efficient non-uniform quantization with GPTQ for GGUF☆63Sep 17, 2025Updated 6 months ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆34Aug 14, 2024Updated last year
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100☆12Dec 30, 2023Updated 2 years ago
- Solutions to the Matasano Crypto Challenges☆16Jun 1, 2021Updated 4 years ago
- ☆13Jun 2, 2024Updated last year
- Tensor Parallelism with JAX + Shard Map☆11Sep 29, 2023Updated 2 years ago