Triton implementation of GPT/LLAMA
☆21Aug 28, 2024Updated last year
Alternatives and similar repositories for gpt-triton
Users that are interested in gpt-triton are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast SGEMM emulation on Tensor Cores☆17Feb 16, 2025Updated last year
- ☆14Jun 24, 2024Updated last year
- Backup of the sources for my SJPO Teaching Notes☆10Apr 15, 2019Updated 7 years ago
- My fork os allen AI's OLMo for educational purposes.☆28Dec 5, 2024Updated last year
- Triton kernels for Flux☆23Jul 7, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- GpuUtils: A Simple Tool for GPU Analysis and Allocation☆15Apr 23, 2020Updated 6 years ago
- ☆11Feb 22, 2025Updated last year
- KV Cache & LoRA for minGPT☆62Mar 4, 2026Updated 2 months ago
- Mamba support for transformer lens☆20Sep 17, 2024Updated last year
- Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases☆13Dec 2, 2024Updated last year
- CLIP is an open source, multimodal computer vision model and it's awesome!☆17Dec 16, 2024Updated last year
- PyTorch code for ROLL, a knowledge-based video story question answering model.☆21Sep 29, 2020Updated 5 years ago
- A Model Context Protocol server that provides documentation access capabilities. This server enables LLMs to search and retrieve content …☆19Apr 29, 2025Updated last year
- Source code for some notes for the mathematical tripos.☆23Dec 23, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Simple and scalable tools for data-driven pretraining data selection.☆29Jun 9, 2025Updated 11 months ago
- Replication package for evaluation of code generation metrics☆17Nov 24, 2025Updated 5 months ago
- Cross-GPU KV Cache Marketplace☆22Nov 12, 2025Updated 5 months ago
- This comprehensive guide provides a universal process for preparing your own speech datasets and training a custom Text-to-Speech (TTS) m…☆27May 3, 2025Updated last year
- A hub for ResNet based models and pretrained weights in TensorFlow.☆20Aug 5, 2021Updated 4 years ago
- ☆15Aug 26, 2023Updated 2 years ago
- ☆20Aug 31, 2022Updated 3 years ago
- Make quick mechanical turk HTML/Javascript interfaces and launch them with Python functions☆41Jun 1, 2021Updated 4 years ago
- Provides nodes to assemble point clouds from either LaserScan or PointCloud messages☆40Aug 7, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Writing FLUX in Triton☆42Sep 22, 2024Updated last year
- A tiny BERT for low-resource monolingual models☆31Dec 24, 2025Updated 4 months ago
- Converting Mixtral-8x7B to Mixtral-[1~7]x7B☆22Mar 4, 2024Updated 2 years ago
- Centerface ONNX accelerated with Deepstream 5.1☆10Jun 13, 2021Updated 4 years ago
- 🔊Replicate Cog'ified MMAudio🎵☆18Jul 10, 2025Updated 10 months ago
- Phoshell: a Forth inspired, extremely lightweight, stack machine shell, implementable in _ALL_ known programming languages.☆10Nov 21, 2020Updated 5 years ago
- Custom loss functions to use in (mainly) PyTorch.☆39Oct 5, 2020Updated 5 years ago
- Shaping capabilities with token-level pretraining data filtering☆93Jan 28, 2026Updated 3 months ago
- ☆28Jul 11, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A discord bot to play with the KittyCAD Text to CAD API.☆15Apr 25, 2026Updated 2 weeks ago
- Multi-Modal Multi-Task (3MT) Road Segmentation, IEEE RA-L 2023☆15Feb 13, 2024Updated 2 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- A Web app demonstrating multimodal image search using Visualized-BGE model☆15Dec 1, 2024Updated last year
- Use AI to preview how garments look on you directly on product pages from Amazon and Coupang. Upload your photo, click "Try On," and see …☆22Apr 13, 2025Updated last year
- ☆46Sep 15, 2025Updated 7 months ago
- ☆26Jan 16, 2026Updated 3 months ago