Triton implementation of GPT/LLAMA
☆21Aug 28, 2024Updated last year
Alternatives and similar repositories for gpt-triton
Users that are interested in gpt-triton are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Iterate fast on your RAG pipelines☆24Jun 21, 2025Updated 11 months ago
- ☆14Jun 24, 2024Updated last year
- Backup of the sources for my SJPO Teaching Notes☆10Apr 15, 2019Updated 7 years ago
- ☆11Feb 22, 2025Updated last year
- Because it's there.☆16Sep 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Mamba support for transformer lens☆20Sep 17, 2024Updated last year
- llama2 inference engine in Rust☆13Apr 12, 2024Updated 2 years ago
- KV Cache & LoRA for minGPT☆63Mar 4, 2026Updated 3 months ago
- Database for International Physics Olympiads☆11Sep 22, 2025Updated 8 months ago
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases☆13Dec 2, 2024Updated last year
- CLIP is an open source, multimodal computer vision model and it's awesome!☆17Dec 16, 2024Updated last year
- This repository contains code for the paper "Better Estimation of the KL Divergence Between Language Models"☆19May 30, 2025Updated last year
- Repository for NLP project. Name to be changed when we decide on a project☆16Apr 19, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆13Jan 30, 2023Updated 3 years ago
- ☆15May 27, 2026Updated 2 weeks ago
- An SD upscale script made to work with an inpainting model. Supports tiling.☆11Mar 13, 2023Updated 3 years ago
- ☆21Mar 5, 2017Updated 9 years ago
- RBF Drivers for Blender☆10Oct 14, 2022Updated 3 years ago
- Minimal implementation of TokenFormer for inference and learning☆13Nov 6, 2024Updated last year
- ☆18Jan 20, 2025Updated last year
- A Model Context Protocol server that provides documentation access capabilities. This server enables LLMs to search and retrieve content …☆18Apr 29, 2025Updated last year
- Aroma of the Songs — Visualizing music in the form of intricate rose petals using moving cube traces.☆12Feb 5, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆37Aug 27, 2025Updated 9 months ago
- Repository for ACM India Summer School on Generative AI for Text☆13Jul 11, 2024Updated last year
- Simple and scalable tools for data-driven pretraining data selection.☆29Jun 9, 2025Updated last year
- Replication package for evaluation of code generation metrics☆17Nov 24, 2025Updated 6 months ago
- Cross-GPU KV Cache Marketplace☆22Nov 12, 2025Updated 7 months ago
- Self Reproduction Code of Paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention (MIT CSAIL)☆17May 24, 2024Updated 2 years ago
- ☆18Jan 4, 2024Updated 2 years ago
- DLiP course companion repository for practical 1☆16Jan 22, 2026Updated 4 months ago
- ☆15Aug 26, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Proof-of-Concept Private Server for an Anime Fleet Game☆23Mar 29, 2025Updated last year
- Writing FLUX in Triton☆42Sep 22, 2024Updated last year
- My assignments for CN course [CSE232] [IIIT-Delhi].☆13May 24, 2018Updated 8 years ago
- ☆22Jan 10, 2025Updated last year
- Kernel created for 15-410 Operating Systems class at Carnegie Mellon☆16Apr 22, 2016Updated 10 years ago
- Fake86 8086 PC Emulator☆17Jun 30, 2016Updated 9 years ago
- Centerface ONNX accelerated with Deepstream 5.1☆10Jun 13, 2021Updated 4 years ago