Triton implementation of GPT/LLAMA
☆22Aug 28, 2024Updated last year
Alternatives and similar repositories for gpt-triton
Users that are interested in gpt-triton are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Text Normalization utilities for normalizing text for TTS☆23Mar 4, 2026Updated 4 months ago
- Backup of the sources for my SJPO Teaching Notes☆10Apr 15, 2019Updated 7 years ago
- My fork os allen AI's OLMo for educational purposes.☆28Dec 5, 2024Updated last year
- Inference code for LLaMA models☆21Apr 3, 2025Updated last year
- GpuUtils: A Simple Tool for GPU Analysis and Allocation☆15Apr 23, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Because it's there.☆16Sep 22, 2024Updated last year
- Mamba support for transformer lens☆20Sep 17, 2024Updated last year
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- CLIP is an open source, multimodal computer vision model and it's awesome!☆17Dec 16, 2024Updated last year
- ☆17Jan 1, 2025Updated last year
- Repository for NLP project. Name to be changed when we decide on a project☆16Apr 19, 2022Updated 4 years ago
- ☆14Jul 14, 2018Updated 7 years ago
- RBF Drivers for Blender☆10Oct 14, 2022Updated 3 years ago
- Minimal implementation of TokenFormer for inference and learning☆13Nov 6, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆52Jun 9, 2026Updated 3 weeks ago
- A Model Context Protocol server that provides documentation access capabilities. This server enables LLMs to search and retrieve content …☆18Apr 29, 2025Updated last year
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆38Aug 27, 2025Updated 10 months ago
- Repository for ACM India Summer School on Generative AI for Text☆13Jul 11, 2024Updated last year
- Simple and scalable tools for data-driven pretraining data selection.☆29Jun 9, 2025Updated last year
- This comprehensive guide provides a universal process for preparing your own speech datasets and training a custom Text-to-Speech (TTS) m…☆34Jun 17, 2026Updated 2 weeks ago
- A Proof-of-Concept Private Server for an Anime Fleet Game☆23Mar 29, 2025Updated last year
- implement of NoProp-CT☆28May 2, 2025Updated last year
- Writing and Citation Assistant Tool☆39Dec 21, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Converting Mixtral-8x7B to Mixtral-[1~7]x7B☆22Mar 4, 2024Updated 2 years ago
- Winning solution of the Kaggle "Google Brain - Ventilator Pressure Prediction" competition☆10Nov 12, 2021Updated 4 years ago
- Centerface ONNX accelerated with Deepstream 5.1☆10Jun 13, 2021Updated 5 years ago
- 🔊Replicate Cog'ified MMAudio🎵☆18Jul 10, 2025Updated 11 months ago
- Phoshell: a Forth inspired, extremely lightweight, stack machine shell, implementable in _ALL_ known programming languages.☆10Nov 21, 2020Updated 5 years ago
- Approaching Clinical NER as a MRC problem☆11Apr 4, 2024Updated 2 years ago
- Shaping capabilities with token-level pretraining data filtering☆94Jan 28, 2026Updated 5 months ago
- Code for experiments done for EMNLP2020.☆11Dec 8, 2022Updated 3 years ago
- Pytorch code for experiments on Linear Transformers☆24Jan 12, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A programming language based on bindings.☆12Jul 6, 2025Updated 11 months ago
- A Web app demonstrating multimodal image search using Visualized-BGE model☆15Dec 1, 2024Updated last year
- ☆13Mar 22, 2023Updated 3 years ago
- A general purpose library for training any type of GPT model.☆12Jun 13, 2023Updated 3 years ago
- WIP☆96Aug 13, 2024Updated last year
- CUDA GPU Benchmark☆38Jan 31, 2025Updated last year
- A modern set of Pyomo tutorials☆43Mar 11, 2025Updated last year