The missing tiktoken training code
☆479Jan 3, 2026Updated 5 months ago
Alternatives and similar repositories for rustbpe
Users that are interested in rustbpe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Structure and Interpretation of Tensor Programs: The Hacker's Accelerated Introduction to Deep Learning and Deep Learning Systems☆76Updated this week
- Rusnel is a fast tcp/udp tunnel over QUIC☆13May 6, 2026Updated last month
- Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture. Training an MDM using GPT with this repo!☆36Jun 23, 2025Updated 11 months ago
- Prune transformer layers☆74May 30, 2024Updated 2 years ago
- root repo☆151Jul 25, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Analyzing Hacker News discussions from a decade ago in hindsight with LLMs☆647Dec 10, 2025Updated 6 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,220Aug 26, 2025Updated 9 months ago
- Documentation retrieval system to help LLMs navigate less-popular (yet often more powerful) Python libraries☆14May 13, 2024Updated 2 years ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆41Dec 21, 2025Updated 5 months ago
- A browser extension for organising Solana wallets☆17Jan 23, 2026Updated 4 months ago
- A local-first, terminal-based password manager built for people who care about security, simplicity, and control☆39Dec 31, 2025Updated 5 months ago
- Supabase Rust SDK☆30Oct 16, 2025Updated 8 months ago
- code for "EMS: 3D Eyebrow Modeling from Single-view Images"(SIGGRAPH Asia 2023)☆13May 3, 2025Updated last year
- Small GPU / CUDA stress bundle as a Docker image☆12Jan 6, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Tensorflow: Generalizing Across Domains via Cross-Gradient Training☆15May 11, 2018Updated 8 years ago
- ☆51Jun 1, 2026Updated 2 weeks ago
- Composable local validator setup for Solana☆22Nov 8, 2024Updated last year
- 🔢 Work with static vector models☆39Apr 21, 2025Updated last year
- Real-Time as a Service platform, built on Cloudflare for fast, cheap, and reliable infrastructure to power live apps and connected experi…☆19Mar 22, 2026Updated 2 months ago
- Project focused on enhancing the quality of low-fidelity endoscopy images using Generative Adversarial Networks (GANs) implemented in PyT…☆17Jun 5, 2025Updated last year
- ☆12Feb 18, 2025Updated last year
- ☆15Sep 13, 2020Updated 5 years ago
- MoE training for Me and You and maybe other people☆387Mar 15, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- CompChomper is a framework for measuring how LLMs perform at code completion.☆21Apr 29, 2025Updated last year
- Rust Implementation of micrograd☆52Jul 3, 2024Updated last year
- Proof of concept showing a single class being used to render instancing and batching geometry in a single draw call.☆14Jun 3, 2024Updated 2 years ago
- ☆11Oct 2, 2024Updated last year
- A C compiler, written in Rust.☆10Feb 13, 2022Updated 4 years ago
- 2023 ABCI Llama-2 継続学習プロジェクト☆14Jan 22, 2024Updated 2 years ago
- Utensil's LLM Playground (2023)☆10May 24, 2026Updated 3 weeks ago
- I introduce the basic idea and implementation of 5 imputation approaches. In short, filling with a single value works well for a shorter…☆12Jan 11, 2023Updated 3 years ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,573Jul 1, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Practical Explainable AI Using Python by Pradeepta Mishra☆13May 18, 2022Updated 4 years ago
- ☆10Jul 25, 2024Updated last year
- Core contracts related to the Moremoney protocol☆10Jul 14, 2023Updated 2 years ago
- A GPU accelerated Mandelbrot viewer made using the new WebGPU API.☆10Oct 26, 2023Updated 2 years ago
- ☆20Sep 10, 2025Updated 9 months ago
- ☆13Dec 28, 2022Updated 3 years ago
- An autoregressive character-level language model for making more things☆4,034Jun 4, 2024Updated 2 years ago