☆45 · May 4, 2025 · Updated last year
Alternatives and similar repositories for tiny-mixtral
Users that are interested in tiny-mixtral are comparing it to the libraries listed below.
- ☆16 · Jul 7, 2025 · Updated 10 months ago
- DoubleAI’s hyperoptimised version of cuGraph ☆52 · Mar 3, 2026 · Updated 2 months ago
- WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and … ☆16 · Apr 23, 2025 · Updated last year
- Training a BERT model from scratch. ☆11 · Oct 15, 2023 · Updated 2 years ago
- An efficient and scalable attention module designed to reduce memory usage and improve inference speed in large language models. Designe… ☆22 · Jun 25, 2025 · Updated 10 months ago
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT2 (~95M params). Fast, creative text generation tra… ☆17 · Apr 17, 2026 · Updated last month
- Will write CUDA for 100 days ☆39 · May 25, 2025 · Updated 11 months ago
- Repository of notes, code and notebooks in Python for the book "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew … ☆37 · Mar 6, 2026 · Updated 2 months ago
- ☆14 · Aug 31, 2022 · Updated 3 years ago
- idk ☆24 · Jun 7, 2025 · Updated 11 months ago
- PEP 503 repository index for jax[cuda] ☆21 · Jan 14, 2025 · Updated last year
- Text Normalization utilities for normalizing text for TTS ☆22 · Mar 4, 2026 · Updated 2 months ago
- ☆88 · Jan 24, 2026 · Updated 3 months ago
- A Bigram Language Model from scratch with no-smoothing and add-one smoothing. Outputs bigram counts, bigram probabilities and probability… ☆15 · Jan 12, 2021 · Updated 5 years ago
- ☆11 · Feb 22, 2025 · Updated last year
- [WIP] Transformer to embed Danbooru labelsets ☆13 · Mar 31, 2024 · Updated 2 years ago
- Matching algorithms for Graphs.jl ☆21 · May 8, 2026 · Updated 2 weeks ago
- ☆18 · Dec 17, 2024 · Updated last year
- ML algorithms implementations that are good for learning the underlying principles ☆28 · Dec 7, 2024 · Updated last year
- KV Cache & LoRA for minGPT ☆63 · Mar 4, 2026 · Updated 2 months ago
- Byte-Pair Encoding (BPE) (subword-based tokenization) algorithm implementations from scratch with Python ☆18 · Jan 30, 2023 · Updated 3 years ago
- ☆23 · Oct 17, 2024 · Updated last year
- Learn how Transformer models are implemented from scratch. ☆23 · Jun 3, 2024 · Updated last year
- CLIP is an open source, multimodal computer vision model and it's awesome! ☆17 · Dec 16, 2024 · Updated last year
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax" ☆93 · Sep 12, 2025 · Updated 8 months ago
- Repository for NLP project. Name to be changed when we decide on a project ☆16 · Apr 19, 2022 · Updated 4 years ago
- Official implementation of 'A Large-Scale Exploration of mu-Transfer' ☆32 · Jun 5, 2025 · Updated 11 months ago
- A light llama-like llm inference framework based on the triton kernel. ☆186 · Jan 5, 2026 · Updated 4 months ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge. ☆19 · Feb 9, 2026 · Updated 3 months ago
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to… ☆29 · Mar 22, 2026 · Updated 2 months ago
- LLM Embedding Sample App using Flask and PostgreSQL with pgvector extension. ☆15 · Aug 20, 2023 · Updated 2 years ago
- GEMM ☆10 · Aug 26, 2023 · Updated 2 years ago
- Training framework for Large Behavioral Models ☆28 · Sep 17, 2025 · Updated 8 months ago
- A straightforward method to reduce your LLM inference API costs and token usage. ☆24 · May 18, 2025 · Updated last year
- Minimal TPU implementation with 8x8 systolic array and PyTorch integration ☆61 · Jan 26, 2026 · Updated 3 months ago
- Implementation of BERT-based Language Models ☆27 · Mar 12, 2026 · Updated 2 months ago
- This repo is used for archiving my notes, codes and materials of cs learning. ☆86 · May 10, 2026 · Updated last week
- A Beginner's Guide to Monetizing Your Python AI Chatbot ☆16 · Apr 22, 2025 · Updated last year
- Load and run Llama from safetensors files in C ☆15 · Oct 24, 2024 · Updated last year