☆44May 4, 2025Updated last year
Alternatives and similar repositories for tiny-mixtral
Users that are interested in tiny-mixtral are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DoubleAI’s hyperoptimised version of cuGraph☆60Mar 3, 2026Updated 4 months ago
- ☆22Aug 21, 2025Updated 10 months ago
- WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and …☆16Apr 23, 2025Updated last year
- Training a BERT model from scratch.☆11Oct 15, 2023Updated 2 years ago
- Will write CUDA for 100 days☆39May 25, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Text Normalization utilities for normalizing text for TTS☆23Mar 4, 2026Updated 4 months ago
- ☆87Jan 24, 2026Updated 5 months ago
- Matching algorithms for Graphs.jl☆20May 8, 2026Updated last month
- ☆18Dec 17, 2024Updated last year
- See vLLM official support: https://github.com/vllm-project/vllm-ascend☆11Feb 5, 2025Updated last year
- ML algorithms implementations that are good for learning the underlying principles☆28Dec 7, 2024Updated last year
- KV Cache & LoRA for minGPT☆61Mar 4, 2026Updated 4 months ago
- Content Based Recommendation system uses attributes of the content to recommend similar content. It doesn't have a cold-start problem bec…☆18May 2, 2022Updated 4 years ago
- key/value store for Python based on Cloudflare workers☆33Jun 13, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆26Jun 8, 2026Updated 3 weeks ago
- Effective transpose on Hopper GPU☆29Sep 6, 2025Updated 9 months ago
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"☆92Sep 12, 2025Updated 9 months ago
- Repository for NLP project. Name to be changed when we decide on a project☆16Apr 19, 2022Updated 4 years ago
- Official implementation of 'A Large-Scale Exploration of mu-Transfer'☆32Jun 5, 2025Updated last year
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 4 months ago
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆29Mar 22, 2026Updated 3 months ago
- GEMM☆10Aug 26, 2023Updated 2 years ago
- Training framework for Large Behavioral Models☆28Sep 17, 2025Updated 9 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A straightforward method to reduce your LLM inference API costs and token usage.☆24May 18, 2025Updated last year
- Minimal TPU implementation with 8x8 systolic array and PyTorch integration☆63Jan 26, 2026Updated 5 months ago
- This repo is used for archiving my notes, codes and materials of cs learning.☆92Updated this week
- Email tracker that i will be using to track email that i send for MindKeeper AI and in general☆11Jan 17, 2025Updated last year
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 10 months ago
- amdgpu example code in hip/asm☆66Jun 3, 2026Updated last month
- Wave Partial Differential Equation Solver in Python☆14Jun 5, 2024Updated 2 years ago
- ☆11May 16, 2026Updated last month
- ☆11May 12, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Gaussian processes with spherical harmonic features in JAX☆16Aug 24, 2025Updated 10 months ago
- Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O☆590Sep 13, 2025Updated 9 months ago
- Public Repo for the paper "Overcoming The Spectral-Bias of Neural Value Approximation"☆11May 25, 2024Updated 2 years ago
- Stochastic trace estimation using JAX☆19Aug 20, 2025Updated 10 months ago
- A comprehensive hands-on project for learning GPU programming with CUDA and HIP, covering fundamental concepts through advanced optimizat…☆37Nov 20, 2025Updated 7 months ago
- A curation of awesome portfolio website ideas for developers and designers to draw inspiration from. Raise a pull request to add more. 💜…☆17Apr 15, 2025Updated last year
- This comprehensive guide provides a universal process for preparing your own speech datasets and training a custom Text-to-Speech (TTS) m…☆34Jun 17, 2026Updated 2 weeks ago