vikhyat/mixtral-inference

Inference code for mixtral-8x7b-32kseqlen

☆105

Alternatives and similar repositories for mixtral-inference

Users who are interested in mixtral-inference are comparing it to the libraries listed below.

deepshard / mixtral-8x7b-Inference
Eh, simple and works.
☆27 • Dec 9, 2023 • Updated 2 years ago
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13 • Mar 31, 2024 • Updated 2 years ago
dzhulgakov / llama-mistral
Inference code for Mistral and Mixtral hacked up into original Llama implementation
☆367 • Dec 9, 2023 • Updated 2 years ago
sdan / selfextend
An implementation of Self-Extend, to expand the context window via grouped attention
☆119 • Jan 7, 2024 • Updated 2 years ago
scottlogic-alex / prm800k-denorm
Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27 • Jul 12, 2023 • Updated 3 years ago
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆71 • Nov 9, 2025 • Updated 8 months ago
camenduru / playground-colab
☆17 • Dec 5, 2023 • Updated 2 years ago
camenduru / Multi-LoRA-Composition-jupyter
☆13 • Feb 28, 2024 • Updated 2 years ago
hrkrshnn / ethccgame
☆13 • Sep 17, 2022 • Updated 3 years ago
camenduru / StreamDiffusion-colab
☆13 • Dec 22, 2023 • Updated 2 years ago
camenduru / mimic-motion-tost
☆23 • Oct 19, 2024 • Updated last year
camenduru / VisualStylePrompting-jupyter
☆13 • Mar 15, 2024 • Updated 2 years ago
jedisct1 / nonce-extension
Make AES-GCM safe to use with random nonces, for any practical number of messages.
☆19 • Sep 16, 2025 • Updated 10 months ago
altugbakan / altug-car
AltugCar on 0xMonaco
☆10 • Aug 22, 2022 • Updated 3 years ago
camenduru / FreeInit-colab
☆25 • Dec 21, 2023 • Updated 2 years ago
camenduru / LucidDreamer-colab
☆20 • Dec 19, 2023 • Updated 2 years ago
mistralai / megablocks-public
☆865 • Dec 8, 2023 • Updated 2 years ago
joey00072 / microjax
JAX-like function transformation engine, but micro: microjax
☆34 • Oct 25, 2024 • Updated last year
MF-FOOM / wikivec2text
Simple embedding -> text model trained on a small subset of Wikipedia sentences.
☆158 • Aug 5, 2023 • Updated 2 years ago
yacineMTB / just-large-models
Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.
☆44 • Sep 6, 2023 • Updated 2 years ago
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆92 • Sep 22, 2025 • Updated 10 months ago
Figura-Labs-Inc / telegraf_nv_export
Ultra-low-overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.
☆63 • Jul 8, 2024 • Updated 2 years ago
abacaj / openhermes-function-calling
☆133 • Nov 24, 2023 • Updated 2 years ago
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca that aims to be the trainer for all large language models
☆69 • Aug 27, 2023 • Updated 2 years ago
AblateIt / finetune-study
Comprehensive analysis of the differences in performance between QLoRA, LoRA, and full finetunes.
☆82 • Sep 10, 2023 • Updated 2 years ago
clabby / mips-huff
A rewrite of Optimism's MIPS.sol thread context in Huff
☆26 • Aug 6, 2023 • Updated 2 years ago
camenduru / StoryDiffusion-replicate
☆24 • May 5, 2024 • Updated 2 years ago
DS3Lab / CocktailSGD
☆27 • Aug 25, 2023 • Updated 2 years ago
sabetAI / BLoRA
Batched LoRAs
☆350 • Sep 6, 2023 • Updated 2 years ago
camenduru / marigold-lcm-hf
☆12 • Mar 25, 2024 • Updated 2 years ago
ph-ausseil / llm-training-dataset-builder
Streamlines the creation of datasets to train a large language model with instruction-input-output triplets. The default configuration …
☆13 • Apr 17, 2023 • Updated 3 years ago
abdo-eldesokey / latentman
This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]
☆22 • Jul 21, 2024 • Updated 2 years ago
amotile / stable-diffusion-workshop
React frontend for Stable Diffusion
☆28 • Sep 23, 2022 • Updated 3 years ago
mmarcato / dog_posture
Posture Recognition ML algorithms
☆12 • Nov 4, 2022 • Updated 3 years ago
VikParuchuri / textbook_quality
Generate textbook-quality synthetic LLM pretraining data
☆508 • Oct 19, 2023 • Updated 2 years ago
jquesnelle / crt-terminal
Retro-styled terminal shell
☆26 • May 8, 2024 • Updated 2 years ago
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning a RoPE model on sequences longer than those seen in pre-training adapts the model's context limit
☆62 • Jun 21, 2023 • Updated 3 years ago
databricks / megablocks
☆1,582 • Mar 25, 2026 • Updated 4 months ago
refcell / op-challenger
A multi-mode op-stack challenge agent for dispute games, written in Go.
☆30 • Apr 6, 2023 • Updated 3 years ago