deepshard / mixtral-8x7b-InferenceLinks

Eh, simple and works.

☆27

Alternatives and similar repositories for mixtral-8x7b-Inference

Users that are interested in mixtral-8x7b-Inference are comparing it to the libraries listed below

Sorting:

vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆100Updated last year
geronimi73 / phi2-finetune
☆87Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Updated last year
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆66Updated 10 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 5 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 5 months ago
teknium1 / transformers-gptq-quant
☆47Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
teknium1 / ShareGPT-Builder
☆115Updated 6 months ago
notarussianteenager / srf-attention
Simplex Random Feature attention, in PyTorch
☆74Updated last year
NousResearch / StripedHyenaTrainer
☆61Updated last year
mzbac / mlx-moe
Scripts to create your own moe models using mlx
☆90Updated last year
austinsilveria / tricksy
Fast approximate inference on a single GPU with sparsity aware offloading
☆38Updated last year
vithursant / nanoGPT_mlx
Port of Andrej Karpathy's nanoGPT to Apple MLX framework.
☆110Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated 11 months ago
AK391 / dailypapersHN
☆86Updated 9 months ago
teknium1 / stanford_alpaca-replit
Modified Stanford-Alpaca Trainer for Training Replit's Code Model
☆41Updated 2 years ago
SebastianBodza / EnsembleForecasting
Using multiple LLMs for ensemble Forecasting
☆16Updated last year
stunningpixels / lou-eval
Track the progress of LLM context utilisation
☆55Updated 3 months ago
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33Updated last year
joey00072 / ohara
Collection of autoregressive model implementation
☆85Updated 2 months ago
allenai / CommonGen-Eval
Evaluating LLMs with CommonGen-Lite
☆90Updated last year
VikParuchuri / classified
Score LLM pretraining data with classifiers
☆55Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆101Updated 4 months ago
AtakanTekparmak / agento
Very minimal (and stateless) agent framework
☆44Updated 6 months ago
thooton / muse
Let's create synthetic textbooks together :)
☆75Updated last year
diicellman / dynamite-dogs
BH hackathon
☆14Updated last year
Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆101Updated 2 years ago