deepshard / mixtral-8x7b-Inference
Eh, simple and works.
☆27Updated last year
Alternatives and similar repositories for mixtral-8x7b-Inference:
Users that are interested in mixtral-8x7b-Inference are comparing it to the libraries listed below
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated last year
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- ☆108Updated last month
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- Scripts to create your own moe models using mlx☆85Updated 10 months ago
- ☆87Updated 11 months ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated 4 months ago
- LLM reads a paper and produce a working prototype☆46Updated 3 weeks ago
- Simplex Random Feature attention, in PyTorch☆72Updated last year
- Routing on Random Forest (RoRF)☆98Updated 3 months ago
- ☆48Updated last year
- An automated tool for discovering insights from research papaer corpora☆136Updated 7 months ago
- RAFT, or Retrieval-Augmented Fine-Tuning, is a method comprising of a fine-tuning and a RAG-based retrieval phase. It is particularly sui…☆83Updated 4 months ago
- ☆41Updated 7 months ago
- auto fine tune of models with synthetic data☆74Updated 11 months ago
- NanoGPT (124M) quality in 2.67B tokens☆26Updated this week
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated last year
- ☆96Updated 4 months ago
- Just a bunch of benchmark logs for different LLMs☆116Updated 5 months ago
- Full finetuning of large language models without large memory requirements☆93Updated last year
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- ☆65Updated 7 months ago
- ☆37Updated 5 months ago
- tickr-agent is an enterprise-ready, scalable Python library for building swarms of financial agents that conduct comprehensive stock anal…☆37Updated last week
- ☆38Updated 10 months ago
- ☆26Updated 10 months ago
- Video+code lecture on building nanoGPT from scratch☆65Updated 7 months ago
- Let's create synthetic textbooks together :)☆73Updated 11 months ago