dzhulgakov / llama-mistralLinks

Inference code for Mistral and Mixtral hacked up into original Llama implementation

☆371

Alternatives and similar repositories for llama-mistral

Users that are interested in llama-mistral are comparing it to the libraries listed below

Sorting:

jondurbin / bagel
A bagel, with everything.
☆323Updated last year
SkunkworksAI / hydra-moe
☆416Updated last year
abacusai / Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…
☆591Updated last year
sabetAI / BLoRA
batched loras
☆344Updated last year
apoorvumang / prompt-lookup-decoding
☆556Updated 11 months ago
DachengLi1 / LongChat
Official repository for LongChat and LongEval
☆524Updated last year
mistralai / megablocks-public
☆864Updated last year
epfml / landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
☆423Updated last year
nlpxucan / evol-instruct
☆270Updated 2 years ago
VikParuchuri / textbook_quality
Generate textbook-quality synthetic LLM pretraining data
☆502Updated last year
tomaarsen / attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
☆702Updated last year
yxuansu / OpenAlpaca
OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA
☆302Updated 2 years ago
Gryphe / BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
☆206Updated 11 months ago
nexusflowai / NexusRaven
NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…
☆316Updated last year
persimmon-ai-labs / adept-inference
Inference code for Persimmon-8B
☆415Updated last year
LLM360 / amber-train
Pre-training code for Amber 7B LLM
☆167Updated last year
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆628Updated last year
lm-sys / llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
☆306Updated last year
hydrallm / llama-moe-v1
☆95Updated 2 years ago
OpenLemur / Lemur
[ICLR 2024] Lemur: Open Foundation Models for Language Agents
☆553Updated last year
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆78Updated last year
open-compass / MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
☆767Updated last year
XuezheMax / megalodon
Reference implementation of Megalodon 7B model
☆524Updated 2 months ago
Vahe1994 / SpQR
☆546Updated 7 months ago
jzhang38 / EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
☆739Updated 10 months ago
conceptofmind / toolformer
☆366Updated 2 years ago
FranxYao / Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
☆467Updated last year
TencentARC / LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
☆507Updated last year
datamllab / LongLM
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
☆660Updated last year