redotvideo / mamba-chatLinks

Mamba-Chat: A chat LLM based on the state-space model architecture 🐍

☆937

Alternatives and similar repositories for mamba-chat

Users that are interested in mamba-chat are comparing it to the libraries listed below

Sorting:

microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆932Updated 2 weeks ago
lucidrains / self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
☆1,406Updated last year
datamllab / LongLM
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
☆662Updated last year
jiaweizzhao / GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
☆1,630Updated last year
HazyResearch / m2
Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
☆560Updated 11 months ago
jquesnelle / yarn
YaRN: Efficient Context Window Extension of Large Language Models
☆1,643Updated last year
tomaarsen / attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
☆732Updated last year
ezelikman / quiet-star
Code for Quiet-STaR
☆743Updated last year
mistralai / megablocks-public
☆863Updated last year
XueFuzhao / OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
☆1,640Updated last year
pbelcak / UltraFastBERT
The repository for the code of the UltraFastBERT paper
☆520Updated last year
pratyushasharma / laser
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
☆389Updated last year
abacaj / fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
☆717Updated 2 years ago
princeton-nlp / MeZO
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
☆1,136Updated last year
SkunkworksAI / hydra-moe
☆415Updated 2 years ago
uclaml / SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
☆1,222Updated last year
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆630Updated last year
hao-ai-lab / LookaheadDecoding
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
☆1,307Updated 9 months ago
xfactlab / orpo
Official repository for ORPO
☆467Updated last year
kyegomez / LongNet
Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
☆715Updated last year
dingo-actual / infini-transformer
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention…
☆294Updated last year
XuezheMax / megalodon
Reference implementation of Megalodon 7B model
☆525Updated 6 months ago
trotsky1997 / MathBlackBox
☆1,035Updated 11 months ago
persimmon-ai-labs / adept-inference
Inference code for Persimmon-8B
☆412Updated 2 years ago
stanfordnlp / pyreft
Stanford NLP Python library for Representation Finetuning (ReFT)
☆1,538Updated 9 months ago
epfml / landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
☆427Updated last year
jondurbin / bagel
A bagel, with everything.
☆325Updated last year
dzhulgakov / llama-mistral
Inference code for Mistral and Mixtral hacked up into original Llama implementation
☆371Updated last year
lucidrains / MEGABYTE-pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
☆654Updated 11 months ago
apoorvumang / prompt-lookup-decoding
☆581Updated last year