zjersey / Lightseq-ARM

☆30

Alternatives and similar repositories for Lightseq-ARM:

Users who are interested in Lightseq-ARM are comparing it to the libraries listed below.

kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63Updated last year
ConiferLabsWA / flan-ul2-alpaca
☆32Updated last year
jadechip / nanoXLSTM
The simplest, fastest repository for training/finetuning medium-sized xLSTMs.
☆39Updated 9 months ago
huggingface / peft-pytorch-conference
Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…
☆14Updated last year
qwopqwop200 / gptqlora
GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ
☆99Updated last year
Digitous / LLM-SLERP-Merge
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆115Updated last year
kyegomez / Finetuning-Suite
Finetune any model on HF in less than 30 seconds
☆58Updated last month
LegallyCoder / mamba-hf
Implementation of the Mamba SSM with hf_integration.
☆56Updated 5 months ago
huggingface / pixparse
Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data
☆21Updated 6 months ago
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33Updated 11 months ago
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆76Updated 9 months ago
austinsilveria / tricksy
Fast approximate inference on a single GPU with sparsity aware offloading
☆38Updated last year
KyujinHan / Sakura-SOLAR-DPO
Sakura-SOLAR-DPO: Merge, SFT, and DPO
☆116Updated last year
jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆77Updated 10 months ago
kyegomez / Kosmos-X
The Next Generation Multi-Modality Superintelligence
☆71Updated 5 months ago
cloneofsimo / fim-llama-deepspeed
☆31Updated last year
knoriy / CLARA
☆62Updated 7 months ago
NolanoOrg / sparse_quant_llms
SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia
☆41Updated last year
deep-diver / PingPong
manage histories of LLM applied applications
☆88Updated last year
BlinkDL / modded-nanogpt-rwkv
RWKV-7: Surpassing GPT
☆79Updated 3 months ago
swj0419 / detect-pretrain-code-contamination
☆74Updated last year
geov-ai / geov
The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…
☆121Updated last year
kyegomez / FastFF
Zeta implementation of a reusable, plug-and-play feedforward from the paper "Exponentially Faster Language Modeling"
☆15Updated 3 months ago
RobertCsordas / moe_attention
Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"
☆96Updated 4 months ago
rohandkn / skribble2vid
☆24Updated last year
NolanoOrg / llama-int4-quant
☆26Updated last year
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆115Updated last year
cloneofsimo / auto_llm_codebase_analysis
☆26Updated 11 months ago
nyunAI / PruneGPT
☆53Updated 8 months ago