pHaeusler / tinycatstories
☆10 · Updated 2 years ago
Alternatives and similar repositories for tinycatstories
Users interested in tinycatstories are comparing it to the libraries listed below.
- Tune MPTs ☆84 · Updated 2 years ago
- Multipack distributed sampler for fast padding-free training of LLMs ☆204 · Updated last year
- ☆95 · Updated 2 years ago
- batched loras ☆349 · Updated 2 years ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆183 · Updated 3 months ago
- Evaluating LLMs with Dynamic Data ☆111 · Updated 3 weeks ago
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia ☆42 · Updated 2 years ago
- RWKV infctx trainer, for training arbitrary context sizes, to 10k and beyond! ☆148 · Updated last year
- Comprehensive analysis of the differences in performance of QLoRA, LoRA, and full finetunes. ☆83 · Updated 2 years ago
- Experiments with generating open-source language model assistants ☆97 · Updated 2 years ago
- A bagel, with everything. ☆326 · Updated last year
- A lightweight, hackable, and efficient framework for training and fine-tuning language models ☆187 · Updated last week
- TART: A plug-and-play Transformer module for task-agnostic reasoning ☆202 · Updated 2 years ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe… ☆159 · Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆79 · Updated last year
- Long context evaluation for large language models ☆226 · Updated 11 months ago
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers ☆50 · Updated 2 years ago
- Synthetic Role-Play Conversation Dataset Generation ☆49 · Updated 2 years ago
- Experiments around a simple idea for inducing multiple hierarchical predictive models within a GPT ☆224 · Updated last year
- ☆416 · Updated 2 years ago
- An unsupervised model merging algorithm for Transformers-based language models. ☆108 · Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers ☆426 · Updated 2 years ago
- ☆81 · Updated last year
- A pipeline for LLM knowledge distillation ☆112 · Updated 10 months ago
- Fast modular code to create and train cutting edge LLMs ☆68 · Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ☆119 · Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat ☆100 · Updated 2 years ago
- PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention… ☆294 · Updated last year
- ☆535 · Updated 2 years ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆144 · Updated 2 years ago