broskicodes / slms

Experimenting with small language models

☆68

Alternatives and similar repositories for slms

Users who are interested in slms are comparing it to the repositories listed below.

nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆69 Updated last year
Vaibhavs10 / optimise-my-whisper
☆205 Updated last year
Vaibhavs10 / notebooks
☆128 Updated 3 months ago
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆232 Updated 8 months ago
AI-Commandos / LLaMa2lang
Convenience scripts to finetune (chat-)LLaMa3 and other models for any language
☆310 Updated last year
TrelisResearch / one-click-llms
One click templates for inferencing Language Models
☆195 Updated last month
cognitivecomputations / grokadamw
☆134 Updated 10 months ago
pranavjad / tinyllama-bitnet
Train your own small bitnet model
☆74 Updated 8 months ago
severian42 / Vodalus-Expert-LLM-Forge
Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …
☆185 Updated 11 months ago
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆173 Updated last year
nath1295 / LLMFlex
A python package for developing AI applications with local LLMs.
☆150 Updated 6 months ago
CohleM / lilLM
A little (lil) Language Model (LM). A tiny reproduction of LLaMA 3's model architecture.
☆53 Updated 2 months ago
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239 Updated last year
rombodawg / Easy_training
☆49 Updated 4 months ago
rafacelente / bllama
1.58-bit LLaMa model
☆81 Updated last year
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆140 Updated 4 months ago
epolewski / EricLLM
A fast batching API to serve LLM models
☆183 Updated last year
TheBlokeAI / AIScripts
Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub
☆162 Updated last year
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆105 Updated last year
joey00072 / ohara
Collection of autoregressive model implementations
☆85 Updated 2 months ago
QuixiAI / kraken
☆66 Updated last year
geronimi73 / qlora-minimal
☆86 Updated last year
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆64 Updated last year
writer / writing-in-the-margins
☆118 Updated 10 months ago
keeeeenw / MicroLlama
Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget
☆153 Updated last year
Pints-AI / 1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
☆318 Updated 3 months ago
Vaibhavs10 / gpu-poor-llm-notebooks
☆74 Updated 9 months ago
huggingface / competitions
☆124 Updated 8 months ago
abhishekkrthakur / chat-ext
chrome & firefox extension to chat with webpages: local llms
☆119 Updated 6 months ago
l4b4r4b4b4 / AIDocks
LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT
☆27 Updated last year