broskicodes / slms
Experimenting with small language models
☆68 Updated last year
Alternatives and similar repositories for slms
Users who are interested in slms are comparing it to the libraries listed below
- Video+code lecture on building nanoGPT from scratch ☆69 Updated last year
- ☆205 Updated last year
- ☆128 Updated 3 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆232 Updated 8 months ago
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language ☆310 Updated last year
- One click templates for inferencing Language Models ☆195 Updated last month
- ☆134 Updated 10 months ago
- Train your own small bitnet model ☆74 Updated 8 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation … ☆185 Updated 11 months ago
- Low-Rank adapter extraction for fine-tuned transformers models ☆173 Updated last year
- A python package for developing AI applications with local LLMs. ☆150 Updated 6 months ago
- A little (lil) Language Model (LM). A tiny reproduction of LLaMA 3's model architecture. ☆53 Updated 2 months ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆239 Updated last year
- ☆49 Updated 4 months ago
- 1.58-bit LLaMa model ☆81 Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆140 Updated 4 months ago
- A fast batching API to serve LLM models ☆183 Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub ☆162 Updated last year
- An unsupervised model merging algorithm for Transformers-based language models. ☆105 Updated last year
- Collection of autoregressive model implementations ☆85 Updated 2 months ago
- ☆66 Updated last year
- ☆86 Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆64 Updated last year
- ☆118 Updated 10 months ago
- Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget ☆153 Updated last year
- A compact LLM pretrained in 9 days by using high quality data ☆318 Updated 3 months ago
- ☆74 Updated 9 months ago
- ☆124 Updated 8 months ago
- Chrome & Firefox extension to chat with webpages: local LLMs ☆119 Updated 6 months ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆27 Updated last year