broskicodes / slms
Experimenting with small language models
☆65Updated last year
Alternatives and similar repositories for slms:
Users that are interested in slms are comparing it to the libraries listed below
- ☆129Updated 8 months ago
- Video+code lecture on building nanoGPT from scratch☆65Updated 10 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 5 months ago
- ☆126Updated last month
- 1.58-bit LLaMa model☆81Updated last year
- ☆46Updated 2 months ago
- Low-Rank adapter extraction for fine-tuned transformers models☆171Updated 11 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated 2 months ago
- Reweight GPT - a simple neural network using transformer architecture for next character prediction☆53Updated last year
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆42Updated 11 months ago
- ☆66Updated 11 months ago
- Let's create synthetic textbooks together :)☆74Updated last year
- Fine Tuning Multimodal LLM "Idefics 9B" on Pokemon Go Dataset available on Hugging Face.☆19Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆160Updated last year
- ☆113Updated 2 weeks ago
- ☆204Updated 10 months ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆14Updated 3 weeks ago
- A python package for developing AI applications with local LLMs.☆147Updated 3 months ago
- entropix style sampling + GUI☆25Updated 5 months ago
- ☆74Updated 6 months ago
- chrome & firefox extension to chat with webpages: local llms☆113Updated 4 months ago
- A fast batching API to serve LLM models☆182Updated 11 months ago
- ☆130Updated last week
- An introduction to LLM Sampling☆77Updated 4 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- Experimental BitNet Implementation☆64Updated last year
- Collection of autoregressive model implementation☆85Updated 2 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆61Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction'☆235Updated 11 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago