broskicodes / slms
Experimenting with small language models
☆64 Updated last year
Alternatives and similar repositories for slms:
Users who are interested in slms are comparing it to the repositories listed below:
- Video+code lecture on building nanoGPT from scratch ☆66 Updated 9 months ago
- A little(lil) Language Model (LM) ☆47 Updated 3 weeks ago
- ☆201 Updated 9 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation … ☆177 Updated 8 months ago
- Train your own small bitnet model ☆65 Updated 5 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆230 Updated 4 months ago
- ☆126 Updated 7 months ago
- A simple Jupyter Notebook for learning MLX text-completion fine-tuning! ☆114 Updated 4 months ago
- ☆75 Updated 5 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆59 Updated last year
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ☆31 Updated last month
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆138 Updated last month
- Collection of autoregressive model implementations ☆83 Updated last month
- Testing LLM reasoning abilities with family relationship quizzes. ☆62 Updated last month
- 1.58-bit LLaMa model ☆82 Updated 11 months ago
- a simplified version of Google's Gemma model to be used for learning ☆24 Updated last year
- Reweight GPT - a simple neural network using transformer architecture for next character prediction ☆53 Updated last year
- ☆66 Updated 10 months ago
- Experimental BitNet Implementation ☆61 Updated last year
- Self-contained, minimalistic implementation of a language model that generates coherent and normal sounding names. It uses an input datas… ☆50 Updated last year
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language ☆300 Updated 9 months ago
- ☆114 Updated 6 months ago
- customizable template GPT code designed for easy novel architecture experimentation ☆26 Updated last week
- ☆111 Updated 3 months ago
- Set of scripts to finetune LLMs ☆37 Updated 11 months ago
- A fast batching API to serve LLM models ☆183 Updated 11 months ago
- One click templates for inferencing Language Models ☆165 Updated last week
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2. ☆148 Updated 10 months ago
- An introduction to LLM Sampling ☆77 Updated 3 months ago
- ☆125 Updated last week