broskicodes / slms
Experimenting with small language models
☆47Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for slms
- 1.58-bit LLaMa model☆79Updated 7 months ago
- ☆116Updated 2 months ago
- Video+code lecture on building nanoGPT from scratch☆64Updated 4 months ago
- Train your own small bitnet model☆55Updated 3 weeks ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆105Updated 2 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆38Updated 5 months ago
- A fast batching API to serve LLM models☆172Updated 6 months ago
- ☆39Updated 2 months ago
- entropix style sampling + GUI☆25Updated last week
- ☆103Updated 2 months ago
- The training notebooks that were similar to the original script used to train TinyMistral.☆19Updated 11 months ago
- Self-contained, minimalistic implementation of a language model that generates coherent and normal sounding names. It uses an input datas…☆49Updated 8 months ago
- autologic is a Python package that implements the SELF-DISCOVER framework proposed in the paper SELF-DISCOVER: Large Language Models Self…☆57Updated 8 months ago
- ☆92Updated last month
- This is our own implementation of 'Layer Selective Rank Reduction'☆231Updated 5 months ago
- A pipeline parallel training script for LLMs.☆83Updated this week
- idea: https://github.com/nyxkrage/ebook-groupchat/☆81Updated 2 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆172Updated 3 months ago
- Easily view and modify JSON datasets for large language models☆62Updated last month
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆160Updated 3 months ago
- GPT-2 small trained on phi-like data☆65Updated 8 months ago
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of…☆46Updated last month
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆219Updated last week
- run ollama & gguf easily with a single command☆47Updated 5 months ago
- Experimental BitNet Implementation☆60Updated 7 months ago
- Embed arbitrary modalities (images, audio, documents, etc) into large language models.☆176Updated 7 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆221Updated 6 months ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆26Updated 5 months ago
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆154Updated 3 weeks ago