nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆64 Updated 4 months ago
Related projects
Alternatives and complementary repositories for build-nanogpt
- ☆103 Updated 7 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆105 Updated last week
- Cerule - A Tiny Mighty Vision Model ☆67 Updated 2 months ago
- ☆93 Updated 2 months ago
- ☆116 Updated 2 months ago
- ☆64 Updated 5 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated 10 months ago
- Scripts to create your own moe models using mlx ☆86 Updated 8 months ago
- ☆28 Updated 7 months ago
- entropix style sampling + GUI ☆25 Updated last week
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 Updated 10 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs. ☆38 Updated 5 months ago
- Full finetuning of large language models without large memory requirements ☆93 Updated 10 months ago
- look how they massacred my boy ☆53 Updated 3 weeks ago
- ☆61 Updated 3 months ago
- All the world is a play, we are but actors in it. ☆47 Updated 4 months ago
- Low-Rank adapter extraction for fine-tuned transformers model ☆162 Updated 6 months ago
- inference code for mixtral-8x7b-32kseqlen ☆98 Updated 10 months ago
- Simple examples using Argilla tools to build AI ☆38 Updated this week
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆81 Updated last month
- Fast parallel LLM inference for MLX ☆146 Updated 4 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆53 Updated this week
- ☆148 Updated 3 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface ☆87 Updated 4 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆77 Updated 5 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆222 Updated 6 months ago
- ☆48 Updated last year
- smolLM with Entropix sampler on pytorch ☆137 Updated last week
- ☆49 Updated 7 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework ☆71 Updated 8 months ago