Algomancer / The-Daily-Train
Training Models Daily
☆17Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for The-Daily-Train
- alternative way to calculating self attention☆18Updated 5 months ago
- ☆36Updated 3 months ago
- Using modal.com to process FineWeb-edu data☆19Updated 2 months ago
- RWKV-7: Surpassing GPT☆45Updated this week
- ☆25Updated 10 months ago
- Training hybrid models for dummies.☆15Updated 3 weeks ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated 7 months ago
- ☆22Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆22Updated last month
- Cerule - A Tiny Mighty Vision Model☆67Updated 2 months ago
- BH hackathon☆14Updated 7 months ago
- look how they massacred my boy☆58Updated last month
- ☆27Updated 4 months ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆15Updated last week
- ☆20Updated 2 weeks ago
- A synthetic story narration dataset to study small audio LMs.☆30Updated 10 months ago
- Sparse autoencoders for Contra text embedding models☆24Updated 6 months ago
- new optimizer☆19Updated 3 months ago
- ☆34Updated last year
- ☆28Updated this week
- Because it's there.☆14Updated 2 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆55Updated 2 weeks ago
- utilities for loading and running text embeddings with onnx☆39Updated 3 months ago
- ☆57Updated 11 months ago
- Karpathy's llama2.c transpiled to MLX for Apple Silicon☆15Updated 10 months ago
- Collection of autoregressive model implementation☆67Updated this week
- a highly efficient compression algorithm for the n1 implant (neuralink's compression challenge)☆45Updated 5 months ago
- An introduction to LLM Sampling☆64Updated 2 weeks ago
- ☆48Updated last year
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers☆71Updated 3 months ago