tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆67Updated 2 months ago
Related projects
Alternatives and complementary repositories for Cerule
- Routing on Random Forest (RoRF)☆82Updated last month
- ☆62Updated last month
- Video+code lecture on building nanoGPT from scratch☆64Updated 4 months ago
- Full finetuning of large language models without large memory requirements☆93Updated 10 months ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated 7 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated 10 months ago
- An introduction to LLM Sampling☆18Updated this week
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆53Updated this week
- ☆48Updated last year
- Using modal.com to process FineWeb-edu data☆19Updated 2 months ago
- look how they massacred my boy☆53Updated 3 weeks ago
- Simple program to manually caption your images (or any other file types) so you can use them for AI training☆37Updated last year
- ☆49Updated 7 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated 10 months ago
- inference code for mixtral-8x7b-32kseqlen☆98Updated 10 months ago
- alternative way of calculating self-attention☆18Updated 5 months ago
- ☆103Updated 7 months ago
- Simple examples using Argilla tools to build AI☆38Updated this week
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆20Updated 4 months ago
- ☆116Updated 2 months ago
- ☆22Updated last year
- Collection of autoregressive model implementations☆66Updated this week
- ☆55Updated 11 months ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆23Updated 11 months ago
- Simplex Random Feature attention, in PyTorch☆71Updated last year
- ☆48Updated last year
- The Next Generation Multi-Modality Superintelligence☆70Updated 2 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆105Updated last week
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model☆40Updated last year
- Using multiple LLMs for ensemble Forecasting☆16Updated 9 months ago