yanivle / fast_minbpe
☆12Updated last month
Alternatives and similar repositories for fast_minbpe:
Users that are interested in fast_minbpe are comparing it to the libraries listed below
- ☆27Updated 7 months ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 3 months ago
- ☆20Updated 9 months ago
- ☆26Updated 11 months ago
- Let's try and finetune the OpenAI consistency decoder to work for SDXL☆23Updated last year
- QLoRA for Masked Language Modeling☆21Updated last year
- Collaborative inference of latent diffusion via hivemind☆12Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- Hugging Face Deep RL Class notes☆10Updated 2 years ago
- ImageSlider custom component for gradio.☆39Updated 9 months ago
- CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell☆14Updated last year
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆31Updated last year
- Recaption large (Web)Datasets with vllm and save the artifacts.☆47Updated 3 months ago
- ☆49Updated 11 months ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆17Updated 6 months ago
- Tools for content datamining and NLP at scale☆42Updated 8 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 7 months ago
- ☆37Updated last year
- Writing FLUX in Triton☆32Updated 5 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- Experiments with generating opensource language model assistants☆97Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- A library for squeakily cleaning and filtering language datasets.☆46Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆47Updated last week
- ☆17Updated 11 months ago
- Collection of autoregressive model implementation☆81Updated 3 weeks ago
- Merge LLM that are split in to parts☆26Updated last year