yanivle / fast_minbpe

☆12

Alternatives and similar repositories for fast_minbpe:

Users who are interested in fast_minbpe are comparing it to the repositories listed below.

ethansmith2000 / AutoLoRADiscovery
☆27 · Updated 7 months ago
crypdick / timm-lr-scheduler-explorer
A dashboard for exploring timm learning rate schedulers
☆19 · Updated 3 months ago
LLM360 / k2-data-prep
☆20 · Updated 9 months ago
cloneofsimo / auto_llm_codebase_analysis
☆26 · Updated 11 months ago
Birch-san / sdxl-diffusion-decoder
Let's try and finetune the OpenAI consistency decoder to work for SDXL
☆23 · Updated last year
ChrisHayduk / QLoRA-for-MLM
QLoRA for Masked Language Modeling
☆21 · Updated last year
learning-at-home / collaborative-latent-diffusion
Collaborative inference of latent diffusion via hivemind
☆12 · Updated last year
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34 · Updated last year
rwightman / genalog
Genalog is an open-source, cross-platform Python package allowing generation of synthetic document images with custom degradations and te…
☆42 · Updated last year
Elameri / huggingface-deep-rl-class-notes
Hugging Face Deep RL Class notes
☆10 · Updated 2 years ago
pngwn / gradio-imageslider
ImageSlider custom component for Gradio.
☆39 · Updated 9 months ago
contrebande-labs / charred
CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell
☆14 · Updated last year
crowsonkb / dice-mc
DiCE: The Infinitely Differentiable Monte-Carlo Estimator
☆31 · Updated last year
sayakpaul / simple-image-recaptioning
Recaption large (web) datasets with vLLM and save the artifacts.
☆47 · Updated 3 months ago
euclaise / supertrainer2000
☆49 · Updated 11 months ago
peanutcocktail / CogVideo
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
☆17 · Updated 6 months ago
LAION-AI / riverbed
Tools for content data mining and NLP at scale
☆42 · Updated 8 months ago
huggingface / pixparse
Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data
☆21 · Updated 7 months ago
deep-diver / LLM-Pref-Mark-UI
☆37 · Updated last year
timudk / flux_triton
Writing FLUX in Triton
☆32 · Updated 5 months ago
kaiokendev / cutoff-len-is-context-len
Demonstration that fine-tuning a RoPE model on sequences longer than those seen in pre-training adapts the model's context limit
☆63 · Updated last year
Rallio67 / language-model-agents
Experiments with generating open-source language model assistants
☆97 · Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.
☆33 · Updated last year
CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆46 · Updated last year
huggingface / peft-pytorch-conference
Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…
☆14 · Updated last year
drisspg / transformer_nuggets
A place to store reusable transformer components of my own creation or found on the interwebs
☆47 · Updated last week
wangitu / Ada-Instruct
☆17 · Updated 11 months ago
joey00072 / ohara
Collection of autoregressive model implementations
☆81 · Updated 3 weeks ago
donaldafeith / Pytorch_Merge
Merge LLMs that are split into parts
☆26 · Updated last year