huggingface / fuego
[WIP] A π₯ interface for running code in the cloud
β85Updated 2 years ago
Alternatives and similar repositories for fuego:
Users that are interested in fuego are comparing it to the libraries listed below
- Scripts to convert datasets from various sources to Hugging Face Datasets.β57Updated 2 years ago
- π€ Disaggregators: Curated data labelers for in-depth analysis.β65Updated 2 years ago
- A library for squeakily cleaning and filtering language datasets.β47Updated last year
- ππ€ A collection of templates for Hugging Face Spacesβ35Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pileβ115Updated 2 years ago
- Pipeline for pulling and processing online language model pretraining data from the webβ177Updated last year
- β19Updated 2 years ago
- One stop shop for all things carpβ59Updated 2 years ago
- **ARCHIVED** Filesystem interface to π€ Hubβ58Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.β93Updated 2 years ago
- β22Updated last year
- GitHub action that'll sync files from a GitHub Repo with the Hugging Face Hub π€β70Updated 6 months ago
- Experiments with generating opensource language model assistantsβ97Updated last year
- π¨ Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.β50Updated last year
- β130Updated 2 years ago
- Fast AI Practical Deep Learning for Coders experiments in Stable Diffusionβ25Updated 2 years ago
- β92Updated last year
- [Added T5 support to TRLX] A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)β47Updated 2 years ago
- DiffusionWithAutoscalerβ29Updated last year
- Train vision models using JAX and π€ transformersβ97Updated 3 weeks ago
- QLoRA with Enhanced Multi GPU Supportβ37Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.β82Updated last year
- Reimplementation of the task generation part from the Alpaca paperβ119Updated 2 years ago
- β123Updated 6 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and teβ¦β42Updated last year
- β50Updated last year
- A miniture AI training framework for PyTorchβ42Updated 3 months ago
- Latent Diffusion Language Modelsβ68Updated last year
- β67Updated 2 years ago
- Smol but mighty language modelβ63Updated 2 years ago