Trainy-ai / trainy
A simple Pure Python/PyTorch performance daemon for training workloads
☆15Updated last year
Alternatives and similar repositories for trainy:
Users that are interested in trainy are comparing it to the libraries listed below
- Profiling tools for distributed training☆38Updated last year
- Cerule - A Tiny Mighty Vision Model☆67Updated 8 months ago
- Retrieve the source code for any model made available on replicate.com!☆34Updated last year
- ☆199Updated last year
- Examples of apps built with Nendo, the AI Audio Tool Suite☆55Updated last year
- Run GGML models with Kubernetes.☆173Updated last year
- A synthetic story narration dataset to study small audio LMs.☆32Updated last year
- Fine-tuning and serving LLMs on any cloud☆89Updated last year
- Joint speech-language model - respond directly to audio!☆30Updated 11 months ago
- Helpers and such for working with Lambda Cloud☆51Updated last year
- ☆40Updated 2 years ago
- ☆62Updated 9 months ago
- 🚀 End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam☆27Updated last year
- A simple, hackable text-to-speech system in PyTorch and MLX☆154Updated 2 months ago
- Cedana: Access and run on compute anywhere in the world, on any provider. Migrate seamlessly between providers, arbitraging price/perform…☆58Updated this week
- Full finetuning of large language models without large memory requirements☆94Updated last year
- https://hf.co/hexgrad/Kokoro-82M☆14Updated 2 months ago
- Focused on fast experimentation and simplicity☆71Updated 4 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- ☆42Updated 9 months ago
- JAX implementation of the Llama 2 model☆218Updated last year
- DiffusionWithAutoscaler☆29Updated last year
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆123Updated 2 weeks ago
- git extension for {collaborative, communal, continual} model development☆211Updated 5 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆171Updated last week
- An unsupervised model merging algorithm for Transformers-based language models.☆105Updated last year
- ☆33Updated 7 months ago
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆52Updated last year
- ☆16Updated 6 months ago
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"☆140Updated last year