thomwolf / sesame-explorationsLinks
☆30Updated 7 months ago
Alternatives and similar repositories for sesame-explorations
Users that are interested in sesame-explorations are comparing it to the libraries listed below
Sorting:
- ☆124Updated last year
- Video+code lecture on building nanoGPT from scratch☆68Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆108Updated 9 months ago
- Collection of autoregressive model implementation☆85Updated 7 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆123Updated 4 months ago
- ☆160Updated last year
- Datamodels for hugging face tokenizers☆86Updated 2 weeks ago
- Collection of Open Source Speech Data☆163Updated 2 months ago
- An introduction to LLM Sampling☆79Updated last year
- ☆136Updated last year
- ☆138Updated 3 months ago
- A simple, hackable text-to-speech system in PyTorch and MLX☆184Updated 4 months ago
- ☆89Updated 5 months ago
- ☆210Updated last year
- ☆53Updated 10 months ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆224Updated 6 months ago
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆70Updated 4 months ago
- code for training & evaluating Contextual Document Embedding models☆201Updated 7 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated last year
- ☆318Updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Updated 7 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆304Updated last week
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆153Updated 5 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- ☆56Updated 10 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆67Updated last week
- ☆208Updated last year
- Let's build better datasets, together!☆265Updated 11 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated last month