Thytu / SMITLinks
SMIT: A Simple Modality Integration Tool
☆15Updated last year
Alternatives and similar repositories for SMIT
Users that are interested in SMIT are comparing it to the libraries listed below
Sorting:
- ML/DL Math and Method notes☆66Updated 2 years ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 11 months ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Updated last year
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆137Updated last week
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Updated 2 years ago
- NLP with Rust for Python 🦀🐍☆71Updated 8 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Updated 6 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated last year
- ☆10Updated last year
- Train vision models using JAX and 🤗 transformers☆100Updated last month
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆66Updated 2 months ago
- Cost aware hyperparameter tuning algorithm☆179Updated last year
- Tools to make language models a bit easier to use☆64Updated last week
- ☆91Updated 7 months ago
- ☆22Updated 2 years ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆44Updated 2 years ago
- ☆102Updated 2 weeks ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Updated 2 years ago
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆300Updated last year
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆126Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- ☆31Updated last year
- Chat Markup Language conversation library☆55Updated 2 years ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- Clean RL implementation using MLX☆34Updated last year
- QLoRA with Enhanced Multi GPU Support☆37Updated 2 years ago
- Code for the paper Don't Pay Attention☆51Updated 4 months ago
- Various transformers for FSDP research☆38Updated 3 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Updated last year