Thytu / SMITLinks
SMIT: A Simple Modality Integration Tool
☆15Updated last year
Alternatives and similar repositories for SMIT
Users that are interested in SMIT are comparing it to the libraries listed below
Sorting:
- ☆22Updated 2 years ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆137Updated last week
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Updated last year
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆300Updated last year
- ML/DL Math and Method notes☆66Updated 2 years ago
- ☆10Updated last year
- ☆27Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Updated 2 years ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 11 months ago
- Cost aware hyperparameter tuning algorithm☆179Updated last year
- Mobile Viewer for W&B, built on top of Flutter.☆40Updated last year
- Automatically take good care of your preemptible TPUs☆37Updated 2 years ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆47Updated last year
- Train vision models using JAX and 🤗 transformers☆100Updated last month
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Updated last year
- ☆47Updated 2 years ago
- An easy to use tool to apply adversarial attacks☆12Updated last year
- NLP with Rust for Python 🦀🐍☆71Updated 8 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Updated 6 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆130Updated 2 months ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆66Updated 2 months ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆131Updated last week
- Datamodels for hugging face tokenizers☆99Updated this week
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Updated 8 months ago
- DiffusionWithAutoscaler☆29Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- Clean RL implementation using MLX☆34Updated last year
- Code for the paper Don't Pay Attention☆51Updated 4 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Updated last year
- OMNI: Open-endedness via Models of human Notions of Interestingness☆58Updated last year