Thytu / SMITLinks
SMIT: A Simple Modality Integration Tool
☆15Updated last year
Alternatives and similar repositories for SMIT
Users that are interested in SMIT are comparing it to the libraries listed below
Sorting:
- NLP with Rust for Python 🦀🐍☆70Updated 7 months ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Updated last year
- ☆22Updated 2 years ago
- ☆90Updated 6 months ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Updated last year
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 10 months ago
- Train fastai models faster (and other useful tools)☆72Updated 7 months ago
- Tools to make language models a bit easier to use☆63Updated this week
- Simple repository for training small reasoning models☆47Updated 11 months ago
- ML/DL Math and Method notes☆65Updated 2 years ago
- Chat Markup Language conversation library☆55Updated 2 years ago
- ☆10Updated last year
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆66Updated last month
- ☆27Updated last year
- DiffusionWithAutoscaler☆29Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Updated last year
- ☆53Updated 11 months ago
- ☆125Updated last year
- Small, simple agent task environments for training and evaluation☆19Updated last year
- Cost aware hyperparameter tuning algorithm☆177Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆67Updated 3 months ago
- ☆47Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Updated 2 years ago
- Code for the paper Don't Pay Attention☆50Updated 3 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 8 months ago
- ☆94Updated 2 years ago
- Mobile Viewer for W&B, built on top of Flutter.☆39Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Updated 7 months ago