EthanBnntt / tinygrad-vitLinks
A minimalist implementation of the ViT (Vision Transformer) model, using tinygrad
☆15Updated last year
Alternatives and similar repositories for tinygrad-vit
Users that are interested in tinygrad-vit are comparing it to the libraries listed below
Sorting:
- Simple GRPO scripts and configurations.☆59Updated 9 months ago
- Curriculum training of instruction-following LLMs with Unsloth☆14Updated 8 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated last month
- ☆58Updated this week
- ☆136Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆68Updated this week
- ☆51Updated 9 months ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- NLP with Rust for Python 🦀🐍☆66Updated 6 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆66Updated last month
- Datamodels for hugging face tokenizers☆86Updated this week
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆60Updated 6 months ago
- ☆39Updated last year
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆98Updated 6 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- code for training and using chess embeddings models☆12Updated last year
- Pre-train Static Word Embeddings☆90Updated 2 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆69Updated last year
- ☆55Updated last year
- ☆36Updated 3 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆53Updated 3 months ago
- ☆45Updated 2 years ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆24Updated last week
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆145Updated 9 months ago
- Small and Efficient Mathematical Reasoning LLMs☆72Updated last year
- Pressure testing the context window of open LLMs☆25Updated last year
- Train your own SOTA deductive reasoning model☆107Updated 8 months ago
- ☆138Updated 3 months ago