EthanBnntt / tinygrad-vitLinks

A minimalist implementation of the ViT (Vision Transformer) model, using tinygrad

☆15

Alternatives and similar repositories for tinygrad-vit

Users that are interested in tinygrad-vit are comparing it to the libraries listed below

Sorting:

minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 9 months ago
oKatanaaa / kolibrify
Curriculum training of instruction-following LLMs with Unsloth
☆14Updated 8 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆58Updated last month
Aleph-Alpha-Research / trigrams
☆58Updated this week
QuixiAI / grokadamw
☆136Updated last year
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆68Updated this week
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆51Updated 9 months ago
alvarobartt / vertex-ai-huggingface-inference-toolkit
🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)
☆17Updated last year
raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆66Updated 6 months ago
Knowledgator / FlashDeBERTa
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆66Updated last month
stephantul / skeletoken
Datamodels for hugging face tokenizers
☆86Updated this week
Columbia-NLP-Lab / PAPILLON
Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles
☆60Updated 6 months ago
Zyphra / Zyda_processing
☆39Updated last year
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆98Updated 6 months ago
flowaicom / flow-judge
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…
☆78Updated last year
argilla-io / distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24Updated last year
broskicodes / chess-position-embeddings
code for training and using chess embeddings models
☆12Updated last year
MinishLab / tokenlearn
Pre-train Static Word Embeddings
☆90Updated 2 months ago
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆69Updated last year
arcee-ai / DAM
☆55Updated last year
axolotl-ai-cloud / axolotl-cookbook
☆36Updated 3 months ago
dropbox / aana_sdk
Aana SDK is a powerful framework for building AI enabled multimodal applications.
☆53Updated 3 months ago
teknium1 / transformers-gptq-quant
☆45Updated 2 years ago
goncalorafaria / qalign
QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.
☆24Updated last week
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13Updated last year
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆145Updated 9 months ago
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆72Updated last year
LeonEricsson / llmcontext
Pressure testing the context window of open LLMs
☆25Updated last year
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆107Updated 8 months ago
QuixiAI / spectrum
☆138Updated 3 months ago