Jaykef / min-patchnizer
Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.
☆11Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for min-patchnizer
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆13Updated 3 weeks ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated this week
- Rust bindings for CTranslate2☆13Updated last year
- ☆12Updated 3 weeks ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆24Updated last week
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL…☆19Updated last month
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆16Updated last month
- ☆12Updated 7 months ago
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆18Updated last year
- Cog wrapper for collabora/WhisperSpeech☆24Updated 8 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆19Updated 4 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆18Updated last year
- Lottery Ticket Adaptation☆35Updated last month
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆16Updated 8 months ago
- Visual RAG using less than 300 lines of code.☆23Updated 8 months ago
- Training hybrid models for dummies.☆15Updated last week
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆15Updated 2 weeks ago
- Let's try and finetune the OpenAI consistency decoder to work for SDXL☆23Updated 11 months ago
- YouTube Assistant☆12Updated last year
- ☆19Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 5 months ago
- Official repository for the paper "Images as Weight Matrices: Sequential Image Generation Through Synaptic Learning Rules" (ICLR 2023)☆12Updated last year
- Apps that run on modal.com☆12Updated 5 months ago
- ☆41Updated 4 months ago
- Example of finetuning CLIP to identify plants.☆10Updated 4 months ago
- Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version☆26Updated 2 years ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆36Updated 7 months ago
- Latent Large Language Models☆16Updated 2 months ago
- An AI character interaction system with emotional modeling and advanced memory management☆14Updated 2 weeks ago