Jaykef / min-patchnizer
Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.
☆11Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for min-patchnizer
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- Using multiple LLMs for ensemble Forecasting☆16Updated 10 months ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆16Updated last month
- Visual RAG using less than 300 lines of code.☆23Updated 8 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆24Updated 3 weeks ago
- ☆14Updated this week
- implementation of https://arxiv.org/pdf/2312.09299☆19Updated 4 months ago
- Let's try and finetune the OpenAI consistency decoder to work for SDXL☆23Updated 11 months ago
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆12Updated last month
- Digital daydreaming with CLIP Interrogator and Diffusers☆13Updated 2 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated 10 months ago
- Official repository for the paper "Images as Weight Matrices: Sequential Image Generation Through Synaptic Learning Rules" (ICLR 2023)☆12Updated last year
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆15Updated last week
- Training hybrid models for dummies.☆15Updated 3 weeks ago
- Rust bindings for CTranslate2☆13Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆13Updated 3 weeks ago
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL…☆19Updated last month
- ☆27Updated 3 months ago
- ☆12Updated 7 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆15Updated 3 weeks ago
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆16Updated 8 months ago
- ☆15Updated 11 months ago
- Example of finetuning CLIP to identify plants.☆10Updated 4 months ago
- ☆27Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 5 months ago
- Guide diffusion on ImageBind embedding similarity☆28Updated last year
- Load any clip model with a standardized interface☆21Updated 6 months ago
- ☆41Updated 5 months ago