Jaykef / min-patchnizerLinks
Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.
☆11Updated last year
Alternatives and similar repositories for min-patchnizer
Users that are interested in min-patchnizer are comparing it to the libraries listed below
Sorting:
- Create topological graph for image segments.☆22Updated last year
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 4 years ago
- Make-A-Video Latent Diffusion Model☆19Updated 2 years ago
- Cerule - A Tiny Mighty Vision Model☆68Updated last month
- my solution for Abstaction and reasoning challenge on kaggle☆10Updated last year
- Latent Diffusion Language Models☆70Updated 2 years ago
- assign color hues to a collection of text fragments based on embeddings☆20Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆19Updated 2 years ago
- Let's try and finetune the OpenAI consistency decoder to work for SDXL☆24Updated 2 years ago
- Implementation of a holodeck, written in Pytorch☆18Updated 2 years ago
- A synthetic story narration dataset to study small audio LMs.☆31Updated last year
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Updated 11 months ago
- ☆63Updated last year
- Colab notebook to finetune GLIDE.☆12Updated 3 years ago
- ☆27Updated last year
- Rust bindings for CTranslate2☆14Updated 2 years ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Updated last year
- Visual RAG using less than 300 lines of code.☆29Updated last year
- ☆27Updated 2 years ago
- Aggregating embeddings over time☆32Updated 2 years ago
- This contains the Flax model of min(DALL·E) and code for converting it to PyTorch☆45Updated 3 years ago
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!☆40Updated last year
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 6 years ago
- The Next Generation Multi-Modality Superintelligence☆70Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆25Updated 2 weeks ago
- Implementation of the Mamba SSM with hf_integration.☆56Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated 2 years ago
- A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrations…☆12Updated last year
- GPT as Knowledger Worker (or if you really want, GPT Sorta' Takes the CPA Exam)☆13Updated 2 years ago
- Official Code for MIMETIC^2☆13Updated last year