Jaykef / min-patchnizerLinks
Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.
☆11Updated last year
Alternatives and similar repositories for min-patchnizer
Users that are interested in min-patchnizer are comparing it to the libraries listed below
Sorting:
- Make-A-Video Latent Diffusion Model☆19Updated last year
- Load any clip model with a standardized interface☆21Updated last week
- GPT as Knowledger Worker (or if you really want, GPT Sorta' Takes the CPA Exam)☆13Updated 2 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago
- ☆24Updated 2 years ago
- Let's try and finetune the OpenAI consistency decoder to work for SDXL☆24Updated last year
- ☆17Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆21Updated last year
- Implementation of a holodeck, written in Pytorch☆18Updated last year
- ☆63Updated last year
- Cerule - A Tiny Mighty Vision Model☆67Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆42Updated last year
- Create topological graph for image segments.☆22Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated 2 years ago
- ☆27Updated last year
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆36Updated last month
- Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version☆25Updated 3 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- Visual search interface☆11Updated 3 years ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆45Updated last year
- ☆20Updated 7 months ago
- my solution for Abstaction and reasoning challenge on kaggle☆10Updated last year
- Backend for the diffusion-ui frontend☆24Updated last year
- The Next Generation Multi-Modality Superintelligence☆69Updated last year
- A JAX implementation of the continuous time formulation of Consistency Models☆84Updated 2 years ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP☆39Updated 2 years ago
- assign color hues to a collection of text fragments based on embeddings☆20Updated last year
- Repository with which to explore k-diffusion and diffusers, and within which changes to said packages may be tested.☆52Updated last year