Jaykef / min-patchnizerLinks
Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.
☆11Updated last year
Alternatives and similar repositories for min-patchnizer
Users that are interested in min-patchnizer are comparing it to the libraries listed below
Sorting:
- Let's try and finetune the OpenAI consistency decoder to work for SDXL☆24Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆21Updated last year
- Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version☆26Updated 3 years ago
- Create topological graph for image segments.☆22Updated 11 months ago
- ☆63Updated 11 months ago
- Backend for the diffusion-ui frontend☆25Updated last year
- Colab notebook to finetune GLIDE.☆13Updated 3 years ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆37Updated this week
- Make-A-Video Latent Diffusion Model☆19Updated last year
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.☆16Updated 4 years ago
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch☆50Updated 2 years ago
- The implementation of "Leeroo Orchestrator: Elevating LLMs Performance Through Model Integration"☆56Updated last year
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated last year
- Load any clip model with a standardized interface☆22Updated 2 weeks ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated last year
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP☆40Updated 2 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- Implementation of the proposed Spline-Based Transformer from Disney Research☆103Updated 9 months ago
- GPT as Knowledger Worker (or if you really want, GPT Sorta' Takes the CPA Exam)☆13Updated 2 years ago
- ☆20Updated 5 months ago
- Repository with which to explore k-diffusion and diffusers, and within which changes to said packages may be tested.☆53Updated last year
- Training hybrid models for dummies.☆25Updated 7 months ago
- ☆28Updated last year
- assign color hues to a collection of text fragments based on embeddings☆20Updated last year
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 6 years ago
- ☆25Updated last year
- Exploration into the Firefly algorithm in Pytorch☆40Updated 6 months ago
- Visual RAG using less than 300 lines of code.☆28Updated last year
- Latent Diffusion Language Models☆69Updated last year