Jaykef / min-patchnizerLinks
Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.
☆11Updated last year
Alternatives and similar repositories for min-patchnizer
Users that are interested in min-patchnizer are comparing it to the libraries listed below
Sorting:
- Cerule - A Tiny Mighty Vision Model☆68Updated last year
- GPT as Knowledger Worker (or if you really want, GPT Sorta' Takes the CPA Exam)☆13Updated 2 years ago
- assign color hues to a collection of text fragments based on embeddings☆20Updated last year
- ☆14Updated last year
- Latent Diffusion Language Models☆69Updated 2 years ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆37Updated 2 weeks ago
- The implementation of "Leeroo Orchestrator: Elevating LLMs Performance Through Model Integration"☆56Updated last year
- Make-A-Video Latent Diffusion Model☆19Updated last year
- Visual RAG using less than 300 lines of code.☆29Updated last year
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.☆16Updated 4 years ago
- ☆20Updated 6 months ago
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!☆40Updated last year
- A synthetic story narration dataset to study small audio LMs.☆32Updated last year
- ☆15Updated last year
- Create topological graph for image segments.☆22Updated 11 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- ☆27Updated last year
- The official Python SDK for the Perceptron API☆22Updated last week
- my solution for Abstaction and reasoning challenge on kaggle☆10Updated last year
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago
- ☆63Updated last year
- Visual search interface☆11Updated 3 years ago
- The Next Generation Multi-Modality Superintelligence☆70Updated last year
- Backend for the diffusion-ui frontend☆25Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated 2 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version☆26Updated 3 years ago
- ☆62Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆21Updated last year
- ☆27Updated 2 years ago