Jaykef / min-patchnizerLinks
Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.
☆11Updated last year
Alternatives and similar repositories for min-patchnizer
Users that are interested in min-patchnizer are comparing it to the libraries listed below
Sorting:
- Create topological graph for image segments.☆23Updated last year
- Make-A-Video Latent Diffusion Model☆19Updated 2 years ago
- Cerule - A Tiny Mighty Vision Model☆68Updated 2 months ago
- ☆15Updated 2 years ago
- ☆27Updated 2 years ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆38Updated 2 weeks ago
- Image Generation API Server - Similar to https://text-generator.io but for images☆51Updated 5 months ago
- ☆27Updated last year
- The implementation of "Leeroo Orchestrator: Elevating LLMs Performance Through Model Integration"☆56Updated last year
- ☆63Updated last year
- Load any clip model with a standardized interface☆22Updated 3 months ago
- ☆14Updated 2 years ago
- my solution for Abstaction and reasoning challenge on kaggle☆10Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆90Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆44Updated last year
- Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version☆26Updated 3 years ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Updated last year
- Gradio app to track objects in video and add visual effects☆17Updated 6 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- GPT as Knowledger Worker (or if you really want, GPT Sorta' Takes the CPA Exam)☆13Updated 3 years ago
- A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrations…☆12Updated last year
- ☆27Updated last year
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 4 years ago
- Port of Facebook's LLaMA model in C/C++☆21Updated 2 years ago
- ☆19Updated 2 years ago
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated 2 years ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Updated last year
- BH hackathon☆14Updated last year
- assign color hues to a collection of text fragments based on embeddings☆20Updated last year