Jaykef / min-patchnizerLinks
Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.
☆11Updated last year
Alternatives and similar repositories for min-patchnizer
Users that are interested in min-patchnizer are comparing it to the libraries listed below
Sorting:
- The implementation of "Leeroo Orchestrator: Elevating LLMs Performance Through Model Integration"☆56Updated last year
- Load any clip model with a standardized interface☆22Updated 3 months ago
- Make-A-Video Latent Diffusion Model☆19Updated 2 years ago
- GPT as Knowledger Worker (or if you really want, GPT Sorta' Takes the CPA Exam)☆13Updated 3 years ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Updated last year
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP☆39Updated 3 years ago
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!☆40Updated 2 years ago
- ☆63Updated last year
- my solution for Abstaction and reasoning challenge on kaggle☆10Updated last year
- ☆27Updated 2 years ago
- Implementation of a holodeck, written in Pytorch☆18Updated 2 years ago
- Colab notebook to finetune GLIDE.☆12Updated 3 years ago
- assign color hues to a collection of text fragments based on embeddings☆20Updated last year
- A synthetic story narration dataset to study small audio LMs.☆31Updated 2 years ago
- Cerule - A Tiny Mighty Vision Model☆68Updated 2 months ago
- A GPT with self-similar nested properties☆20Updated last year
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 4 years ago
- ☆20Updated 11 months ago
- Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version☆26Updated 3 years ago
- ☆19Updated 2 years ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago
- The Next Generation Multi-Modality Superintelligence☆70Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆19Updated 2 years ago
- Create topological graph for image segments.☆23Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated 2 years ago
- Finetune any model on HF in less than 30 seconds☆56Updated last week
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆38Updated 3 weeks ago
- Latent Diffusion Language Models☆70Updated 2 years ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆32Updated 2 years ago
- ☆14Updated 2 years ago