iejMac/clip-video-encode

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/iejMac/clip-video-encode)

iejMac / clip-video-encode

Easily compute clip embeddings from video frames

☆149

Alternatives and similar repositories for clip-video-encode

Users that are interested in clip-video-encode are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

iejMac / video2numpy
View on GitHub
Optimized library for large-scale extraction of frames and audio from video.
☆204Sep 11, 2023Updated 2 years ago
LAION-AI / temporal-embedding-aggregation
View on GitHub
Aggregating embeddings over time
☆32Jan 19, 2023Updated 3 years ago
iejMac / video2dataset
View on GitHub
Easily create large video dataset from video urls
☆662Jul 30, 2024Updated last year
LAION-AI / video-clip
View on GitHub
Let's make a video clip
☆97Jul 29, 2022Updated 4 years ago
rom1504 / python-template
View on GitHub
Simple python template
☆44Apr 25, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
m-bain / webvid
View on GitHub
Large-scale text-video dataset. 10 million captioned short videos.
☆686Aug 14, 2024Updated last year
borisdayma / clip-jax
View on GitHub
Train vision models using JAX and 🤗 transformers
☆103Dec 14, 2025Updated 7 months ago
m-bain / frozen-in-time
View on GitHub
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
☆376May 19, 2022Updated 4 years ago
duchesneaumathieu / pyperlin
View on GitHub
GPU accelerated Perlin Noise in python
☆11Oct 23, 2020Updated 5 years ago
pbaylies / Augmented_CLIP
View on GitHub
Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.
☆60Mar 31, 2022Updated 4 years ago
rom1504 / clip-retrieval
View on GitHub
Easily compute clip embeddings and build a clip retrieval system with them
☆2,789Mar 28, 2026Updated 4 months ago
iejMac / GPTReview
View on GitHub
Get OpenAI GPT models to review your PR's
☆45Jun 9, 2023Updated 3 years ago
rom1504 / embedding-reader
View on GitHub
Efficiently read embedding in streaming from any filesystem
☆106Aug 9, 2025Updated 11 months ago
nostalgebraist / improved-diffusion
View on GitHub
Text-writing denoising diffusion (and much more)
☆30May 14, 2023Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
LAION-AI / General-GPT
View on GitHub
☆65Oct 4, 2023Updated 2 years ago
rom1504 / gpu-tester
View on GitHub
gpu tester detects broken and slow gpus in a cluster
☆72Feb 19, 2023Updated 3 years ago
young-geng / tpu_pod_commander
View on GitHub
TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.
☆20Sep 24, 2025Updated 10 months ago
renwang435 / video-ttt-release
View on GitHub
Test-Time Training on Video Streams
☆70Jul 24, 2023Updated 3 years ago
antoine77340 / howto100m
View on GitHub
Code for the HowTo100M paper
☆304Mar 10, 2020Updated 6 years ago
christophschuhmann / 4MC-4M-Image-Text-Pairs-with-CLIP-embeddings
View on GitHub
I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…
☆17Apr 22, 2021Updated 5 years ago
m-bain / CondensedMovies
View on GitHub
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
☆205Sep 21, 2022Updated 3 years ago
CasualGANPapers / Make-A-Scene
View on GitHub
Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
☆335Aug 9, 2022Updated 3 years ago
rom1504 / img2dataset
View on GitHub
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
☆4,438Oct 19, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Jack000 / DALLE-pytorch
View on GitHub
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
☆89Dec 3, 2021Updated 4 years ago
klauscc / VindLU
View on GitHub
☆108Dec 23, 2022Updated 3 years ago
mehdidc / feed_forward_vqgan_clip
View on GitHub
Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt
☆140Jan 3, 2024Updated 2 years ago
LAION-AI / OCR-ensemble
View on GitHub
☆42Jun 15, 2023Updated 3 years ago
LAION-AI / interesting-text-datasets
View on GitHub
☆47Dec 28, 2022Updated 3 years ago
TheoCoombes / ClipCap
View on GitHub
Using pretrained encoder and language models to generate captions from multimedia inputs.
☆101Mar 11, 2023Updated 3 years ago
kostarion / guided-diffusion
View on GitHub
☆13Jun 7, 2023Updated 3 years ago
OpenGVLab / efficient-video-recognition
View on GitHub
☆184Aug 20, 2022Updated 3 years ago
sallymmx / ActionCLIP
View on GitHub
This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"
☆615Dec 6, 2023Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
allenai / mmc4
View on GitHub
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
☆953Mar 19, 2025Updated last year
linkedin / ControlLLM
View on GitHub
Control LLM
☆23Apr 6, 2025Updated last year
MCG-NJU / VideoMAE
View on GitHub
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,777Dec 8, 2023Updated 2 years ago
dzryk / clip-grams
View on GitHub
☆30Nov 25, 2021Updated 4 years ago
JD-P / cloob-latent-diffusion
View on GitHub
CLOOB Conditioned Latent Diffusion training and inference code
☆113Apr 15, 2022Updated 4 years ago
maxencefaldor / learned-qd
View on GitHub
Discovering Quality-Diversity Algorithms via Meta-Black-Box Optimization
☆25Dec 1, 2025Updated 7 months ago
ArrowLuo / CLIP4Clip
View on GitHub
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
☆1,030Apr 12, 2024Updated 2 years ago