LAION-AI/video-clip

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LAION-AI/video-clip)

LAION-AI / video-clip

Let's make a video clip

☆97

Alternatives and similar repositories for video-clip

Users that are interested in video-clip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LAION-AI / temporal-embedding-aggregation
View on GitHub
Aggregating embeddings over time
☆32Jan 19, 2023Updated 3 years ago
iejMac / clip-video-encode
View on GitHub
Easily compute clip embeddings from video frames
☆149Oct 31, 2023Updated 2 years ago
TheoCoombes / ClipCap
View on GitHub
Using pretrained encoder and language models to generate captions from multimedia inputs.
☆101Mar 11, 2023Updated 3 years ago
google-research-datasets / videoCC-data
View on GitHub
VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automa…
☆78Dec 5, 2022Updated 3 years ago
LAION-AI / laion-dreams
View on GitHub
Aim for the moon. If you miss, you may hit a star.
☆168Feb 14, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
rom1504 / python-template
View on GitHub
Simple python template
☆44Apr 25, 2024Updated 2 years ago
rom1504 / embedding-reader
View on GitHub
Efficiently read embedding in streaming from any filesystem
☆106Aug 9, 2025Updated 11 months ago
Sense-GVT / BigPretrain
View on GitHub
A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)
☆15Oct 18, 2021Updated 4 years ago
iejMac / video2dataset
View on GitHub
Easily create large video dataset from video urls
☆661Jul 30, 2024Updated last year
arijitray1993 / COLA
View on GitHub
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆25May 14, 2026Updated 2 months ago
LAION-AI / General-GPT
View on GitHub
☆65Oct 4, 2023Updated 2 years ago
pbaylies / clustering-laion400m
View on GitHub
Script and models for clustering LAION-400m CLIP embeddings.
☆26Jan 10, 2022Updated 4 years ago
klauscc / VindLU
View on GitHub
☆108Dec 23, 2022Updated 3 years ago
m-bain / webvid
View on GitHub
Large-scale text-video dataset. 10 million captioned short videos.
☆685Aug 14, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
donglixp / ICL_PaperList
View on GitHub
Paper List for In-context Learning 🌷
☆19Jan 3, 2023Updated 3 years ago
The-Swarm-Corporation / AgentGym
View on GitHub
A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1
☆24Oct 13, 2025Updated 9 months ago
antoyang / FrozenBiLM
View on GitHub
[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
☆159Dec 9, 2024Updated last year
borisdayma / clip-jax
View on GitHub
Train vision models using JAX and 🤗 transformers
☆103Dec 14, 2025Updated 7 months ago
rowanz / merlot_reserve
View on GitHub
Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"
☆146Jun 1, 2022Updated 4 years ago
antoine77340 / howto100m
View on GitHub
Code for the HowTo100M paper
☆303Mar 10, 2020Updated 6 years ago
cair / rl
View on GitHub
☆13Sep 15, 2021Updated 4 years ago
CompVis / imagebart
View on GitHub
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
☆126Mar 14, 2022Updated 4 years ago
TheoCoombes / crawlingathome
View on GitHub
A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
☆33Mar 21, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
muzairkhattak / ViFi-CLIP
View on GitHub
[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".
☆309Apr 3, 2024Updated 2 years ago
fpv-iplab / stillfast
View on GitHub
Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…
☆14Apr 11, 2023Updated 3 years ago
jimimvp / torch_rl
View on GitHub
Reinforcement learning library for PyTorch.
☆11Jun 15, 2018Updated 8 years ago
ohmydroid / AugShuffleNet-Plus
View on GitHub
Faster version of AugShuffleNet without channel shuffle, computes partially, crossovers swiftly
☆10Feb 17, 2025Updated last year
zinengtang / DeCEMBERT
View on GitHub
Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)
☆17Jan 12, 2023Updated 3 years ago
google-research / scenic
View on GitHub
Scenic: A Jax Library for Computer Vision Research and Beyond
☆3,819Updated this week
BAAI-DCAI / Visual-Instruction-Tuning
View on GitHub
SVIT: Scaling up Visual Instruction Tuning
☆167Jun 20, 2024Updated 2 years ago
cbschaff / benchmark-rrc
View on GitHub
☆15Aug 3, 2021Updated 4 years ago
dbash / visda2022-org
View on GitHub
[NeurIPS 2022] VisDA 2022 Challenge Toolkit
☆20Oct 1, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lxj616 / make-a-stable-diffusion-video
View on GitHub
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch - fork with video pseudo3d
☆99Mar 5, 2023Updated 3 years ago
MCG-NJU / VideoMAE
View on GitHub
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,775Dec 8, 2023Updated 2 years ago
ali-design / GenRep
View on GitHub
☆87Mar 24, 2022Updated 4 years ago
CasualGANPapers / Make-A-Scene
View on GitHub
Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
☆335Aug 9, 2022Updated 3 years ago
webaverse / LJSpeechTools
View on GitHub
Tools to isolate speaker and transcribe unstructured audio clips
☆11Dec 4, 2022Updated 3 years ago
m-bain / frozen-in-time
View on GitHub
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
☆376May 19, 2022Updated 4 years ago
R34LUS3R / GPT3-cli
View on GitHub
A python script which lets you ask questions OPEN AI chat GPT-3 using OPENAI API
☆13Jan 31, 2023Updated 3 years ago