Let's make a video clip
☆96Jul 29, 2022Updated 3 years ago
Alternatives and similar repositories for video-clip
Users that are interested in video-clip are comparing it to the libraries listed below
Sorting:
- Aggregating embeddings over time☆32Jan 19, 2023Updated 3 years ago
- Easily compute clip embeddings from video frames☆145Oct 31, 2023Updated 2 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆100Mar 11, 2023Updated 2 years ago
- Aim for the moon. If you miss, you may hit a star.☆164Feb 14, 2023Updated 3 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- Script and models for clustering LAION-400m CLIP embeddings.☆26Jan 10, 2022Updated 4 years ago
- ☆10Nov 11, 2019Updated 6 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- Efficiently read embedding in streaming from any filesystem☆105Aug 9, 2025Updated 6 months ago
- ☆65Oct 4, 2023Updated 2 years ago
- Large-scale text-video dataset. 10 million captioned short videos.☆677Aug 14, 2024Updated last year
- Unofficial Pytorch implementation of Style GAN paper☆17Oct 11, 2019Updated 6 years ago
- Simple python template☆43Apr 25, 2024Updated last year
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆146Jun 1, 2022Updated 3 years ago
- Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Jan 12, 2023Updated 3 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆19Mar 10, 2021Updated 4 years ago
- ☆17May 28, 2018Updated 7 years ago
- Utilities for sequential processing of tar files.☆24Feb 16, 2022Updated 4 years ago
- ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis☆126Mar 14, 2022Updated 3 years ago
- A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1☆24Oct 13, 2025Updated 4 months ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Jul 28, 2018Updated 7 years ago
- ☆22Nov 8, 2021Updated 4 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆50Dec 18, 2023Updated 2 years ago
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆337Aug 9, 2022Updated 3 years ago
- ☆20Mar 14, 2021Updated 4 years ago
- Code for ICML2021 paper 'Commutative Lie Group VAE for Disentanglement Learning'.☆23Nov 2, 2022Updated 3 years ago
- ☆87Mar 24, 2022Updated 3 years ago
- Big-Interleaved-Dataset☆58Jan 21, 2023Updated 3 years ago
- ☆129Jul 30, 2024Updated last year
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md☆25Dec 22, 2022Updated 3 years ago
- ☆23Oct 4, 2021Updated 4 years ago
- ☆28Jan 11, 2021Updated 5 years ago
- This repository will contain code for the paper "CLIP meets GamePhysics: Towards bug identification in gameplay videos using zero-shot tr…☆26Dec 23, 2023Updated 2 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Dec 22, 2020Updated 5 years ago
- Code release for DriveGAN (CVPR 2021)☆98Nov 11, 2021Updated 4 years ago
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,772Updated this week
- [NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models☆158Dec 9, 2024Updated last year
- Entity Abstraction in Visual Model-Based Reinforcement Learning☆57Jan 1, 2021Updated 5 years ago
- ☆24Jul 25, 2024Updated last year