asgaardlab / CLIPxGamePhysics
This repository will contain code for the paper "CLIP meets GamePhysics: Towards bug identification in gameplay videos using zero-shot transfer learning"
☆24Updated 8 months ago
Related projects: ⓘ
- ☆38Updated this week
- JAX implementation ViT-VQGAN☆77Updated 2 years ago
- ☆40Updated this week
- ☆27Updated 2 years ago
- FID computation in Jax/Flax.☆23Updated 2 months ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆64Updated 2 years ago
- Script and models for clustering LAION-400m CLIP embeddings.☆25Updated 2 years ago
- ☆24Updated this week
- Implementation of a holodeck, written in Pytorch☆17Updated 10 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆53Updated 4 months ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆47Updated 2 years ago
- Inverts CLIP text embeds to image embeds and visualizes with deep-image-prior.☆35Updated 2 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆56Updated 2 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆90Updated 2 years ago
- Official repository for MaGNET, ICLR 2022☆26Updated last year
- Finetune the 1.4B latent diffusion text2img-large checkpoint from CompVis using deepspeed. (work-in-progress)☆36Updated 2 years ago
- ☆41Updated last month
- OpenAI CLIP based image generator with complex config file controlled transformation and training pipelines☆18Updated 2 years ago
- Pedagogical codebase for a simplified score-based generative model design, with training loop☆39Updated 3 years ago
- ☆14Updated 3 years ago
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆52Updated last year
- Official repository for the paper "End-to-End Visual Editing with a Generatively Pre-Trained Artist", which is accepted at ECCV 2022. Her…☆29Updated last year
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago
- Latent Diffusion Language Models☆66Updated last year
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP☆39Updated last year
- codebase for the SIMAT dataset and evaluation☆38Updated 2 years ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆70Updated 2 years ago
- A Versatile Face Encoder for Zero-Shot Diffusion Model Personalization☆18Updated this week
- ☆23Updated this week
- CogView2 for GPUs with 12/16/24GB vRAM☆16Updated 2 years ago
- ☆14Updated this week