huskydoge / CS2612-Programming-Languages-and-Compilers
SJTU | CS 2612, Programming Languages and Compilers, Fall 2023
☆10 Updated 10 months ago
Related projects
Alternatives and complementary repositories for CS2612-Programming-Languages-and-Compilers
- Shanghai Jiao Tong University 2023-2024, CS3601 Operating System ☆18 Updated 10 months ago
- Official implementation for BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way ☆18 Updated last month
- Papers and codes collection for customized, personalized and editable generative models ☆23 Updated last month
- The paper collections for the autoregressive models in vision. ☆229 Updated this week
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation ☆72 Updated 3 weeks ago
- Accepted by CVPR 2024 ☆28 Updated 6 months ago
- Official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition" ☆164 Updated 2 months ago
- The code for Fine-grained HBOE | AAAI 2024 (official version and optimized version). ☆16 Updated 7 months ago
- You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos. ☆240 Updated 5 months ago
- >>> Exceptions and interrupts + virtual memory page tables + branch prediction + TLB + Cache + Flash + VGA + uCore ☆15 Updated last year
- 📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models. ☆215 Updated 2 weeks ago
- A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems ☆197 Updated 2 months ago
- 🏆 See How Top MLLMs Understand Video Compositions. ☆14 Updated this week
- ☆19 Updated 2 months ago
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers" ☆225 Updated 10 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation ☆106 Updated last month
- Chat about anything on any video! ☆34 Updated last year
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training ☆162 Updated last month
- 🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)" ☆155 Updated 7 months ago
- ☆193 Updated 4 months ago
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning ☆9 Updated last month
- 🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio). ☆363 Updated last week
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models ☆143 Updated last month
- ☆21 Updated 6 months ago
- VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation ☆141 Updated 3 weeks ago
- Binding Touch to Everything: Learning Unified Multimodal Tactile Representations ☆24 Updated 8 months ago
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models ☆75 Updated 2 months ago
- This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024) ☆131 Updated 2 months ago
- [NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models?" ☆148 Updated last month