☆16Feb 3, 2026Updated 3 months ago
Alternatives and similar repositories for IT5007-Lecture5
Users that are interested in IT5007-Lecture5 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RLHF for Video Diffusion Models☆26Jul 30, 2025Updated 9 months ago
- Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022)☆23Nov 20, 2023Updated 2 years ago
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆36Feb 15, 2024Updated 2 years ago
- ☆40Sep 12, 2025Updated 7 months ago
- The offical implemention of JM3D.☆31Apr 8, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Project Asteria: A Naïve Introductory to Advanced Mathematics and Theoretical Physics for Gaokao Students☆44Mar 27, 2026Updated last month
- Acoustic impulse response generation using diffusion models☆76Oct 3, 2023Updated 2 years ago
- This project provides a bridge for communication between the autonomous driving platform Apollo and Carla simulator. Receiving data from …☆73Feb 20, 2024Updated 2 years ago
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).☆58Apr 2, 2025Updated last year
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆84Jan 5, 2026Updated 4 months ago
- 清华大学软件学院攻略资料☆71Sep 7, 2021Updated 4 years ago
- kaggle比赛—otto多目标推荐系统源代码,单模型分数0.594,LB排名30左右☆89Mar 16, 2023Updated 3 years ago
- [CVPR 2023]Official Pytorch code for paper "Prototype-based Embedding Network for Scene Graph Generation"☆61Jun 8, 2023Updated 2 years ago
- ☆74Mar 29, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acro…☆119Feb 10, 2026Updated 3 months ago
- Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022)☆164Jan 20, 2024Updated 2 years ago
- [ICCV 2021] Official code for "Learning to Generate Scene Graph from Natural Language Supervision"☆100Apr 4, 2023Updated 3 years ago
- This is the code of ECCV 2022 (Oral) paper "Fine-Grained Scene Graph Generation with Data Transfer".☆103Jan 24, 2023Updated 3 years ago
- A Large-scale Multimodal Dataset for recommender System☆183Mar 19, 2025Updated last year
- ☆147Apr 16, 2025Updated last year
- Template for a compact LaTeX Cheatsheet I made some years ago.☆233Sep 5, 2018Updated 7 years ago
- ☆173Sep 17, 2020Updated 5 years ago
- SceneFun3D ToolKit☆175Apr 17, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for CVPR25 paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"☆159Jun 23, 2025Updated 10 months ago
- 清华大学学位论文Word模板。A Word thesis template for Tsinghua University.☆225Mar 22, 2023Updated 3 years ago
- Resource, Evaluation and Detection Papers for ChatGPT☆455Mar 21, 2024Updated 2 years ago
- Integration of AutoWare AV software with the CARLA simulator☆283Jun 10, 2024Updated last year
- Video Chain of Thought, Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"☆181Feb 25, 2025Updated last year
- NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)☆190Aug 2, 2025Updated 9 months ago
- GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.☆896Mar 20, 2026Updated last month
- [NeurIPS 2024 & TPAMI 2026] Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers☆211Apr 12, 2026Updated 3 weeks ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆212Aug 5, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 使用预训练语言模型ALBERT做中文NER☆478Jan 13, 2021Updated 5 years ago
- VisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement Learning☆337Feb 9, 2026Updated 3 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆371Apr 3, 2026Updated last month
- Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.☆268Oct 13, 2023Updated 2 years ago
- AutoDriving-Planning-Control-Algorithm-Simulation-Carla☆318Sep 1, 2023Updated 2 years ago
- This repository is an official implementation of ADAPT: Action-aware Driving Caption Transformer, accepted by ICRA 2023.☆422Jun 11, 2024Updated last year
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆425Aug 26, 2025Updated 8 months ago