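A multimodal large language model (MLLM) that accepts text, image, video, and other multimodal inputs, with strong understanding, reasoning, and generation capabilities.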
☆23Mar 19, 2025Updated last year
Alternatives and similar repositories for MUG-U
Users that are interested in MUG-U are comparing it to the libraries listed below.
- ☆11May 24, 2024Updated 2 years ago
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆32Jul 16, 2025Updated 10 months ago
- Resilient fork of OpenClaw Browser Relay extension — auto-reconnect, state persistence, keepalive☆27Feb 21, 2026Updated 3 months ago
- The official implementation of InfoRM [NeurIPS 2024].☆15Oct 25, 2025Updated 7 months ago
- ☆38Jan 9, 2026Updated 4 months ago
- ☆24Jun 18, 2025Updated 11 months ago
- semantically labels kinect pointclouds☆22Aug 30, 2013Updated 12 years ago
- ☆15Apr 25, 2023Updated 3 years ago
- Spatial Aptitude Training for Multimodal Language Models☆32Feb 8, 2026Updated 3 months ago
- Source code reading notes for simpledet and mmdetection☆27May 21, 2019Updated 7 years ago
- Video Frame Interpolation without Temporal Priors (a general method for blurry video interpolation)☆35Jul 21, 2021Updated 4 years ago
- Incremental Learning in Person Re-Identification☆17Jun 21, 2022Updated 3 years ago
- PyTorch implementation of Delving Deep into Spatial Pooling for Squeeze-and-Excitation Networks.☆17Dec 10, 2019Updated 6 years ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27May 11, 2026Updated 2 weeks ago
- Channel Equilibrium Networks for Learning Deep Representation, ICML2020☆22Jul 28, 2020Updated 5 years ago
- Instance-Level Salient Object Detection, Computer Vision and Image Understanding (CVIU), 2021.☆12Apr 23, 2021Updated 5 years ago
- Code for WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge☆17Dec 31, 2024Updated last year
- PyTorch implementation of A Lightweight Encoder-Decoder Path for Deep Residual Networks.☆19Dec 10, 2019Updated 6 years ago
- Tool to parse wiki tables from the HTML dump of Wikipedia☆11Jun 12, 2022Updated 3 years ago
- Code for "Revisiting Batch Norm Initialization".☆12Jul 14, 2022Updated 3 years ago
- ☆10Apr 10, 2019Updated 7 years ago
- Rectified Convolution☆45Oct 16, 2022Updated 3 years ago
- ☆12Feb 27, 2025Updated last year
- Paper list of network architecture search (NAS)☆21Nov 18, 2018Updated 7 years ago
- A duck-shooting game written in Unity, including the core duck-shooting gameplay, resource management, and a mobile joystick control module.☆11Apr 13, 2022Updated 4 years ago
- Evaluation codes and data for GenEval2☆71Jan 8, 2026Updated 4 months ago
- 🚀enhanced GRPO with more verifiable rewards and real-time evaluators☆37Jan 27, 2026Updated 3 months ago
- ☆18May 27, 2021Updated 4 years ago
- The official codes for Fast Monte Carlo Rendering via Multi-Resolution Sampling☆15Dec 2, 2021Updated 4 years ago
- a fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model.☆38Apr 7, 2025Updated last year
- ECCV24, NeurIPS24, CVPR26*2, Benchmarking Generalized Out-of-Distribution Detection with Vision-Language Models☆33Apr 12, 2026Updated last month
- SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts☆65Dec 1, 2025Updated 5 months ago
- An unofficial PyTorch implementation of "Long-term Forecasting with TiDE: Time-series Dense Encoder" with sample application code☆20May 9, 2023Updated 3 years ago
- Automatically generates captions for an image using Image processing and NLP. Model was trained on Flickr30K dataset.☆11Jun 11, 2020Updated 5 years ago
- Multispectral Imaging for Fine-Grained Recognition of Powders on Complex Backgrounds (CVPR 2019)☆23Oct 10, 2021Updated 4 years ago
- ☆68May 2, 2026Updated 3 weeks ago
- Laplacian-Pyramid-Reconstruction-and-Refinement-for-Semantic-Segmentation in Pytorch☆12Nov 3, 2018Updated 7 years ago
- Support finetuning GLM4v with zero2☆16Jun 29, 2024Updated last year
- Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning☆28Oct 30, 2024Updated last year