MetabrainAGI / Awaker
☆21 · Updated this week
Related projects
Alternatives and complementary repositories for Awaker
- Implementation of SmoothCache, a project aimed at speeding up Diffusion Transformer (DiT) based GenAI models with error-guided caching. ☆18 · Updated this week
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents" ☆38 · Updated 7 months ago
- Official implementation of MagicFace: Training-free Universal-Style Human Image Customized Synthesis. ☆51 · Updated this week
- PyTorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models ☆28 · Updated 8 months ago
- ☆59 · Updated last month
- From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging ☆56 · Updated last month
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc. ☆36 · Updated 2 months ago
- Code for the paper "Harnessing Webpage UIs for Text-Rich Visual Understanding" ☆38 · Updated last month
- Modern Stable Diffusion models family - Fluently ☆26 · Updated 5 months ago
- ☆27 · Updated 3 months ago
- ☆30 · Updated 11 months ago
- A repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very … ☆16 · Updated last month
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks ☆25 · Updated last month
- Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D… ☆31 · Updated this week
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆63 · Updated last month
- Code release for "SegLLM: Multi-round Reasoning Segmentation" ☆35 · Updated 2 weeks ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023) ☆16 · Updated 2 months ago
- Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs