D-Keqi / mtla
MTLA: Multi-head Temporal Latent Attention
☆759Updated last month
Alternatives and similar repositories for mtla
Users that are interested in mtla are comparing it to the libraries listed below
- We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for comple…☆1,101Updated last week
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆312Updated last week
- Efficient controlnet for DiTs☆382Updated 6 months ago
- A real-time interactive Omni Avatar built on LiveKit, which allows you to seamlessly integrate with any open source Avatar components (re…☆555Updated this week
- Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dep…☆573Updated 3 months ago
- ☆162Updated last year
- ☆515Updated 9 months ago
- [AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution☆352Updated this week
- ☆174Updated 2 months ago
- Tokenize The Virtual Agents Onchain☆241Updated 6 months ago
- GigaModels: A Comprehensive Repository and Platform for Multi-modal, Generative, and Perceptual Models☆69Updated last week
- Joint Semantic Detection and Dissemination Control of Phishing Attacks on Social Media via LLama-Based Modeling☆645Updated last month
- ☆386Updated 4 months ago
- ☆812Updated 4 months ago
- Dataset proposed in "A Benchmark and Frequency Compression Method for Infrared Few-Shot Object Detection"☆1,004Updated 8 months ago
- Group Expectation Policy Optimization for Heterogeneous Reinforcement Learning☆164Updated 2 weeks ago
- DeepWism R2 is a next-generation AGI system built on the T3CEDS framework (Thin-Thick-Thin Crowd Entropy Dynamics System), which redefine…☆1,020Updated 5 months ago
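- A small, elegant asynchronous-handling solution for Vue3 that makes complex async logic simple and makes repetitive boilerplate code a thing of the past☆518Updated 2 months ago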
- ☆422Updated 5 months ago
- ☆530Updated 10 months ago
- PVPAI LLM 🔥 The First Open-Source DeFAI Large Language Model Powered by DeepSeek.☆302Updated 10 months ago
- [NeurIPS'2025] Official repository for "LiveStar: Live Streaming Assistant for Real-World Online Video Understanding"☆91Updated last week
- ☆601Updated 3 weeks ago
- A multiscale multimodal large language model for radiology report generation (RRG) tasks☆267Updated 3 months ago
- Science-Star: A Platform for Building, Extending, and Experimenting with Scientific Agents.☆737Updated last month
- UpTop is a BNB Chain-based liquidity protocol that allows users to unilaterally add BNB to liquidity pools, earn high yields, and support…☆75Updated 5 months ago
- OmniAgent Framework is an advanced, modular AI orchestration system that transforms Web3 development by seamlessly integrating artificial…☆320Updated 10 months ago
- Calendar software rewrite☆453Updated 8 months ago
- F²-Gen - An open-source Financial Fraud Detection Data Generator Web Application☆367Updated last month
- ☆894Updated last month