cuiyuhao1996 / mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
☆10Jul 10, 2019Updated 6 years ago
Alternatives and similar repositories for mcan-vqa
Users who are interested in mcan-vqa are comparing it to the repositories listed below:
- Virtual news production using Tacotron2 and Wav2Lip☆11Nov 14, 2023Updated 2 years ago
- VexFS is a Linux kernel-native file system with built-in vector search and semantic memory. Designed for AI agents, RAG, and LLM workload…☆24Oct 19, 2025Updated 3 months ago
- Eliza Agent Weaver enables you to develop a set of Character files based on your own lore, and connects the narratives of multiple agents…☆10Dec 12, 2024Updated last year
- Automatic music transcription application written in Java☆12Jan 13, 2013Updated 13 years ago
- Use multi-agent reinforcement learning on mobile crowd sensing.☆12Sep 30, 2021Updated 4 years ago
- BanterBot: An OpenAI ChatGPT-powered chatbot with Azure Neural Voices. Supports multilingual speech-to-text and text-to-speech interactio…☆11Jan 23, 2026Updated 3 weeks ago
- Python implementation of the Louvain method for community detection☆12Jul 4, 2017Updated 8 years ago
- An open-source platform for building and deploying real-time, low-latency AI voice agents for call automation for marketing.☆18Oct 16, 2025Updated 3 months ago
- Fine-Grained Knowledge Fusion for Retrieval-Augmented Medical Visual Question☆11Jul 18, 2024Updated last year
- Official repository of Tapir Lab.'s Lip-Sync Method☆10Oct 3, 2023Updated 2 years ago
- Code for TCSVT paper "Exploring Spatio-Temporal Graph Convolution for Video-based Human-Object Interaction Recognition"☆12Mar 30, 2023Updated 2 years ago
- A project about Virtual Try-On. Lines of code ~5,200.☆10Jan 27, 2021Updated 5 years ago
- calvis: Chest, wAist and peLVIS circumference from 3D human Body meshes for Deep Learning.☆11May 15, 2025Updated 9 months ago
- Implementing an interactive AI avatar using Python, Blender and GPT☆11Dec 5, 2023Updated 2 years ago
- A proposed GPT chatbot for teachers that uses retrieval-augmentation to answer questions about their students.☆10Dec 7, 2024Updated last year
- Chatbot for NHS Medicines A-Z. Agentic Retrieval Augmented Generation utilising the OpenAI API, LangChain, and LangGraph to query a vecto…☆10Jun 24, 2024Updated last year
- Conversational speaker diarization using OpenAI language models (gpt-4) and OpenAI Whisper.☆14Aug 13, 2023Updated 2 years ago
- A Cyberpunk 2077 First-Person Multi Rig for Blender (4.0+)☆11Jan 10, 2026Updated last month
- >>PhysWikiQuiz<< - a Physics Question Generation and Interrogation System☆11Feb 25, 2023Updated 2 years ago
- real-time web visualizer for 3D gaussian splatting☆10Jan 31, 2025Updated last year
- Talk to your database as if you were chatting with a friend. Turn natural language into powerful SQL queries effortlessly, and get your a…☆10Nov 12, 2024Updated last year
- Multi-tenant RAG API powered by LightRAG/RAG-Anything. Auto-selects best parser (DeepSeek-OCR/MinerU/Docling) via complexity scoring☆24Dec 15, 2025Updated 2 months ago
- ☆12Feb 20, 2021Updated 4 years ago
- ☆11May 2, 2022Updated 3 years ago
- ☆10Apr 22, 2021Updated 4 years ago
- A high-performance, distributed memory management system for LLM agents built with LangGraph, LangChain, Ray, and vLLM. Features multi-la…☆11Apr 23, 2025Updated 9 months ago
- Agent building tools via block diagram UI☆12Dec 31, 2025Updated last month
- end-to-end automated video generation pipeline designed to create engaging, TikTok-style viral short videos using AI.☆20Jun 7, 2025Updated 8 months ago
- ☆15Oct 10, 2023Updated 2 years ago
- ☆14Apr 14, 2021Updated 4 years ago
- ☆12Sep 11, 2020Updated 5 years ago
- Python app to sync Video Files to the beat of a song☆12Aug 5, 2019Updated 6 years ago
- Implementation for WatchYourMouth: Silent Speech Recognition with Depth Sensing presented at CHI 2024☆15Oct 6, 2025Updated 4 months ago
- An SMPL body model implementation for NumPy, PyTorch and TensorFlow☆10Apr 17, 2023Updated 2 years ago
- gpt-4 support + Chinese + video + voice conversation☆11Feb 18, 2024Updated last year
- A PyTorch port of the Neural 3D Mesh Renderer☆12Jul 27, 2022Updated 3 years ago
- Create Assets from Video. Transform your video into a professional production package. Automated shot lists, color scripts, screenplays, …☆42Dec 6, 2025Updated 2 months ago
- Human Pose Estimation in Real-World Metric Coordinates☆12Jul 6, 2023Updated 2 years ago
- Easily create video datasets with auto-captioning for Hunyuan-Video LoRA finetuning☆13Apr 2, 2025Updated 10 months ago