Deep Modular Co-Attention Networks for Visual Question Answering
☆10Jul 10, 2019Updated 6 years ago
Alternatives and similar repositories for mcan-vqa
Users that are interested in mcan-vqa are comparing it to the libraries listed below
- Virtual news production using Tacotron2 and Wav2Lip☆11Nov 14, 2023Updated 2 years ago
- Use multi-agent reinforcement learning for mobile crowd sensing.☆12Sep 30, 2021Updated 4 years ago
- Eliza Agent Weaver enables you to develop a set of Character files based on your own lore, and connects the narratives of multiple agents…☆10Dec 12, 2024Updated last year
- Fine-Grained Knowledge Fusion for Retrieval-Augmented Medical Visual Question☆11Jul 18, 2024Updated last year
- Python implementation of the Louvain method for community detection☆12Jul 4, 2017Updated 8 years ago
- BanterBot: An OpenAI ChatGPT-powered chatbot with Azure Neural Voices. Supports multilingual speech-to-text and text-to-speech interactio…☆11Jan 23, 2026Updated last month
- An open-source platform for building and deploying real-time, low-latency AI voice agents for call automation for marketing.☆18Oct 16, 2025Updated 4 months ago
- automatic music transcription application written in java☆12Jan 13, 2013Updated 13 years ago
- end-to-end automated video generation pipeline designed to create engaging, TikTok-style viral short videos using AI.☆20Jun 7, 2025Updated 9 months ago
- ☆12Feb 20, 2021Updated 5 years ago
- >>PhysWikiQuiz<< - a Physics Question Generation and Interrogation System☆11Feb 25, 2023Updated 3 years ago
- Implementing an interactive AI avatar using Python, Blender and GPT☆11Dec 5, 2023Updated 2 years ago
- Agent building tools via block diagram UI☆12Dec 31, 2025Updated 2 months ago
- calvis: Chest, wAist and peLVIS circumference from 3D human Body meshes for Deep Learning.☆11May 15, 2025Updated 9 months ago
- ☆11May 2, 2022Updated 3 years ago
- real-time web visualizer for 3D gaussian splatting☆10Jan 31, 2025Updated last year
- Chatbot for NHS Medicines A-Z. Agentic Retrieval Augmented Generation utilising the OpenAI API, LangChain, and LangGraph to query a vecto…☆10Jun 24, 2024Updated last year
- A proposed GPT chatbot for teachers that uses retrieval-augmentation to answer questions about their students.☆10Dec 7, 2024Updated last year
- VexFS is a Linux kernel-native file system with built-in vector search and semantic memory. Designed for AI agents, RAG, and LLM workload…☆24Oct 19, 2025Updated 4 months ago
- Code for TCSVT paper "Exploring Spatio-Temporal Graph Convolution for Video-based Human-Object Interaction Recognition"☆12Mar 30, 2023Updated 2 years ago
- ☆10Apr 22, 2021Updated 4 years ago
- A Cyberpunk 2077 First-Person Multi Rig for Blender (4.0+)☆11Jan 10, 2026Updated last month
- Talk to your database as if you were chatting with a friend. Turn natural language into powerful SQL queries effortlessly, and get your a…☆10Nov 12, 2024Updated last year
- A project about Virtual Try-On. Lines of code ~5,200.☆10Jan 27, 2021Updated 5 years ago
- Conversational speaker diarization using OpenAI language models (gpt-4) and OpenAI Whisper.☆14Aug 13, 2023Updated 2 years ago
- Official repository of Tapir Lab.'s Lip-Sync Method☆10Oct 3, 2023Updated 2 years ago
- A high-performance, distributed memory management system for LLM agents built with LangGraph, LangChain, Ray, and vLLM. Features multi-la…☆11Apr 23, 2025Updated 10 months ago
- A non-slop skill creator for competent expert-level skills. Extract expertise through guided interviews or expert conversations, separate…☆23Dec 24, 2025Updated 2 months ago
- Back-office management system for shared electric vehicles☆12Sep 4, 2020Updated 5 years ago
- Easily create video datasets with auto-captioning for Hunyuan-Video LoRA finetuning☆14Apr 2, 2025Updated 11 months ago
- Python app to sync Video Files to the beat of a song☆12Aug 5, 2019Updated 6 years ago
- Code and data for our paper "High-Fidelity 3D Digital Human Creation from RGB-D Selfies".☆20Dec 30, 2024Updated last year
- ☆12Sep 11, 2020Updated 5 years ago
- (Platform: Douyin) Live-stream comment (danmaku) and gift monitoring data: PostgreSQL storage + backend management (golang gin gorm grpc) + UI (vite vue tailwindcss)☆29Jan 25, 2026Updated last month
- Implementation for WatchYourMouth: Silent Speech Recognition with Depth Sensing presented at CHI 2024☆16Oct 6, 2025Updated 5 months ago
- automatically creates videos on any topic you give by web scraping and image processing☆13Mar 2, 2025Updated last year
- ☆14Jun 16, 2023Updated 2 years ago
- Rendering SMPL using neural-mesh-render!!☆12Aug 6, 2020Updated 5 years ago
- Research on algorithms for garment perception, manipulation...☆12Sep 15, 2023Updated 2 years ago