traveler-framework/TraveLER

[EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering

☆18

Alternatives and similar repositories for TraveLER

Users who are interested in TraveLER are comparing it to the repositories listed below.

3DAgentWorld / LLM-Game-Agent
☆24 · Oct 13, 2024 · Updated last year
sanjayss34 / codevqa
☆83 · Jul 16, 2023 · Updated 3 years ago
aurooj / SHG-VQA
Learning Situation Hyper-Graphs for Video Question Answering
☆23 · Feb 16, 2024 · Updated 2 years ago
nkotech / EcoSteno-Firmware
Firmware for the EcoSteno stenographer keyboard
☆12 · Feb 17, 2023 · Updated 3 years ago
zhengrongz / AoTD
[CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation".
☆58 · Updated this week
CreativeInquiry / handsfree-js
Recovered copy of Oz Ramos' Handsfree.js Project
☆14 · Mar 7, 2019 · Updated 7 years ago
asuprem / ODIN
☆11 · Sep 1, 2020 · Updated 5 years ago
lih627 / MLMSNet
Lightweight Multi-Level Multi-Scale Feature Fusion Network for Semantic Segmentation
☆11 · May 31, 2021 · Updated 5 years ago
callaunchpad / pytorch-optimem
A Python package for reducing memory footprint of PyTorch models
☆15 · May 3, 2023 · Updated 3 years ago
amirgamil / curius-search
Search engine of my Curius data
☆16 · Apr 10, 2022 · Updated 4 years ago
Karine-Huang / GenMAC
[AAAI 2026] GenMAC for Compositional Text-to-Video Generation
☆35 · Jan 10, 2026 · Updated 6 months ago
lfedgeai / shifu
Kubernetes-native IoT gateway
☆14 · Jul 21, 2025 · Updated last year
RobertCsordas / switchhead
☆16 · Jun 11, 2025 · Updated last year
lfedgeai / eda
Data on-Prem, Code on-the-Fly
☆15 · Nov 22, 2025 · Updated 8 months ago
wangsen99 / LMEE
(CVPR 26) Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration
☆36 · Mar 8, 2026 · Updated 4 months ago
alipay / PC2-NoiseofWeb
Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …
☆16 · Nov 20, 2025 · Updated 8 months ago
wenh18 / AdaptiveNet_artifact
☆16 · Jul 25, 2023 · Updated 3 years ago
lfedgeai / yomo
🦖 Stateful Serverless Framework for Edge AI Infra
☆15 · Sep 3, 2025 · Updated 10 months ago
deep-spin / Infinite-Video
∞-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation
☆21 · Feb 14, 2025 · Updated last year
gtdong-ustc / tof-mpi-remove
ECCV2020_Spatial Hierarchy Aware Residual Pyramid Network for Time-of-Flight Depth Denoising
☆12 · Sep 24, 2020 · Updated 5 years ago
WangFei-2019 / SNARE
Project for SNARE benchmark
☆11 · Jun 5, 2024 · Updated 2 years ago
simarmehta / chessAutomation_CV
This repository implements computer vision for real-time chessboard detection and piece recognition. Using OpenCV and Numpy, the system p…
☆15 · Sep 24, 2024 · Updated last year
Yui010206 / SeViLA
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
☆197 · Jan 14, 2024 · Updated 2 years ago
NeuSpeech / NeuGaze
☆14 · Aug 29, 2025 · Updated 11 months ago
allen4747 / Ferret
This is the official implementation for the paper: Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models
☆19 · Sep 11, 2024 · Updated last year
quanshr / DMoERM
[ACL2024 Findings] DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling
☆17 · Jun 6, 2024 · Updated 2 years ago
wxh1996 / VideoAgent
☆150 · Apr 16, 2025 · Updated last year
idanshen / Value-Augmented-Sampling
☆20 · May 16, 2024 · Updated 2 years ago
HE-diffusion / HE-diffusion
☆17 · Apr 30, 2024 · Updated 2 years ago
To-Data-Beyond / Multimodal-RAG
Hands-On Tutorial on Building Multimodal RAG Systems
☆14 · Apr 10, 2025 · Updated last year
j10labs / wandview
Mobile Viewer for W&B, built on top of Flutter.
☆41 · Mar 2, 2024 · Updated 2 years ago
SEU-VIPGroup / Understanding_Vision_Tasks
☆13 · Feb 2, 2025 · Updated last year
doc-doc / NExT-OE
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
☆30 · Jul 18, 2023 · Updated 3 years ago
tctco / DCCCSlicer
Calculate Centiloid / CenTauR and other imaging biomarkers in seconds: An open-source 3D Slicer extension and agent-friendly tool for ult…
☆20 · Jul 24, 2026 · Updated last week
liyingxuan1012 / zeroshot-speaker-prediction
Official repository of "Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion" (ACMMM 2024)
☆16 · Oct 31, 2024 · Updated last year
z-x-yang / DoraemonGPT
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
☆91 · Jun 19, 2026 · Updated last month
WissingChen / CRA-GQA
The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"
☆52 · Apr 27, 2025 · Updated last year
apple / ml-mmtoolsandbox
MM-ToolSandBox: A Unified Framework for Evaluating Visual Tool-Calling Agents
☆20 · Jul 14, 2026 · Updated 2 weeks ago
zjucsq / PLA
[ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision
☆12 · Sep 17, 2023 · Updated 2 years ago