☆14Apr 25, 2025Updated 10 months ago
Alternatives and similar repositories for FRAG
Users that are interested in FRAG are comparing it to the libraries listed below
Sorting:
- MR. Video: MapReduce is the Principle for Long Video Understanding☆31Apr 23, 2025Updated 10 months ago
- OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models☆64Feb 1, 2026Updated last month
- [ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"☆71Jan 13, 2026Updated 2 months ago
- Code for Unsupervised interpretation of instructional recipes☆10Jun 30, 2018Updated 7 years ago
- ☆23Jul 20, 2025Updated 8 months ago
- Conda build scripts for OpenCV 2.x☆10Jun 16, 2016Updated 9 years ago
- ☆27May 13, 2025Updated 10 months ago
- Few-shot Bayesian Imitation Learning with Policies as Logic over Programs☆20Oct 19, 2025Updated 5 months ago
- A Massive Multi-Discipline Lecture Understanding Benchmark☆33Nov 1, 2025Updated 4 months ago
- [ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics☆38Sep 10, 2025Updated 6 months ago
- ☆11Jun 21, 2025Updated 9 months ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆37Nov 27, 2024Updated last year
- [CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models☆103Nov 22, 2025Updated 4 months ago
- Symbolic computer vision tool☆21Jan 8, 2019Updated 7 years ago
- ☆13May 17, 2025Updated 10 months ago
- ☆24May 23, 2025Updated 9 months ago
- [TPAMI2025] BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors☆15Apr 23, 2025Updated 10 months ago
- An adaptive sampling framework for Reinforce-style LLM post training.☆92Nov 29, 2025Updated 3 months ago
- A mini-app to solve the heat conduction equation☆15Jul 1, 2020Updated 5 years ago
- Advances in recent large vision language models (LVLMs)☆15Sep 23, 2024Updated last year
- (Pattern Recognition 2025) Towards Trustworthy Dataset Distillation☆14Dec 8, 2024Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆31Oct 9, 2025Updated 5 months ago
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"☆61Dec 26, 2025Updated 2 months ago
- ☆19Jun 29, 2025Updated 8 months ago
- An array of noisy, reactive little computers.☆37May 24, 2025Updated 9 months ago
- [ICML 2025] Official Implementation of Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots☆30May 28, 2025Updated 9 months ago
- ☆61Mar 7, 2026Updated 2 weeks ago
- Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement☆17Nov 11, 2024Updated last year
- ☆10Apr 16, 2024Updated last year
- Java web application backed by the Ethereum-Blockchain network. Powered by RESTful web services (JAX-RS && Spring Boot) , Docker, Kuberne…☆14Feb 19, 2019Updated 7 years ago
- RelNN is a novel first-order deep neural model for relational learning.☆28Nov 15, 2017Updated 8 years ago
- mit6.830 all-pass☆12Mar 25, 2022Updated 3 years ago
- Official Codebase for "Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers"☆25Jun 7, 2025Updated 9 months ago
- ☆11Oct 13, 2024Updated last year
- [AAAI 2025] HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Visio…☆45Apr 18, 2025Updated 11 months ago
- Creating Custom Action Recognition Model using TensorFlow (CNN + LSTM)☆12Feb 22, 2023Updated 3 years ago
- Code for the COG dataset and network☆44Oct 17, 2018Updated 7 years ago
- video captioning using 3DCNN and LSTM (pytorch)☆11Sep 26, 2019Updated 6 years ago
- [CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆16Oct 4, 2025Updated 5 months ago