A collection of strong multimodal models for building multimodal AGI agents
☆44Jul 9, 2024Updated last year
Alternatives and similar repositories for OmModel
Users that are interested in OmModel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A suite of multimodal language models that are powerful and efficient☆17Jan 13, 2025Updated last year
- A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)☆62May 7, 2024Updated last year
- Reproducible Language Agent Research☆34Jun 25, 2025Updated 9 months ago
- ☆11Oct 31, 2024Updated last year
- Elastic Workplace Search Official Python Client☆10Aug 8, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆14Apr 25, 2025Updated 11 months ago
- Code for SIGDial 2019 Best Paper: Structured Fusion Networks for Dialog https://arxiv.org/abs/1907.10016☆30Aug 19, 2019Updated 6 years ago
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"☆13Jun 26, 2021Updated 4 years ago
- Dialog State Tracking Challenge viewer and tracker☆15Nov 19, 2016Updated 9 years ago
- This project uses ORB-SLAM3 for dense mapping, uses a Realsense D455 camera, and is tested on the ROS2-Humble version.☆37Sep 13, 2025Updated 6 months ago
- RefDrone: A Challenging Benchmark for Drone Scene Referring Expression Comprehension☆32Dec 23, 2025Updated 3 months ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Aug 17, 2023Updated 2 years ago
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videos☆23Jan 26, 2026Updated last month
- A neural text process python lib for context-based feature extraction on Seq-Tagging data.☆10Jul 27, 2018Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Tool capable to translate Scilab code into C code.☆12Nov 24, 2017Updated 8 years ago
- GRiT: A Generative Region-to-text Transformer for Object Understanding (ECCV2024)☆341Jan 8, 2024Updated 2 years ago
- PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation"☆22Jul 8, 2021Updated 4 years ago
- Awesome Chinese Corpus Datasets and Models.☆18Oct 28, 2019Updated 6 years ago
- Snow HLSL shader for URP.☆15Dec 15, 2020Updated 5 years ago
- [LDF File Parser]This C# code can be used for parsing .LDF LIN files which are the signal description files for LIN (Local Interconnect N…☆10Feb 23, 2018Updated 8 years ago
- PyTorch implementation of Gaussian word embeddings☆19Apr 7, 2018Updated 7 years ago
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection☆28Oct 12, 2021Updated 4 years ago
- Codes for arXiv paper "Semi-supervised Few-shot Atomic Action Recognition".☆18Jan 2, 2021Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs☆55Mar 9, 2025Updated last year
- baseline mode for the ObjectNet competition☆18Jan 13, 2021Updated 5 years ago
- ☆21Feb 29, 2024Updated 2 years ago
- 语雀 Yuque python SDK & Command line interface☆17Sep 11, 2019Updated 6 years ago
- ☆21Aug 8, 2024Updated last year
- Fitting stochastic blockmodels to graphs☆17Jul 8, 2016Updated 9 years ago
- Kong API Gateway Sidecar Image☆16Aug 6, 2018Updated 7 years ago
- Notes on Deep Reinforcement Learning for Natural Language Processing papers☆30Jul 17, 2017Updated 8 years ago
- ☆11Jul 7, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official implementation: Large Language Models are Interpretable Learners - Google☆13Jun 29, 2024Updated last year
- ☆32Jul 29, 2024Updated last year
- ☆32Mar 7, 2022Updated 4 years ago
- MR. Video: MapReduce is the Principle for Long Video Understanding☆31Apr 23, 2025Updated 11 months ago
- Comprehensive benchmark for video text understanding☆28Jun 4, 2025Updated 9 months ago
- Unsupervised Learning of Transferable Relational Graphs☆69Mar 20, 2019Updated 7 years ago
- Using business-level retrieval system (BM25) with Python in just a few lines.☆31Feb 3, 2023Updated 3 years ago