A suite of multimodal language models that are powerful and efficient
☆19Jan 13, 2025Updated last year
Alternatives and similar repositories for OmChat
Users who are interested in OmChat are comparing it to the libraries listed below.
- A collection of strong multimodal models for building multimodal AGI agents☆45Jul 9, 2024Updated last year
- A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)☆64Apr 10, 2026Updated 2 months ago
- Hand-built Llama, for personal learning☆16May 21, 2024Updated 2 years ago
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM☆11Jun 16, 2025Updated last year
- UFT: Unifying Supervised and Reinforcement Fine-Tuning☆31Jun 30, 2025Updated last year
- The official implementation of "Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models"☆17Mar 24, 2025Updated last year
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…☆15Apr 14, 2025Updated last year
- RefDrone: A Challenging Benchmark for Drone Scene Referring Expression Comprehension☆43Dec 23, 2025Updated 6 months ago
- [EMNLP 2023] Context Compression for Auto-regressive Transformers with Sentinel Tokens☆25Nov 6, 2023Updated 2 years ago
- PyTorch code for CVPR 2022 paper Unbiased Teacher v2 Semi-supervised Object Detection for Anchor-free and Anchor-based Detectors☆100Jul 27, 2022Updated 3 years ago
- The code implementation of Skill-MoE☆46May 22, 2026Updated last month
- Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"☆128Oct 2, 2025Updated 9 months ago
- This repo contains the source code for: Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs☆43Aug 14, 2024Updated last year
- Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone☆131Oct 10, 2023Updated 2 years ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆60May 28, 2024Updated 2 years ago
- A tiny search engine.☆13Sep 6, 2022Updated 3 years ago
- Keyformer proposes KV Cache reduction through key tokens identification and without the need for fine-tuning☆57Mar 26, 2024Updated 2 years ago
- Support website for the new edition of 《Redis 设计与实现》 (Redis Design and Implementation).☆12May 1, 2024Updated 2 years ago
- A tiny package supporting distributed computation of COCO metrics for PyTorch models.☆15Feb 28, 2023Updated 3 years ago
- Elastic Workplace Search Official Python Client☆10Aug 8, 2024Updated last year
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆87Nov 2, 2025Updated 8 months ago
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- [EMNLP 2025 Findings] Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models☆148Aug 21, 2025Updated 10 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆72Jun 1, 2025Updated last year
- MLOps Pipeline for Amazon Forecast written in AWS CDK☆11Apr 10, 2025Updated last year
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆68Nov 1, 2024Updated last year
- A native-Chinese, level-graded benchmark for code ability☆15Apr 11, 2024Updated 2 years ago
- Byzer VSCode Extension☆12Apr 19, 2023Updated 3 years ago
- Sample AutoML notebooks evolving towards MLOps☆11Feb 15, 2022Updated 4 years ago
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"☆14Jun 26, 2021Updated 5 years ago
- [ECCV 2024] Elysium: Exploring Object-level Perception in Videos via MLLM☆88Oct 25, 2024Updated last year
- A part of the course Mobile Application Development☆13Nov 30, 2021Updated 4 years ago
- Reader services website for 《Redis入门与实战》 (Redis: Getting Started and in Practice).☆15Aug 6, 2020Updated 5 years ago
- ☆10Aug 19, 2022Updated 3 years ago
- Slides and code from my talk at the Guangzhou Gopher meetup on September 9, 2017. Talk video: http://www.itdks.com/dakashuo/new/eventlist/detail/1262 ; other speakers' slides: https://githu…☆14Sep 12, 2017Updated 8 years ago
- Predict the number of deaths due to covid19 in the next two weeks☆11Oct 2, 2022Updated 3 years ago
- [ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆92Oct 15, 2024Updated last year
- Sample code for MLOps using Azure Machine Learning and GitHub☆13Jun 7, 2023Updated 3 years ago
- Data and Code for Program of Thoughts [TMLR 2023]☆318May 15, 2024Updated 2 years ago