☆17Oct 22, 2024Updated last year
Alternatives and similar repositories for LMM-Engines
Users who are interested in LMM-Engines are comparing it to the libraries listed below
- Phys4DGen: A Physics-Driven Framework for Controllable and Efficient 4D Content Generation from a Single Image☆12May 10, 2025Updated 9 months ago
- Main repo for GIOROM☆18Sep 28, 2025Updated 5 months ago
- A dataset of 80 million constraint-preserving transformations of CAD sketches☆13Nov 22, 2024Updated last year
- ☆18Jan 3, 2025Updated last year
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆19Nov 12, 2024Updated last year
- ☆50Jun 7, 2025Updated 8 months ago
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024☆22Dec 9, 2024Updated last year
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Oct 14, 2024Updated last year
- Code for our ACL '23 paper titled "Grokking of Hierarchical Structure in Vanilla Transformers"☆24Oct 8, 2023Updated 2 years ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- [SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Di…☆62Nov 7, 2024Updated last year
- I learn about and explain quantization☆26Apr 19, 2024Updated last year
- [arXiv, 2024] Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection☆34Jul 2, 2025Updated 8 months ago
- PyTorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Mar 22, 2024Updated last year
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆28Oct 28, 2024Updated last year
- PyTorch implementation of same-family Gaussian mixture models with guardrails. Features separable parameter optimization and singularity …☆26May 31, 2025Updated 9 months ago
- WONDERBREAD benchmark + dataset for BPM tasks☆34Jul 30, 2025Updated 7 months ago
- [EMNLP 2024 Findings] Benchmarking Language Model Agents for Data-Driven Science☆34Oct 25, 2024Updated last year
- ☆30May 19, 2024Updated last year
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Feb 26, 2025Updated last year
- ☆17Sep 1, 2024Updated last year
- ☆37Mar 17, 2025Updated 11 months ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- The official implementation of 《MLLMs-Augmented Visual-Language Representation Learning》☆31Mar 12, 2024Updated last year
- A collection of code (or links) for awesome Blender scripts for 3D content creation.☆30Aug 7, 2024Updated last year
- ☆37Nov 8, 2024Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Jun 10, 2024Updated last year
- Agent-based LLM modeling of mechanics problems☆39Feb 10, 2024Updated 2 years ago
- ☆13Apr 27, 2021Updated 4 years ago
- Concurrency library☆17Oct 13, 2024Updated last year
- ☆11Dec 23, 2024Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- This repository is the official implementation of Topology-Informed Graph Transformer (Choi et al., GRaM Workshop at ICML 2024).☆12Dec 28, 2024Updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users☆246Nov 3, 2024Updated last year
- [CVPR 2024] Robust Self-calibration of Focal Lengths from the Fundamental Matrix☆46Jan 1, 2025Updated last year
- Official implementation of paper "Efficient Tuning and Inference for Large Language Models on Textual Graphs"☆38Jun 24, 2024Updated last year
- Concurrent data extraction from unstructured text and images using AI models.☆18Aug 10, 2025Updated 6 months ago
- [AAAI2024] An official PyTorch implementation of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆11Nov 27, 2022Updated 3 years ago