AI4WA / OpenOmniFramework
Multimodal Open Source Framework for Conversational Agent Research and Development.
☆19Updated 2 months ago
Alternatives and similar repositories for OpenOmniFramework:
Users that are interested in OpenOmniFramework are comparing it to the libraries listed below
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆16Updated last year
- ☆13Updated 7 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated last month
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated 2 weeks ago
- ☆62Updated 2 weeks ago
- ☆16Updated last month
- ☆16Updated 6 months ago
- Measuring and Controlling Persona Drift in Language Model Dialogs☆17Updated last year
- ☆16Updated 2 months ago
- ☆24Updated 6 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 10 months ago
- Aioli: A unified optimization framework for language model data mixing☆23Updated 2 months ago
- RuleRAG: Rule-guided Retrieval-Augmented Generation with Language Models for Question Answering☆21Updated 5 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated this week
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆21Updated 8 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated this week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- ☆27Updated last month
- ☆13Updated 4 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆42Updated last month
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 2 months ago
- A Data Source for Reasoning Embodied Agents☆19Updated last year
- Online Preference Alignment for Language Models via Count-based Exploration☆14Updated 3 months ago
- ☆20Updated 10 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- ☆15Updated last week
- Training hybrid models for dummies.☆20Updated 3 months ago
- This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs☆26Updated last month
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆32Updated 6 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 8 months ago