AI4WA / OpenOmniFrameworkLinks
Multimodal Open Source Framework for Conversational Agent Research and Development.
☆21Updated 10 months ago
Alternatives and similar repositories for OpenOmniFramework
Users that are interested in OpenOmniFramework are comparing it to the libraries listed below
Sorting:
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 9 months ago
- Verifiers for LLM Reinforcement Learning☆80Updated 8 months ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆47Updated 9 months ago
- Official Repository for Task-Circuit Quantization☆24Updated 6 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆34Updated 2 months ago
- ☆35Updated 2 years ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Updated last year
- XmodelLM☆38Updated last year
- ☆67Updated 8 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆53Updated last year
- ☆56Updated last year
- A Data Source for Reasoning Embodied Agents☆19Updated 2 years ago
- ☆28Updated 4 months ago
- The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"☆129Updated 3 months ago
- ☆21Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Updated 2 years ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆84Updated last year
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficien…☆26Updated 2 months ago
- MEXMA: Token-level objectives improve sentence representations☆42Updated 11 months ago
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"☆117Updated 5 months ago
- List of papers on Self-Correction of LLMs.☆81Updated 11 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last week
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆14Updated 4 months ago
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆37Updated last year
- ☆24Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆123Updated 4 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆35Updated last year
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆22Updated 2 months ago
- ☆50Updated 6 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated last year