AI4WA / OpenOmniFramework
Multimodal Open Source Framework for Conversational Agent Research and Development.
☆19Updated 2 months ago
Alternatives and similar repositories for OpenOmniFramework
Users that are interested in OpenOmniFramework are comparing it to the libraries listed below
Sorting:
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 2 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆16Updated last year
- Code and data from the paper 'Human Feedback is not Gold Standard'☆19Updated 10 months ago
- Measuring and Controlling Persona Drift in Language Model Dialogs☆17Updated last year
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆12Updated 3 weeks ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated 11 months ago
- ☆17Updated 3 months ago
- Verifiers for LLM Reinforcement Learning☆18Updated 3 weeks ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- a tool for gerenate dataset from doc☆12Updated last month
- Enhancement in Multimodal Representation Learning.☆40Updated last year
- Official Repository for Task-Circuit Quantization☆20Updated 2 weeks ago
- ☆13Updated 8 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated 3 weeks ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated last month
- ☆57Updated 5 months ago
- MEXMA: Token-level objectives improve sentence representations☆41Updated 4 months ago
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 3 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 11 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 9 months ago
- Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.☆17Updated last year
- ☆37Updated 2 years ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆43Updated 2 months ago
- ☆13Updated 5 months ago
- ☆16Updated 2 months ago
- Training hybrid models for dummies.☆21Updated 3 months ago
- ☆64Updated last month
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆24Updated 3 months ago
- ☆27Updated 2 weeks ago
- Official implementation of ECCV24 paper: POA☆24Updated 9 months ago