xverse-ai / XVERSE-V-13BView external linksLinks
☆79May 6, 2024Updated last year
Alternatives and similar repositories for XVERSE-V-13B
Users that are interested in XVERSE-V-13B are comparing it to the libraries listed below
Sorting:
- Its an open source LLM based on MOE Structure.☆58Jul 2, 2024Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Apr 12, 2024Updated last year
- XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc.☆53Apr 9, 2024Updated last year
- ☆189Feb 5, 2026Updated last week
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆644Apr 9, 2024Updated last year
- Mixture-of-Experts (MoE) Language Model☆194Sep 9, 2024Updated last year
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆39Sep 12, 2024Updated last year
- ☆83Sep 5, 2024Updated last year
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39May 8, 2024Updated last year
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- ☆11Nov 5, 2024Updated last year
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Jun 28, 2024Updated last year
- The OBMO module embedded in PatchNet☆10Feb 21, 2024Updated last year
- [ICCV 2025] Explore the Limits of Omni-modal Pretraining at Scale☆123Sep 2, 2024Updated last year
- Official implementation for "Diffusion Instruction Tuning"☆31Jun 10, 2025Updated 8 months ago
- [Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective☆28Apr 4, 2024Updated last year
- Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval And Synthesis For SLMs☆54Oct 7, 2025Updated 4 months ago
- A survey on MM-LLMs for long video understanding: From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long…☆18Sep 12, 2025Updated 5 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10May 30, 2024Updated last year
- Official code for ICLR 2024 paper "Do Generated Data Always Help Contrastive Learning?"☆31Apr 4, 2024Updated last year
- ☆20Nov 20, 2024Updated last year
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆58Nov 16, 2024Updated last year
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆259Apr 14, 2025Updated 10 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆139Jun 12, 2024Updated last year
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆22May 28, 2025Updated 8 months ago
- This is the official implementation of ICLR 2024 paper "VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimod…☆19Feb 24, 2025Updated 11 months ago
- [ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation☆35Sep 12, 2024Updated last year
- [NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents☆318Apr 16, 2024Updated last year
- 万卷1.0多模态语料☆569Oct 20, 2023Updated 2 years ago
- OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems☆120Jul 13, 2025Updated 7 months ago
- ☆18Apr 18, 2025Updated 10 months ago
- An open-source toolkit helping developers build natural language database query solutions☆27May 5, 2025Updated 9 months ago
- ☆15Jun 20, 2024Updated last year
- Large Multimodal Model☆15Apr 8, 2024Updated last year
- ☆16Oct 21, 2024Updated last year
- The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".☆252Feb 5, 2024Updated 2 years ago
- ☆37Sep 16, 2024Updated last year
- ☆40Oct 17, 2024Updated last year
- Official implement of MIA-DPO☆70Jan 23, 2025Updated last year