Yuan-lab-LLM / Yuan3.0View external linksLinks
Yuan3.0: Mixture-of-Experts (MoE) Language Model
☆87Jan 9, 2026Updated last month
Alternatives and similar repositories for Yuan3.0
Users that are interested in Yuan3.0 are comparing it to the libraries listed below
Sorting:
- Model Merging with Functional Dual Anchors☆45Nov 23, 2025Updated 2 months ago
- PeRL: Parameter-Efficient Reinforcement Learning☆70Updated this week
- Tools for OpenDataArena: Fair, Open, and Transparent Arena for Data☆132Jan 31, 2026Updated 2 weeks ago
- ThinkGen: Generalized Thinking for Visual Generation☆51Dec 30, 2025Updated last month
- Our solution to Putnam 2025.☆73Jan 9, 2026Updated last month
- Utility to convert Spade device video streams to MJPEG for live viewing in web browsers, VLC, etc.☆11Nov 20, 2023Updated 2 years ago
- ☆34Nov 11, 2025Updated 3 months ago
- ☆29Jan 15, 2026Updated last month
- Attributes Recognition of Apparel☆10Jan 8, 2019Updated 7 years ago
- [NeurIPS 2025] Official implementation of the paper "BecomingLit: Relightable Gaussian Avatars with Hybrid Neural Shading"☆26Nov 27, 2025Updated 2 months ago
- ☆22Feb 2, 2026Updated 2 weeks ago
- The data-centric paradigm is vital for Text-to-SQL tasks, where performance suffers from scarce and simplistic datasets. We propose Text2…☆17Feb 7, 2026Updated last week
- a script from ERNIE1.0 or ERNIE2.0 to transfomers' BERT format☆10Mar 28, 2020Updated 5 years ago
- 中文分 词 Mac版☆10Jul 5, 2017Updated 8 years ago
- Cython library for random number generation☆10Jan 24, 2017Updated 9 years ago
- Implementation of the ACL Findings paper "OutFlip: Generating Examples for Unknown Intent Detection with Natural Language Attack"☆10May 24, 2021Updated 4 years ago
- Official implementation for “SafeMVDrive: Multi-view Safety-Critical Driving Video Synthesis in the Real World Domain”☆20Dec 11, 2025Updated 2 months ago
- ☆36Dec 18, 2025Updated last month
- ☆30Sep 19, 2025Updated 4 months ago
- Helper functions for the visjs family☆15Feb 9, 2026Updated last week
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆27Dec 24, 2025Updated last month
- Quartet II Official Code☆43Feb 2, 2026Updated last week
- [ICCV 2025] LIRA☆21Nov 25, 2025Updated 2 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆13Jul 27, 2025Updated 6 months ago
- Materials for paper "Are Large Language Models Temporally Grounded?"☆13Nov 16, 2023Updated 2 years ago
- 藏头诗生成器 Chinese poem generator with LSTM network☆11May 30, 2019Updated 6 years ago
- Official Implementation of "Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach"☆29Dec 3, 2025Updated 2 months ago
- Interactive Article Explaining Isomap☆44Jan 6, 2026Updated last month
- ☆11Nov 5, 2024Updated last year
- Self Evolving Large Multimodal Models with Continuous Rewards☆19Nov 21, 2025Updated 2 months ago
- Keras Label Smoothing for Supervised Learning☆11May 15, 2020Updated 5 years ago
- ☆20Jun 6, 2021Updated 4 years ago
- ☆20Dec 3, 2025Updated 2 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆34Feb 5, 2026Updated last week
- ☆26Feb 7, 2026Updated last week
- ☆11Aug 6, 2022Updated 3 years ago
- CNN For Fish Training☆24Jul 9, 2025Updated 7 months ago
- Python client for Kurento Media Server. Use these bindings to build an app server for KMS in Python☆12Nov 21, 2024Updated last year
- Where is this IP?☆14Feb 24, 2024Updated last year