FreedomIntelligence / ApolloMoELinks

[ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts

☆51

Alternatives and similar repositories for ApolloMoE

Users that are interested in ApolloMoE are comparing it to the libraries listed below

Sorting:

18907305772 / FuseAI
FuseAI Project
☆87Updated 9 months ago
du-nlp-lab / MLR-Copilot
☆67Updated 7 months ago
FreedomIntelligence / Apollo
Multilingual Medicine: Model, Dataset, Benchmark, Code
☆197Updated last year
dinobby / MAgICoRE
☆24Updated last year
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆95Updated 6 months ago
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆65Updated last year
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆79Updated 7 months ago
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆93Updated last year
zou-group / sirius
SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning
☆72Updated last week
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45Updated last year
xufangzhi / phi-Decoding
[ACL 2025] An inference-time decoding strategy with adaptive foresight sampling
☆106Updated 6 months ago
THUDM / DeepDive
DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL
☆204Updated last month
Zoeyyao27 / SirLLM
This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM
☆60Updated last year
QwenLM / Self-Lengthen
☆92Updated last year
StigLidu / DualDistill
[EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"
☆100Updated 2 months ago
THU-KEG / Agentic-Reward-Modeling
[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
☆112Updated 5 months ago
SimonAytes / SoT
Official code repository for Sketch-of-Thought (SoT)
☆129Updated 6 months ago
zjunlp / KnowSelf
[ACL 2025] Agentic Knowledgeable Self-awareness
☆89Updated 5 months ago
RUC-NLPIR / HiRA
The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search
☆62Updated 4 months ago
metal-chart-generation / metal
☆40Updated 5 months ago
hyintell / RetrievalQA
Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…
☆69Updated last year
padas-lab-de / ir-rag-sigir24-persona-rag
☆51Updated last year
shan23chen / MedBrowseComp
☆35Updated 6 months ago
David-Li0406 / Preference-Leakage
☆51Updated 6 months ago
VectorSpaceLab / Infomatica
Data Synthesis for Deep Research Based on Semi-Structured Data
☆179Updated last week
SalesforceAIResearch / PretrainRL-pipeline
An automated data pipeline scaling RL to pretraining levels
☆68Updated last month
ali-bahrainian / RAG_best_practices
☆98Updated 7 months ago
YutongWang1216 / DocMTAgent
Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
☆54Updated 9 months ago
opendatalab / OHR-Bench
(ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
☆94Updated 4 months ago
arcee-ai / DAM
☆55Updated last year