FreedomIntelligence / ApolloMoELinks
[ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
☆44Updated 7 months ago
Alternatives and similar repositories for ApolloMoE
Users that are interested in ApolloMoE are comparing it to the libraries listed below
Sorting:
- FuseAI Project☆87Updated 5 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆36Updated 8 months ago
- ☆24Updated 9 months ago
- Verifiers for LLM Reinforcement Learning☆60Updated 2 months ago
- OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆76Updated 3 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆44Updated 4 months ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Updated 4 months ago
- ☆17Updated 2 months ago
- ☆86Updated last month
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs☆85Updated last month
- ☆65Updated 2 months ago
- Reformatted Alignment☆113Updated 9 months ago
- ☆53Updated last year
- Code for KaLM-Embedding models☆78Updated 3 months ago
- ☆40Updated last year
- ☆20Updated last year
- ☆56Updated 7 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness☆72Updated last week
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆66Updated last year
- ☆50Updated last year
- a curated list of the role of small models in the LLM era☆101Updated 9 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆58Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆93Updated 2 weeks ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆38Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆78Updated last year
- Multilingual Medicine: Model, Dataset, Benchmark, Code☆189Updated 8 months ago
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆61Updated last year
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever☆40Updated last week
- ☆29Updated last year