"FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jiaxuan You
☆20Dec 30, 2025Updated 3 months ago
Alternatives and similar repositories for FusionFactory
Users that are interested in FusionFactory are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You☆63Dec 30, 2025Updated 3 months ago
- [ICML 2025]"Graph World Model", Tao Feng, Yexin Wu, Guanyu Lin, Jiaxuan You☆33Sep 20, 2025Updated 6 months ago
- [ACL'25 Main] Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs☆41May 26, 2025Updated 10 months ago
- [ICML 2025] ResearchTown: Simulator of Human Research Community☆201Updated this week
- ☆13Jun 4, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Synthetic data generation for evaluating LLM symbolic and logic reasoning☆22Mar 6, 2026Updated last month
- Code for "FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge". EMNLP 2023.☆20Dec 25, 2023Updated 2 years ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated last year
- ☆21Sep 11, 2023Updated 2 years ago
- Repository for "Training Language Models To Explain Their Own Computations"☆21Dec 22, 2025Updated 3 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆127Dec 30, 2025Updated 3 months ago
- ☆34Jun 28, 2025Updated 9 months ago
- ☆21Jan 15, 2024Updated 2 years ago
- Code and data for Marked Personas (ACL 2023)☆30May 26, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ICCVW2025] V-RoAst: A New Dataset for Visual Road Assessment☆11Dec 17, 2025Updated 3 months ago
- Source code for Jellyfish, a soft real-time inference serving system☆15Dec 20, 2022Updated 3 years ago
- Repository for the COLM 2025 paper SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths☆18Jul 10, 2025Updated 9 months ago
- Source code of IPA, https://escholarship.org/uc/item/2p0805dq☆12Jun 27, 2024Updated last year
- ☆146Jan 21, 2026Updated 2 months ago
- ☆36Feb 11, 2026Updated 2 months ago
- [AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- About Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning"☆13Oct 7, 2023Updated 2 years ago
- 基于InternLm chat 7B大模型基座,构建一个Agent ,可以调用 MMYOLO 工具来完成图像内视觉任务☆11Oct 30, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Bird's Eye View Calibration Toolkit☆17Jun 21, 2025Updated 9 months ago
- ☆12Oct 16, 2022Updated 3 years ago
- This repository contains all the code and data used in our article titled “Estimating international trade status of countries from global…☆10Jul 6, 2023Updated 2 years ago
- The code for the paper "BotMoE: Twitter Bot Detection with Community-Aware Mixtures of Modal-Specific Experts"☆27Sep 16, 2023Updated 2 years ago
- Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents☆12Jan 14, 2022Updated 4 years ago
- Rank9 IJCAI-18 阿里妈妈搜索广告转化预测 第一赛季☆10Aug 22, 2018Updated 7 years ago
- Open-ended wargames with large language models☆52Feb 11, 2026Updated 2 months ago
- [Up-To-Date] Awesome Agent Memory Paper Resource☆123Feb 11, 2026Updated 2 months ago
- ☆16Mar 26, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- TokenSim is a tool for simulating the behavior of large language models (LLMs) in a distributed environment.☆22Sep 20, 2025Updated 6 months ago
- The course website for Large Language Models Methods and Applications☆28May 6, 2024Updated last year
- PyTorch implementation of "Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization"☆11Jul 24, 2023Updated 2 years ago
- ☆20May 14, 2025Updated 11 months ago
- ☆21Mar 6, 2026Updated last month
- ☆14Dec 14, 2025Updated 4 months ago
- Sharing my solutions to data science hackathons conducted by Analytics Vidhya☆11Apr 29, 2018Updated 7 years ago