"FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jiaxuan You
☆20Dec 30, 2025Updated 4 months ago
Alternatives and similar repositories for FusionFactory
Users that are interested in FusionFactory are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You☆67Dec 30, 2025Updated 4 months ago
- [ICML 2025] ResearchTown: Simulator of Human Research Community☆202Updated this week
- ☆13Jun 4, 2024Updated last year
- SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts☆64Dec 1, 2025Updated 5 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆21Sep 11, 2023Updated 2 years ago
- [COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model☆27Nov 25, 2025Updated 5 months ago
- Repository for "Training Language Models To Explain Their Own Computations"☆22Dec 22, 2025Updated 4 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆129Dec 30, 2025Updated 4 months ago
- ☆35Jun 28, 2025Updated 10 months ago
- [ICCVW2025] V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard?☆11Dec 17, 2025Updated 4 months ago
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.☆11Jun 2, 2022Updated 3 years ago
- Source code for Jellyfish, a soft real-time inference serving system☆15Dec 20, 2022Updated 3 years ago
- ☆33Feb 4, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Source code of IPA, https://escholarship.org/uc/item/2p0805dq☆12Jun 27, 2024Updated last year
- [ICLR 2025] PyTorch Implementation of "ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time"☆31Jul 20, 2025Updated 9 months ago
- ☆149Jan 21, 2026Updated 3 months ago
- ☆38Feb 11, 2026Updated 2 months ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 8 months ago
- 基于InternLm chat 7B大模型基座,构建一个Agent ,可以调用 MMYOLO 工具来完成图像内视觉任务☆11Oct 30, 2024Updated last year
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆50Mar 31, 2026Updated last month
- ☆19May 4, 2023Updated 3 years ago
- Bird's Eye View Calibration Toolkit☆18Jun 21, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repository contains all the code and data used in our article titled “Estimating international trade status of countries from global…☆10Jul 6, 2023Updated 2 years ago
- ☆46Sep 13, 2025Updated 7 months ago
- Rank9 IJCAI-18 阿里妈妈搜索广告转化预测 第一赛季☆10Aug 22, 2018Updated 7 years ago
- ☆16Mar 26, 2025Updated last year
- TokenSim is a tool for simulating the behavior of large language models (LLMs) in a distributed environment.☆22Sep 20, 2025Updated 7 months ago
- ☆37Feb 8, 2026Updated 2 months ago
- PyTorch implementation of "Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization"☆11Jul 24, 2023Updated 2 years ago
- ☆34Oct 13, 2025Updated 6 months ago
- ☆22Mar 6, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- KeepGPU is a simple CLI app that keeps your GPUs running.☆34Mar 9, 2026Updated last month
- OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problems with Reasoning LLM☆88Aug 28, 2025Updated 8 months ago
- ☆14Dec 14, 2025Updated 4 months ago
- ☆11Dec 11, 2024Updated last year
- Sharing my solutions to data science hackathons conducted by Analytics Vidhya☆11Apr 29, 2018Updated 8 years ago
- [Up-To-Date] Awesome Agent Memory Paper Resource☆138Feb 11, 2026Updated 2 months ago
- 赛题的解题思路描述和项目源代码☆16Jan 31, 2024Updated 2 years ago