"FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jiaxuan You
☆20Dec 30, 2025Updated 5 months ago
Alternatives and similar repositories for FusionFactory
Users that are interested in FusionFactory are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You☆71Dec 30, 2025Updated 5 months ago
- [ICML 2025]"Graph World Model", Tao Feng, Yexin Wu, Guanyu Lin, Jiaxuan You☆39Sep 20, 2025Updated 8 months ago
- [ACL'25 Main] Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs☆42May 26, 2025Updated last year
- SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts☆65Dec 1, 2025Updated 6 months ago
- Code for "FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge". EMNLP 2023.☆20Dec 25, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆21Apr 9, 2025Updated last year
- Repository for "Training Language Models To Explain Their Own Computations"☆22Dec 22, 2025Updated 5 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆135Dec 30, 2025Updated 5 months ago
- [AAAI 2026] AD-L-JEPA: Self-Supervised Representation Learning with Joint Embedding Predictive Architecture for Automotive LiDAR Object D…☆45Nov 18, 2025Updated 6 months ago
- ☆21Jan 15, 2024Updated 2 years ago
- [ICCVW2025] V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard?☆13Dec 17, 2025Updated 5 months ago
- Repository for the COLM 2025 paper SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths☆19Jul 10, 2025Updated 11 months ago
- ☆33Feb 4, 2026Updated 4 months ago
- Source code for the data collection and analysis used in the 'How unique is your onion?' project.☆11Dec 15, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆152Jan 21, 2026Updated 4 months ago
- [AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 9 months ago
- ☆25Mar 17, 2026Updated 2 months ago
- 基于InternLm chat 7B大模型基座,构建一个Agent ,可以调用 MMYOLO 工具来完成图像内视觉任务☆11Oct 30, 2024Updated last year
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆51Mar 31, 2026Updated 2 months ago
- ☆18May 4, 2023Updated 3 years ago
- This repository contains all the code and data used in our article titled “Estimating international trade status of countries from global…☆10Jul 6, 2023Updated 2 years ago
- ☆47Sep 13, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- TokenSim is a tool for simulating the behavior of large language models (LLMs) in a distributed environment.☆24Sep 20, 2025Updated 8 months ago
- ☆16Mar 26, 2025Updated last year
- The course website for Large Language Models Methods and Applications☆28May 6, 2024Updated 2 years ago
- ☆30Apr 17, 2025Updated last year
- PyTorch implementation of "Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization"☆11Jul 24, 2023Updated 2 years ago
- ☆41Feb 8, 2026Updated 4 months ago
- ☆20May 14, 2025Updated last year
- ☆22Mar 6, 2026Updated 3 months ago
- KeepGPU is a simple CLI app that keeps your GPUs running.☆36Mar 9, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Dec 14, 2025Updated 6 months ago
- ☆11Dec 11, 2024Updated last year
- Sharing my solutions to data science hackathons conducted by Analytics Vidhya☆11Apr 29, 2018Updated 8 years ago
- [NeurIPS 2025] TrajAgent: An LLM-Agent Framework for Trajectory Modeling via Large-and-Small Model Collaboration☆26Nov 30, 2025Updated 6 months ago
- [Up-To-Date] Awesome Agent Memory Paper Resource☆156Feb 11, 2026Updated 4 months ago
- 赛题的解题思路描述和项目源代码☆16Jan 31, 2024Updated 2 years ago
- ☆15Sep 15, 2021Updated 4 years ago