"FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jiaxuan You
☆22Dec 30, 2025Updated 6 months ago
Alternatives and similar repositories for FusionFactory
Users that are interested in FusionFactory are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL'25 Main] Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs☆41May 26, 2025Updated last year
- ☆13Jun 4, 2024Updated 2 years ago
- ☆14Aug 30, 2023Updated 2 years ago
- Code for "FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge". EMNLP 2023.☆20Dec 25, 2023Updated 2 years ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆21Apr 9, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆21Sep 11, 2023Updated 2 years ago
- [COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model☆26Nov 25, 2025Updated 7 months ago
- Repository for "Training Language Models To Explain Their Own Computations"☆22Dec 22, 2025Updated 6 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆141Dec 30, 2025Updated 6 months ago
- Code and data for Marked Personas (ACL 2023)☆30May 26, 2023Updated 3 years ago
- [ICCVW2025] V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard?☆13Dec 17, 2025Updated 6 months ago
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.☆10Jun 2, 2022Updated 4 years ago
- Source code for Jellyfish, a soft real-time inference serving system☆15Dec 20, 2022Updated 3 years ago
- Design and analyze optimal deep learning models.☆31Aug 2, 2025Updated 11 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆155Jan 21, 2026Updated 5 months ago
- ☆41Feb 11, 2026Updated 4 months ago
- [ICLR 2025] PyTorch Implementation of "ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time"☆34Jul 20, 2025Updated 11 months ago
- [AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 10 months ago
- About Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning"☆13Oct 7, 2023Updated 2 years ago
- ☆26Mar 17, 2026Updated 3 months ago
- 基于InternLm chat 7B大模型基座,构建一个Agent ,可以调用 MMYOLO 工具来完成图像内视觉任务☆11Oct 30, 2024Updated last year
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆51Mar 31, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18May 4, 2023Updated 3 years ago
- ☆12Oct 16, 2022Updated 3 years ago
- This repository contains all the code and data used in our article titled “Estimating international trade status of countries from global…☆10Jul 6, 2023Updated 2 years ago
- ☆48Sep 13, 2025Updated 9 months ago
- The code for the paper "BotMoE: Twitter Bot Detection with Community-Aware Mixtures of Modal-Specific Experts"☆28Sep 16, 2023Updated 2 years ago
- Rank9 IJCAI-18 阿里妈妈搜索广告转化预测 第一赛季☆10Aug 22, 2018Updated 7 years ago
- TokenSim is a tool for simulating the behavior of large language models (LLMs) in a distributed environment.☆25Jun 26, 2026Updated last week
- ☆16Mar 26, 2025Updated last year
- ☆32Apr 17, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆34Oct 13, 2025Updated 8 months ago
- OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problems with Reasoning LLM☆95Jun 23, 2026Updated last week
- Reviews of part of courses of AI☆25Jun 19, 2023Updated 3 years ago
- decision-making processes of human drivers☆14Mar 28, 2024Updated 2 years ago
- [Up-To-Date] Awesome Agent Memory Paper Resource☆165Feb 11, 2026Updated 4 months ago
- 微信AI内容创作智能体,可自动完成信息爬取、内容整理、排版及草稿推送。涵盖Kaggle竞赛、HuggingFace论文以及ProductHunt产品资讯。☆16Aug 3, 2025Updated 11 months ago
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod…☆41Mar 7, 2024Updated 2 years ago