FuseAI Project
☆93 · Jan 25, 2025 · Updated last year
Alternatives and similar repositories for FuseAI
Users that are interested in FuseAI are comparing it to the libraries listed below.
- FuseAI Project ☆595 · Jan 25, 2025 · Updated last year
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration ☆36 · Mar 10, 2024 · Updated 2 years ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion ☆14 · Mar 17, 2025 · Updated last year
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ☆33 · May 9, 2024 · Updated last year
- ☆14 · Apr 16, 2024 · Updated 2 years ago
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- ☆21 · Jul 5, 2024 · Updated last year
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud ☆23 · Mar 10, 2024 · Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24) ☆146 · Sep 20, 2024 · Updated last year
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated ☆34 · Aug 14, 2024 · Updated last year
- Codes for Merging Large Language Models ☆35 · Aug 7, 2024 · Updated last year
- AvatarGo: Plug and Play self-avatars for VR ☆21 · Nov 22, 2022 · Updated 3 years ago
- ☆12 · Jun 28, 2021 · Updated 4 years ago
- PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation ☆16 · Mar 28, 2023 · Updated 3 years ago
- Notebooks showing off LangChain.js v0.1.0 features. ☆20 · Jan 8, 2024 · Updated 2 years ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆31 · Apr 8, 2024 · Updated 2 years ago
- Moondream MCP Server in Python ☆46 · Jul 2, 2025 · Updated 10 months ago
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St… ☆25 · May 10, 2024 · Updated last year
- DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings ☆19 · Nov 24, 2021 · Updated 4 years ago
- Does patch ordering affect context-limited vision transformers? ☆17 · Oct 10, 2025 · Updated 6 months ago
- ☆83 · May 28, 2025 · Updated 11 months ago
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti… ☆11 · Nov 28, 2023 · Updated 2 years ago
- Human I/O, published at CHI 2024, Honorable Mentions Award ☆15 · Oct 22, 2025 · Updated 6 months ago
- Tools for merging pretrained large language models. ☆7,023 · Mar 15, 2026 · Updated last month
- Xmixers: A collection of SOTA efficient token/channel mixers ☆28 · Sep 4, 2025 · Updated 7 months ago
- Official code for the paper "Image generation with shortest path diffusion" accepted at ICML 2023. ☆24 · Jul 10, 2023 · Updated 2 years ago
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM. ☆511 · Aug 26, 2024 · Updated last year
- Setup an MCP server in 60 seconds. ☆13 · Dec 12, 2024 · Updated last year
- Code for Robust Fine-tuning (RbFT) ☆17 · Jan 31, 2025 · Updated last year
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data … ☆848 · Mar 17, 2025 · Updated last year
- ☆241 · Apr 23, 2024 · Updated 2 years ago
- Source code of ACL 2023 accepted paper "AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression" ☆13 · Jun 14, 2023 · Updated 2 years ago
- ☆11 · Nov 13, 2024 · Updated last year
- Codebase for Merging Language Models (ICML 2024) ☆866 · May 5, 2024 · Updated last year
- A open webui function for better R1 experience ☆77 · Mar 7, 2025 · Updated last year
- Merge safetensor files using the technique described in "Language Models are Super Mario: Absorbing Abilities from Homologous Models as a…