SLIT-AI / FuseChat-3.0

☆18

Alternatives and similar repositories for FuseChat-3.0

Users who are interested in FuseChat-3.0 are comparing it to the repositories listed below.


TIGER-AI-Lab / One-Shot-CFT
The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]
☆33 Updated 5 months ago
nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆30 Updated last year
attashe / ModifiedBeamSampler
Modified Beam Search with periodical restart
☆12 Updated last year
fishiatee / Tumera
Yet another frontend for LLMs, written using .NET and WinUI 3
☆10 Updated 4 months ago
sunblaze-ucb / AgentSynth
AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents
☆37 Updated 4 months ago
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆27 Updated last year
severian42 / Proteus-The-Genesis-LLM
Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine
☆25 Updated last year
meetdavidwan / clamr
CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval
☆23 Updated 7 months ago
agokrani / distillKitPlus
Easy-to-use, high-performance knowledge distillation for LLMs
☆97 Updated 9 months ago
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆24 Updated last year
Tencent-Hunyuan / Hunyuan-4B
☆17 Updated 6 months ago
xverse-ai / XVERSE-MoE-A36B
XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.
☆38 Updated last year
astramind-ai / Pulsar
The heart of the Pulsar App: fast, secure, and shared inference with a modern UI
☆59 Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.
☆34 Updated last year
camenduru / FluxMusic-jupyter
☆19 Updated last year
MetaStone-AI / MetaStone-S1
The open-source code of MetaStone-S1.
☆105 Updated 6 months ago
xcltql666 / DenseDiT
Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios"
☆28 Updated 7 months ago
ZihanWang314 / coeCheck
☆19 Updated 11 months ago
asappresearch / josh-llm-simulation-training
☆31 Updated 11 months ago
fishiatee / yawullm
Yet Another (LLM) Web UI, made with Gemini
☆12 Updated last year
cyzus / thoughtsculpt
THOUGHTSCULPT, a general reasoning and search method for complex tasks
☆13 Updated last year
mozhu621 / SuperWriter
☆33 Updated 8 months ago
latent-variable / r1_reasoning_effort
Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.
☆19 Updated 11 months ago
camenduru / MoE-LLaVA-jupyter
☆17 Updated 2 years ago
FractalAIResearchLabs / Fathom-DeepResearch
Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval And Synthesis For SLMs
☆53 Updated 4 months ago
tdrussell / qlora-pipe
A pipeline parallel training script for LLMs.
☆166 Updated 9 months ago
neulab / MultiUI
Code for the paper: Harnessing Webpage UIs for Text-Rich Visual Understanding
☆53 Updated last year
FreedomIntelligence / ApolloMoE
[ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
☆52 Updated last year
SWE-EVO / SWE-EVO
☆32 Updated 2 weeks ago
arcee-ai / DAM
☆56 Updated last year