SLIT-AI / FuseChat-3.0Links
☆18Updated 9 months ago
Alternatives and similar repositories for FuseChat-3.0
Users that are interested in FuseChat-3.0 are comparing it to the libraries listed below
Sorting:
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆33Updated 5 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated last year
- Modified Beam Search with periodical restart☆12Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 4 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆37Updated 4 months ago
- entropix style sampling + GUI☆27Updated last year
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆25Updated last year
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆23Updated 7 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆97Updated 9 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆24Updated last year
- ☆17Updated 6 months ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆38Updated last year
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- ☆19Updated last year
- The open-source code of MetaStone-S1.☆105Updated 6 months ago
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios"☆28Updated 7 months ago
- ☆19Updated 11 months ago
- ☆31Updated 11 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Updated last year
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Updated last year
- ☆33Updated 8 months ago
- Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.☆19Updated 11 months ago
- ☆17Updated 2 years ago
- Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval And Synthesis For SLMs☆53Updated 4 months ago
- A pipeline parallel training script for LLMs.☆166Updated 9 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆53Updated last year
- [ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts☆52Updated last year
- ☆32Updated 2 weeks ago
- ☆56Updated last year