☆36Jan 13, 2026Updated last month
Alternatives and similar repositories for Latent-SFT
Users that are interested in Latent-SFT are comparing it to the libraries listed below
Sorting:
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…☆17Oct 4, 2025Updated 5 months ago
- Official implementation of "Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought" (NeurIPS 2025)☆38Oct 8, 2025Updated 5 months ago
- ☆15Dec 10, 2021Updated 4 years ago
- a website for accessing many models through api(deepseek、Qwen、Hunyuan etc.)☆17Jul 12, 2025Updated 7 months ago
- Code for Learning K-way D-dimensional Discrete Codes For Compact Embedding Representations☆29Jun 30, 2018Updated 7 years ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆32May 15, 2023Updated 2 years ago
- [ACL 2025] The official pytorch implement of "MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection".☆25May 26, 2025Updated 9 months ago
- Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge☆111Jan 30, 2026Updated last month
- ☆11Jun 7, 2023Updated 2 years ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆30Sep 12, 2025Updated 5 months ago
- Clustered Compositional Embeddings☆11Oct 25, 2023Updated 2 years ago
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 8 months ago
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆27Apr 27, 2025Updated 10 months ago
- Reference implementation of models from Nyonic Model Factory☆12May 13, 2024Updated last year
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- ☆31Oct 23, 2025Updated 4 months ago
- ☆11Nov 23, 2024Updated last year
- This repo contains the official code release of the Neural Experts paper, published in NeurIPS 2024.☆14Dec 3, 2024Updated last year
- Residual vector quantization for KV cache compression in large language model☆12Oct 22, 2024Updated last year
- 2022 秋季学期清华大学电子系数据与算法课程 OJ 参考解答☆10Jun 18, 2023Updated 2 years ago
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".☆13Jan 25, 2025Updated last year
- [npj Digital Medicine] A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis☆17Feb 6, 2025Updated last year
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 11 months ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- Official implementation of CytoSAE: Interpretable Cell Embeddings for Hematology☆22Jul 17, 2025Updated 7 months ago
- ☆17Updated this week
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆15Feb 4, 2025Updated last year
- X-ANFIS: An Extensible and Cross-Learning ANFIS Framework for Machine Learning Tasks☆17Jun 7, 2025Updated 9 months ago
- 2023龙芯杯mips赛道作品☆14Dec 23, 2023Updated 2 years ago
- Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Finding]"☆15Aug 27, 2025Updated 6 months ago
- Code accompanying the paper "A contrastive rule for meta-learning"☆13Oct 31, 2024Updated last year
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated last month
- Watermarking LLM papers up-to-date☆11Dec 17, 2023Updated 2 years ago
- ☆11Mar 24, 2025Updated 11 months ago
- [Communications Medicine' 25 (Nature Portfolio) ] Tuning Vision Foundation Models for Rectal Cancer Segmentation from CT Scans☆13Jul 11, 2025Updated 7 months ago
- ☆10Nov 27, 2024Updated last year
- ☆10Oct 28, 2024Updated last year
- NSCSCC “龙芯杯” 2024 个人赛 LoongArch 赛道三等奖☆14Aug 17, 2024Updated last year
- 2022龙芯杯个人赛三等奖作品☆14Oct 11, 2023Updated 2 years ago