DJC-GO-SOLO / Latent-SFTView external linksLinks
☆36Jan 13, 2026Updated last month
Alternatives and similar repositories for Latent-SFT
Users that are interested in Latent-SFT are comparing it to the libraries listed below
Sorting:
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…☆17Oct 4, 2025Updated 4 months ago
- Official implementation of "Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought" (NeurIPS 2025)☆37Oct 8, 2025Updated 4 months ago
- ☆15Dec 10, 2021Updated 4 years ago
- a website for accessing many models through api(deepseek、Qwen、Hunyuan etc.)☆17Jul 12, 2025Updated 7 months ago
- Code for Learning K-way D-dimensional Discrete Codes For Compact Embedding Representations☆29Jun 30, 2018Updated 7 years ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆32May 15, 2023Updated 2 years ago
- [ACL 2025] The official pytorch implement of "MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection".☆26May 26, 2025Updated 8 months ago
- Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge☆105Jan 30, 2026Updated 2 weeks ago
- ☆11Jun 7, 2023Updated 2 years ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆29Sep 12, 2025Updated 5 months ago
- Clustered Compositional Embeddings☆11Oct 25, 2023Updated 2 years ago
- 2022 秋季学期清华大学电子系数据与算法课程 OJ 参考解答☆10Jun 18, 2023Updated 2 years ago
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".☆13Jan 25, 2025Updated last year
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 10 months ago
- This repo contains the official code release of the Neural Experts paper, published in NeurIPS 2024.☆13Dec 3, 2024Updated last year
- [npj Digital Medicine] A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis☆17Feb 6, 2025Updated last year
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- Residual vector quantization for KV cache compression in large language model☆11Oct 22, 2024Updated last year
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"☆37Oct 9, 2025Updated 4 months ago
- Reference implementation of models from Nyonic Model Factory☆12May 13, 2024Updated last year
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 7 months ago
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- ☆17Feb 3, 2026Updated 2 weeks ago
- ☆11Nov 23, 2024Updated last year
- Official implementation of CytoSAE: Interpretable Cell Embeddings for Hematology☆22Jul 17, 2025Updated 7 months ago
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆26Apr 27, 2025Updated 9 months ago
- Official codebase for the paper Latent Visual Reasoning☆111Oct 22, 2025Updated 3 months ago
- A comprehensive overview of Data Distillation and Condensation (DDC). DDC is a data-centric task where a representative (i.e., small but …☆13Dec 1, 2022Updated 3 years ago
- 2023龙芯杯mips赛道作品☆14Dec 23, 2023Updated 2 years ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆14Feb 4, 2025Updated last year
- ☆10Oct 28, 2024Updated last year
- A parallel coordinates plot using matplotlib☆11Aug 13, 2021Updated 4 years ago
- X-ANFIS: An Extensible and Cross-Learning ANFIS Framework for Machine Learning Tasks☆17Jun 7, 2025Updated 8 months ago
- ☆11Mar 24, 2025Updated 10 months ago
- Open-source code of TransGTR.☆15Nov 1, 2023Updated 2 years ago
- ☆11Sep 7, 2020Updated 5 years ago
- ☆13Jul 10, 2024Updated last year
- [ICLR 2026] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs☆43Oct 14, 2025Updated 4 months ago
- [Communications Medicine' 25 (Nature Portfolio) ] Tuning Vision Foundation Models for Rectal Cancer Segmentation from CT Scans☆13Jul 11, 2025Updated 7 months ago