U-C4N / Deepseek-CoTLinks
Deepseek-CoT
☆10Updated last year
Alternatives and similar repositories for Deepseek-CoT
Users that are interested in Deepseek-CoT are comparing it to the libraries listed below
Sorting:
- An automated data pipeline scaling RL to pretraining levels☆67Updated 3 weeks ago
- Unofficial Implementation of Evolutionary Model Merging☆41Updated last year
- ☆29Updated 4 months ago
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Updated 10 months ago
- ☆15Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆31Updated 3 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆16Updated 10 months ago
- entropix style sampling + GUI☆27Updated last year
- ☆24Updated last year
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆35Updated 8 months ago
- ☆26Updated 9 months ago
- ☆37Updated last month
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆106Updated 5 months ago
- ☆60Updated 4 months ago
- ☆55Updated last year
- ☆67Updated 7 months ago
- An unofficial pytorch implementation of 'Efficient Infinite Context Transformers with Infini-attention'☆54Updated last year
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆33Updated last month
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 9 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆29Updated 10 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 8 months ago
- ☆122Updated 8 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated last year
- A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning☆47Updated last week
- ☆92Updated last year
- ☆18Updated 6 months ago
- ☆93Updated 4 months ago
- Lego for GRPO☆30Updated 5 months ago