D2I-ai/dasd-thinking

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/D2I-ai/dasd-thinking)

D2I-ai / dasd-thinking

☆105

Alternatives and similar repositories for dasd-thinking

Users that are interested in dasd-thinking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shawnli / on-policy-distillation
View on GitHub
Implementation of On-Policy Distillation (GKD) for Language Models - ICLR 2024
☆23Nov 24, 2025Updated 8 months ago
Guinan-Su / auto-merge-llm
View on GitHub
An official repository for GPTailor
☆18Jun 29, 2025Updated last year
LUMIA-Group / PonderingLM
View on GitHub
Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"
☆26Jul 21, 2025Updated last year
Trae1ounG / BuPO
View on GitHub
[arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
☆60Feb 6, 2026Updated 5 months ago
UmeanNever / RankSurprisalRatio
View on GitHub
[ACL 2026 Main] Official Repo for Paper "Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Ali…
☆17Jul 1, 2026Updated 3 weeks ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
multimodal-art-projection / REER_DeepWriter
View on GitHub
REverse-Engineered Reasoning for Open-Ended Generation
☆98Sep 10, 2025Updated 10 months ago
shiweijiezero / R3L
View on GitHub
☆23Apr 5, 2026Updated 3 months ago
Applied-Machine-Learning-Lab / PLATE
View on GitHub
implementation code for 'PLATE: A Prompt-Enhanced Paradigm for Multi-Scenario Recommendations' in SIGIR 2023
☆13Sep 27, 2024Updated last year
GMLR-Penn / Multiplex-Thinking
View on GitHub
Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge
☆131May 24, 2026Updated 2 months ago
cvenhoff / steering-thinking-llms
View on GitHub
☆39Jul 9, 2025Updated last year
FatemehShiri / Spatial-MM
View on GitHub
☆12Jan 10, 2025Updated last year
bubble65 / DLLM-Searcher
View on GitHub
DLLM-Searcher has been accepted by SIGIR 2026! 🥳
☆33Jan 23, 2026Updated 6 months ago
phymhan / S2D2
View on GitHub
☆16Jun 17, 2026Updated last month
thu-coai / SPaR
View on GitHub
☆47Jun 11, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
TencentCloudADP / youtu-vl
View on GitHub
Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision
☆169Feb 6, 2026Updated 5 months ago
insuhan / calibquant
View on GitHub
☆21Apr 3, 2025Updated last year
Ionio-io / langgraph-with-crewai
View on GitHub
This repository contains detailed introduction to Langgraph, a new Langchain library. You will learn about langgraph and also create a mi…
☆18May 27, 2024Updated 2 years ago
abdelfattah-lab / TokenButler
View on GitHub
☆27May 12, 2026Updated 2 months ago
RUCAIBox / Passk_Training
View on GitHub
The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models''
☆113Aug 15, 2025Updated 11 months ago
KnowledgeXLab / O2-Searcher
View on GitHub
[TMLR 2026] A Searching-based Agent Model for Open-Domain Open-Ended Question Answering
☆39Jun 20, 2025Updated last year
ianhohoho / auto-hyde
View on GitHub
🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…
☆38Mar 26, 2024Updated 2 years ago
RUCBM / G-OPD
View on GitHub
Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"
☆276May 28, 2026Updated 2 months ago
wizard-III / Archer2.0
View on GitHub
Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature converg…
☆31Oct 10, 2025Updated 9 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
SWE-Gym / SWE-Bench-Fork
View on GitHub
☆13Mar 5, 2025Updated last year
hecongqing / Competition_Baselines
View on GitHub
☆11Sep 25, 2020Updated 5 years ago
THU-KEG / LongWriter-V
View on GitHub
[ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models
☆24Mar 29, 2025Updated last year
Kwai-YuanQi / MM-RLHF
View on GitHub
The Next Step Forward in Multimodal LLM Alignment
☆199May 1, 2025Updated last year
bigai-nlco / CREAM
View on GitHub
[NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding
☆22Oct 10, 2024Updated last year
modelscope / easydistill
View on GitHub
a toolkit on knowledge distillation for large language models
☆442Mar 10, 2026Updated 4 months ago
EsmaeilNarimissa / aws-sft-grpo-budget-llm-finetune
View on GitHub
☆19May 17, 2025Updated last year
agentica-project / verl
View on GitHub
☆17Mar 30, 2026Updated 3 months ago
Jikai0Wang / OPT-Tree
View on GitHub
☆30May 24, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Peregrine123 / ROPD_official
View on GitHub
☆76May 8, 2026Updated 2 months ago
InternLM / Condor
View on GitHub
[ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement
☆40May 28, 2025Updated last year
Open-Source-O1 / o1_Reasoning_Patterns_Study
View on GitHub
☆105Dec 6, 2024Updated last year
princeton-pli / STAT
View on GitHub
Skill-Targeted Adaptive Training
☆24Mar 12, 2026Updated 4 months ago
URRealHero / JudgeAnything
View on GitHub
☆17Jun 1, 2025Updated last year
dongyh20 / Insight-V
View on GitHub
[CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
☆240Nov 7, 2025Updated 8 months ago
foreverlasting1202 / QuestA
View on GitHub
☆22Jan 2, 2026Updated 6 months ago