wenge-research / CRE-SFT
A supervised fine-tuning method for controllable reasoning length in large language models
☆10 · May 8, 2025 · Updated 9 months ago
Alternatives and similar repositories for CRE-SFT
Users who are interested in CRE-SFT are comparing it to the repositories listed below.
- This repository contains code and data for the paper "TableEval: A Real-World Benchmark for Complex, Multilingual, and Multi-Structured T… ☆28 · Jun 12, 2025 · Updated 8 months ago
- DeepLiterature: A fully open-source intelligent research assistant that integrates search, code execution, link resolution, and informati… ☆100 · Mar 19, 2025 · Updated 10 months ago
- [COLING 2024 (Oral)] PromISe: Releasing the Capabilities of LLMs with Prompt Introspective Search ☆23 · Aug 26, 2024 · Updated last year
- Classic Chess game using x86 Assembly Language ☆11 · Apr 23, 2019 · Updated 6 years ago
- This is the code of an agentic RAG method with dynamic workflow. ☆13 · Jan 22, 2026 · Updated 3 weeks ago
- Develop a machine learning (ML) model for lung cancer detection using U-Net and DenseNet architectures. Achieve an accuracy of at least 9… ☆10 · Dec 9, 2023 · Updated 2 years ago
- A-Soul-Data: JSON data storage ☆13 · Sep 17, 2022 · Updated 3 years ago
- ☆21 · Jul 8, 2025 · Updated 7 months ago
- Learning to Skip the Middle Layers of Transformers ☆17 · Aug 7, 2025 · Updated 6 months ago
- ☆10 · Jul 24, 2023 · Updated 2 years ago
- Fully working chess game implemented in the x86 Intel Assembly language ☆13 · Oct 3, 2022 · Updated 3 years ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models. ☆10 · May 16, 2024 · Updated last year
- The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension ☆12 · Oct 23, 2022 · Updated 3 years ago
- ☆13 · May 15, 2025 · Updated 8 months ago
- ☆10 · Oct 28, 2020 · Updated 5 years ago
- TurboFuzzLLM: Turbocharging Mutation-based Fuzzing for Effectively Jailbreaking Large Language Models in Practice ☆22 · Nov 24, 2025 · Updated 2 months ago
- Code for the paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM" ☆14 · Nov 17, 2023 · Updated 2 years ago
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments. ☆25 · Jan 31, 2026 · Updated 2 weeks ago
- Plagiarism Detection Approach for PAN 2015 Text Alignment task ☆11 · May 11, 2018 · Updated 7 years ago
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures ☆30 · Jan 29, 2026 · Updated 2 weeks ago
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forget-retain Pareto Optimality ☆19 · Oct 22, 2025 · Updated 3 months ago
- ☆12 · Aug 16, 2018 · Updated 7 years ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents ☆38 · Feb 4, 2026 · Updated last week
- [EMNLP'22] Textual Manifold-based Defense Against Natural Language Adversarial Examples ☆11 · Apr 6, 2023 · Updated 2 years ago
- Deep Variational Information Bottleneck (DVIB) in PyTorch. ☆10 · Apr 25, 2020 · Updated 5 years ago
- This is a repository for DKI group concerning the LLM-related papers along with code. ☆28 · Feb 5, 2026 · Updated last week
- ☆14 · May 7, 2024 · Updated last year
- Chrome Extension to detect Malicious Websites ☆12 · May 29, 2024 · Updated last year
- [ICLR 2024] Towards Eliminating Hard Label Constraints in Gradient Inversion Attacks ☆14 · Feb 6, 2024 · Updated 2 years ago
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct" ☆20 · May 15, 2025 · Updated 8 months ago
- ☆16 · Mar 1, 2025 · Updated 11 months ago
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge ☆14 · Feb 20, 2024 · Updated last year
- ☆11 · Mar 6, 2022 · Updated 3 years ago
- DILMA: Differentiable Language Model Adversarial Attacks on Categorical Sequence Classifiers ☆12 · Oct 7, 2020 · Updated 5 years ago
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings) ☆14 · Aug 12, 2024 · Updated last year
- ☆13 · Oct 13, 2022 · Updated 3 years ago
- 8086 Assembly Chess ☆11 · Feb 11, 2019 · Updated 7 years ago
- ☆19 · Jul 14, 2025 · Updated 7 months ago
- An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer… ☆798 · Mar 13, 2025 · Updated 11 months ago