A supervised fine-tuning method for controllable reasoning length in large language models (一种通过有监督微调实现大语言模型思考长度可控的方法)
☆10May 8, 2025Updated 10 months ago
Alternatives and similar repositories for CRE-SFT
Users that are interested in CRE-SFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains code and data for the paper "TableEval: A Real-World Benchmark for Complex, Multilingual, and Multi-Structured T…☆28Jun 12, 2025Updated 9 months ago
- DeepLiterature: A fully open-source intelligent research assistant that integrates search, code execution, link resolution, and informati…☆104Mar 19, 2025Updated last year
- [COLING 2024 (Oral)] PromISe:Releasing the Capabilities of LLMs with Prompt Introspective Search☆23Aug 26, 2024Updated last year
- Classic Chess game using x86 Assembly Language☆11Apr 23, 2019Updated 6 years ago
- Create a website for your Sessionize event within seconds.☆28Mar 17, 2026Updated last week
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 7 months ago
- ☆14Aug 5, 2019Updated 6 years ago
- This is the code of a agentic rag method with dynamic workflow.☆12Jan 22, 2026Updated 2 months ago
- An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer…☆796Mar 13, 2025Updated last year
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- Fully working chess game implemented in the x86 Intel Assembly language☆12Oct 3, 2022Updated 3 years ago
- Plagiarism Detection Approach for PAN 2015 Text Alignment task☆11May 11, 2018Updated 7 years ago
- Chrome Extension to detect Malicious Websites☆12May 29, 2024Updated last year
- This is a repository for DKI group concerning the LLM-related papers alongside with code.☆32Feb 27, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ACL2020] Effective Inter-Clause Modeling for End-to-End Emotion-Cause Pair Extraction☆55Mar 16, 2022Updated 4 years ago
- Develop a machine learning (ML) model for lung cancer detection using U-Net and DenseNet architectures. Achieve an accuracy of at least 9…☆10Dec 9, 2023Updated 2 years ago
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forge-retain Pareto Optimality☆20Oct 22, 2025Updated 5 months ago
- ☆38Apr 2, 2024Updated last year
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.☆26Feb 18, 2026Updated last month
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆23May 28, 2025Updated 9 months ago
- ☆19Jul 14, 2025Updated 8 months ago
- ☆21Jan 16, 2025Updated last year
- 8086 Assembly Chess☆11Feb 11, 2019Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆20May 15, 2025Updated 10 months ago
- A repository containing deep learning models and evaluation methods for enhancing medical image segmentation in Computed Tomography (CT) …☆20Jan 20, 2024Updated 2 years ago
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)☆14Aug 12, 2024Updated last year
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures☆31Jan 29, 2026Updated last month
- The official implementation of the iConference 2022 paper "Identifying Machine-Paraphrased Plagiarism".☆18Nov 19, 2022Updated 3 years ago
- Offical Repository of MetaAgent Program☆43Dec 2, 2025Updated 3 months ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆55Mar 17, 2026Updated last week
- [ICLR 2024] Towards Elminating Hard Label Constraints in Gradient Inverision Attacks☆14Feb 6, 2024Updated 2 years ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆28Aug 9, 2025Updated 7 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators☆50Dec 23, 2025Updated 3 months ago
- Official code and resources for the paper "EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation."☆23Dec 23, 2024Updated last year
- ☆10Oct 28, 2020Updated 5 years ago
- ☆28Feb 13, 2026Updated last month
- The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension☆12Oct 23, 2022Updated 3 years ago
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆316Aug 8, 2024Updated last year
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆25Nov 17, 2024Updated last year