中文基于满血DeepSeek-R1蒸馏数据集
☆64Feb 21, 2025Updated last year
Alternatives and similar repositories for Chinese-Data-Distill-From-R1
Users that are interested in Chinese-Data-Distill-From-R1 are comparing it to the libraries listed below
Sorting:
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated last year
- [ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆39May 28, 2025Updated 9 months ago
- ☆51Oct 28, 2024Updated last year
- ☆17Aug 28, 2025Updated 6 months ago
- Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction"☆10Feb 17, 2021Updated 5 years ago
- ☆26Aug 7, 2025Updated 7 months ago
- ☆16Sep 17, 2024Updated last year
- ☆10Sep 2, 2023Updated 2 years ago
- ☆26Mar 10, 2026Updated last week
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆71Sep 13, 2025Updated 6 months ago
- The official repo of INF-34B models trained by INF Technology.☆34Jul 25, 2024Updated last year
- LinkMind is an enterprise-level composite multimodal large model middleware.☆19Updated this week
- make LLM easier to use☆59Jul 4, 2023Updated 2 years ago
- ☆41Apr 30, 2025Updated 10 months ago
- SimX-OR: Extending Any Simulation Benchmark to Evaluate the Observational Robustness of VLA Models☆32Nov 4, 2025Updated 4 months ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 4 months ago
- 从Docker官方Ubuntu镜像,定制中国地区使用的对应镜像。☆13Jan 13, 2021Updated 5 years ago
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆16Oct 20, 2025Updated 5 months ago
- CamRest676 is an English data set, I translate it into Chinese for training nlu.☆12Dec 20, 2017Updated 8 years ago
- A Top-Down Profiler for GPU Applications☆22Feb 29, 2024Updated 2 years ago
- ☆19May 3, 2025Updated 10 months ago
- ☆16Jun 10, 2025Updated 9 months ago
- [NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training☆14Oct 25, 2024Updated last year
- A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases☆14Mar 22, 2022Updated 4 years ago
- Official code for DeepSound-V1☆13May 14, 2025Updated 10 months ago
- Github Repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition☆18Apr 16, 2025Updated 11 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆136Jun 5, 2024Updated last year
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆16Aug 15, 2025Updated 7 months ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆22Jun 26, 2025Updated 8 months ago
- ☆41Updated this week
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆34Feb 4, 2026Updated last month
- ☆33May 27, 2025Updated 9 months ago
- The official code of our paper “RAG-Critic: Leveraging Automated Critic-Guided Agentic Workflow for Retrieval Augmented Generation”☆27Aug 19, 2025Updated 7 months ago
- SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data☆21Jan 24, 2026Updated last month
- [MICCAI 2025] GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Images☆15Mar 12, 2026Updated last week
- Finetuning a codegen model with python instruction set using QLORA technique for better efficacy☆11Aug 31, 2023Updated 2 years ago
- 使用ONNXRuntime部署StyleGAN人像卡通画,包含C++和Python两个版本的程序☆22Jul 9, 2022Updated 3 years ago
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 3 years ago
- 中文短文本数据集,用于短文本分类研究,涉及情感分类、多分类等,发布的中文公开短文本数据集☆19Aug 16, 2024Updated last year