collecting publicly available distillation datasets based on DepSeek-R1
☆27Mar 12, 2025Updated last year
Alternatives and similar repositories for DeepSeek-Distillation
Users that are interested in DeepSeek-Distillation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A general framework used on evaluating the performance of large language models (LLMs) based on the peer review mechanism among LLMs☆19Aug 3, 2024Updated last year
- Repo. for RLCF.☆15Apr 1, 2024Updated 2 years ago
- An evaluation framework to test AI in a trial-and-error process. It is a simplified Natural Selection test.☆22Mar 11, 2025Updated last year
- Model-based Hindsight Experience Replay☆10Jun 8, 2022Updated 4 years ago
- ☆13Oct 28, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [EMNLP-2025] R1-Zero on ANY TASK☆31Nov 9, 2025Updated 7 months ago
- From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking☆14Oct 25, 2022Updated 3 years ago
- Official Implementation for the paper "VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models"☆23Aug 14, 2025Updated 10 months ago
- ☆12Jul 4, 2022Updated 3 years ago
- Another Wheel to parse json☆11Mar 13, 2020Updated 6 years ago
- Code for AAAI 2024 paper Wikiformer☆20Dec 21, 2023Updated 2 years ago
- PyTorch implementation for our paper "Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation"☆13Apr 19, 2023Updated 3 years ago
- A collection of papers and libraries for performing multi-agent optimization☆19Jun 6, 2026Updated last week
- ☆13Mar 13, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A toolkit for building dense retrievers with deep language models.☆64Sep 24, 2021Updated 4 years ago
- Code for I3 Retriever, accepted by CIKM'23.☆53Oct 22, 2023Updated 2 years ago
- [COLING 2025] "Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language Models"☆22Dec 18, 2024Updated last year
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆24Jan 6, 2026Updated 5 months ago
- ☆42Apr 8, 2026Updated 2 months ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆24Feb 15, 2023Updated 3 years ago
- Official Implementation for "Purifying Quantization-conditioned Backdoors via Layer-wise Activation Correction with Distribution Approxim…☆12Aug 14, 2024Updated last year
- 基于文本相似度的win10智能客服问答系统☆16Mar 12, 2020Updated 6 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)☆19Jul 19, 2025Updated 11 months ago
- A benchmark dataset designed to support the development and evaluation of large language models (LLMs) for conversational mental health a…☆21Feb 24, 2025Updated last year
- ☆46Mar 4, 2025Updated last year
- SIGIR 2022: GERE: Generative Evidence Retrieval for Fact Verification☆20Jul 19, 2022Updated 3 years ago
- Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.☆17Jun 23, 2021Updated 4 years ago
- An all-in-one framework for Ad-hoc Information Retrieval.☆18Apr 3, 2024Updated 2 years ago
- A teleoperation framework with joint-level master-slave isomorphic mapping and end-effector pose teleoperation for Franka Research 3, bui…☆40Mar 23, 2026Updated 2 months ago
- 基于区块链的商品溯源系统☆10Mar 11, 2021Updated 5 years ago
- 基于simcse的中文句向量生成☆16Jun 8, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Open source echographic image cleaning package. This package provides the tools to automatically remove watermarks, crop the echographic …☆14Sep 12, 2024Updated last year
- A web-based tool that converts Claude Code CLI conversation logs (JSONL format) into human-readable Markdown. Features a built-in file ex…☆65May 12, 2026Updated last month
- ☆23Sep 18, 2024Updated last year
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆35Feb 26, 2026Updated 3 months ago
- ☆24Oct 14, 2024Updated last year
- PYCON 2018 and BACH 2018 breast cancer histology challenge☆14Nov 23, 2018Updated 7 years ago
- ☆10Mar 28, 2022Updated 4 years ago