0xZee / DeepSeek-R1-FineTuningLinks
Fine-Tuning of DeepSeek-Style Reasoning Models | RL + Quantization Implementation
☆15Updated 7 months ago
Alternatives and similar repositories for DeepSeek-R1-FineTuning
Users that are interested in DeepSeek-R1-FineTuning are comparing it to the libraries listed below
Sorting:
- ☆17Updated 2 weeks ago
- A Comprehensive survey on business use cases of AI that help them thrive in the digital economy☆13Updated 4 years ago
- ☆20Updated last year
- 本项目主要对开源的MOSS SFT数据进行整理 ,转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面,共353w样本,MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数,共630w样本,☆12Updated last year
- open-llms-next-web,一个类似于chatgpt-next-web的开源大型语言模型web演示,支持离线开源大模型和PEFT模型☆18Updated last year
- help kids learn python☆36Updated this week
- MNBVC项目-ShareGPT语料清洗☆15Updated last year
- [EMNLP 2024] PsyGUARD: An Automated System for Suicide Detection and Risk Assessment in Psychological Counseling☆15Updated 4 months ago
- By leveraging Bocha AI Search API , your AI applications can now access high-quality, up-to-date knowledge from billions of web pages and…☆21Updated 7 months ago
- 首个中文心理咨询对话安全检测数据集☆20Updated last year
- ☆20Updated 12 years ago
- Cloning Yourself using your whatsapp chat history and training a model on it.☆15Updated last year
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Updated last year
- SiliconCloud Cookbook☆22Updated 6 months ago
- ☆23Updated 2 years ago
- An instruction tuned large language model with extra support for poetry and verse generation☆24Updated 2 years ago
- A benchmark corpus of 100 English novels, covering the 19th and the beginning of the 20th century☆22Updated 3 years ago
- ☆22Updated 2 years ago
- NextChat mcp server collection☆27Updated 8 months ago
- 基于CEC语料库挖掘要素识别规则,对新闻报道类生语料进行自动标注☆20Updated 10 years ago
- Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Ka…☆21Updated 7 months ago
- ☆19Updated 2 years ago
- Hybrid Deep Sequential Modeling for Social Text-Driven Stock Prediction-Dataset☆21Updated 7 years ago
- 基于chatgpt-next-web 增强版本,后台管理,接入知识库等。将按需持续接入midjourney绘画功能,接入了stable-diffusion,支持oss,支持dall-e-3、gpt-4-vision-preview、whisper、tts,支持gpt-4-a…☆36Updated last year
- Extract templated Open Information Extraction☆17Updated 8 years ago
- This repository will contain a demo using Weaviate with data and metadata from the arXiv dataset.☆16Updated 3 years ago
- ☆41Updated 2 years ago
- 本项目的数据来自“互联网新闻情感分析”赛题。使用Bert-As-Service库中的中文Bert模型进行句向量的提取,加入全连接层后进行三分类。☆29Updated 5 years ago
- Exploration: using technology to aid people who lack both the ability to speak and fine motor control.☆21Updated 10 months ago
- ☆29Updated 10 months ago