fine-tune deepseek r1
☆125Feb 10, 2025Updated last year
Alternatives and similar repositories for deepseek-r1-llama-8b
Users that are interested in deepseek-r1-llama-8b are comparing it to the libraries listed below.
- The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.☆17Feb 19, 2025Updated last year
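- Python wrapper for Hikvision☆28Jan 5, 2024Updated 2 years ago
- Online teaching management system (online course learning platform)☆12Apr 12, 2023Updated 3 years ago
- COVID-19 classification in Python based on improved ResNet and VGG [source code & deployment tutorial]☆19Nov 20, 2023Updated 2 years ago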
- WaterNetV1. Official implementation of "WaterNet: An adaptive matching pipeline for segmenting water with volatile appearance", published…☆11Jul 21, 2023Updated 2 years ago
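- Deploys DIS high-accuracy dichotomous image segmentation with OpenCV and ONNXRuntime, with both C++ and Python versions of the program☆18Jan 2, 2024Updated 2 years ago
- Guide to training open-source speech recognition models on custom data☆13Oct 8, 2023Updated 2 years ago
- Wafer defect segmentation system based on improved CNN & FCN☆19Nov 22, 2023Updated 2 years ago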
- SeeSo(Eye-Tracking SDK) sample for iOS☆13Jan 5, 2024Updated 2 years ago
- The more you practice, the better you learn☆20Mar 9, 2026Updated last month
- Preference :"Size invariant circle detection"☆10Oct 21, 2019Updated 6 years ago
- ☆15Apr 3, 2025Updated last year
- YOLOv8-obb☆19Apr 22, 2024Updated last year
- Deep Transfer Learning for Weed Classification☆14May 10, 2023Updated 2 years ago
- A from-scratch (0 to 1) VLM fine-tune implemented without any framework (including pre-training and SFT)☆39Aug 22, 2025Updated 7 months ago
- ☆17Feb 13, 2021Updated 5 years ago
- ☆10Aug 16, 2022Updated 3 years ago
- Constrained learning using boxes for event-event relation extraction☆12Aug 5, 2022Updated 3 years ago
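- Graduation project: design and implementation of a deep-learning-based printed font recognition system☆16Aug 9, 2025Updated 8 months ago
- Fine-tuning and deploying DeepSeek-R1-Distill-Qwen-32B on 2 million medical data records☆162Feb 25, 2025Updated last year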
- ☆11Oct 3, 2021Updated 4 years ago
- Code for: "Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models"☆19Feb 2, 2022Updated 4 years ago
- Uses MobileNetV2 to classify images, based on TF-slim☆16Aug 11, 2018Updated 7 years ago
- Relation extraction model based on the BERT+Biaffine architecture☆12Feb 23, 2022Updated 4 years ago
- The project is based on YoloV3 and PyTorch to detect the national flag in the picture.☆11Aug 3, 2021Updated 4 years ago
- LoRA scripts for training a Chinese DeepSeek R1 large model☆13Mar 20, 2025Updated last year
- Transfer learning and fine-tuning with YAMNet☆21Jan 20, 2026Updated 3 months ago
- External Knowledge (Oxford Dictionary and ConceptNet etc.) fused Enhanced Event Causality Identification Model☆10Jan 11, 2022Updated 4 years ago
- Chinese Emergency Corpus, Shanghai University, Semantic Intelligence Lab☆12Apr 21, 2021Updated 4 years ago
- My solutions of the Titanic competition of Kaggle https://www.kaggle.com/c/titanic☆10May 8, 2022Updated 3 years ago
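- Knowledge graph visualization with vis.js, using the Flask framework and a neo4j database; supports querying nodes and displaying each node's force-directed knowledge graph☆17Mar 2, 2023Updated 3 years ago
- A demo using SuperGlue and SuperPoint to do the image matching task, based on PaddlePaddle.☆23Mar 16, 2022Updated 4 years ago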
- [EMNLP'22] Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset☆20Apr 4, 2023Updated 3 years ago
- Chinese few-shot NER☆16Aug 28, 2022Updated 3 years ago
- Using BiLSTM-CRF model for Chinese NER☆15Mar 1, 2018Updated 8 years ago
- Used LSTM and Graph Attention Mechanism to detect the causal relationship in a sentence☆12May 14, 2023Updated 2 years ago
- The official repository of the paper "Laplacian Gradient Consistency Prior for Flash Guided Non-Flash Image Denoising"☆13Nov 9, 2024Updated last year
- Code repository for ACL-IJCNLP 2021 paper 'Poisoning Knowledge Graph Embeddings via Relation Inference Patterns'☆14Oct 13, 2022Updated 3 years ago
- Vue-based visual layout with dynamically adjustable grid size, draggable and resizable items, grid layout and free layout (vue-gride-layout/dnd-gride)☆11Jul 20, 2018Updated 7 years ago