The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.
☆17Feb 19, 2025Updated last year
Alternatives and similar repositories for deepseek-r1-distill-llama-8b-lora
Users that are interested in deepseek-r1-distill-llama-8b-lora are comparing it to the libraries listed below.
- Nonrigid Iterative Closest Point Algorithm☆10Feb 19, 2016Updated 10 years ago
- fine-tune deepseek r1☆125Feb 10, 2025Updated last year
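- Application of pointer-generator networks to Chinese and English datasets☆17Mar 10, 2020Updated 6 years ago
- Tutorial on LoRA fine-tuning of deepseek using a multi-turn dialogue dataset☆60Dec 26, 2024Updated last year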
- ☆10Aug 16, 2022Updated 3 years ago
- Constrained learning using boxes for event-event relation extraction☆12Aug 5, 2022Updated 3 years ago
- The project is based on YoloV3 and PyTorch to detect the national flag in the picture.☆11Aug 3, 2021Updated 4 years ago
- External Knowledge (Oxford Dictionary and ConceptNet etc.) fused Enhanced Event Causality Identification Model☆10Jan 11, 2022Updated 4 years ago
- Chinese Emergency Corpus (中文突发事件语料库), Shanghai University, Semantic Intelligence Laboratory☆12Apr 21, 2021Updated 4 years ago
- My solutions of the Titanic competition of Kaggle https://www.kaggle.com/c/titanic☆10May 8, 2022Updated 3 years ago
- Used LSTM and Graph Attention Mechanism to detect the causal relationship in a sentence☆12May 14, 2023Updated 2 years ago
- PPAT: Progressive Graph Pairwise Attention Network for Event Causality Identification☆16Jun 7, 2024Updated last year
- We introduce a novel fine-grained causal reasoning dataset and present a series of novel tasks in NLP, from causality detection to event …☆15Apr 21, 2022Updated 3 years ago
- This GitHub repository provides an implementation of the paper "MAGNET: Multi-Label Text Classification using Attention-based Graph Neura…☆20Nov 2, 2023Updated 2 years ago
- Code for EMNLP 2023 long paper: An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extracti…☆19Feb 2, 2025Updated last year
- This repo contains our PyTorch implementation for the paper Selecting Optimal Context Sentences for Event-Event Relation Extraction.☆14Nov 25, 2023Updated 2 years ago
- ☆16Nov 25, 2022Updated 3 years ago
- This repo contains the codebase for the paper "Unifying Generative and Dense Retrieval for Sequential Recommendation".☆35Jun 16, 2025Updated 10 months ago
- ☆10Nov 12, 2020Updated 5 years ago
- Registration between 3d volume and 2d images.☆10Dec 21, 2018Updated 7 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- VesNet-RL: Simulation-based Reinforcement Learning for Real-World US Probe Navigation☆14Sep 27, 2023Updated 2 years ago
- Stress testing for large-model inference☆47Jul 31, 2025Updated 8 months ago
- Code for our ACL-2022 paper "Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument Extraction".☆20Jun 24, 2024Updated last year
- Study code for "Python3 Data Analysis and Mining Modeling in Practice"☆20Apr 14, 2018Updated 8 years ago
- Code for ISBI 2024 paper "Fully Differentiable Correlation-driven 2D/3D Registration for X-Ray to CT Image Fusion"☆10Aug 26, 2024Updated last year
- Automatic defect recognition in X-ray testing using computer vision☆13Dec 8, 2018Updated 7 years ago
- Region growing for automatic spine segmentation☆11Apr 1, 2020Updated 6 years ago
- Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024)☆14Jun 21, 2024Updated last year
- OpenVLA Lightweight Version (0.5B). It uses qwen2-0.5B and fine-tunes using mllm format, without occupying LLM's inherent tokens. It repre…☆17Jan 7, 2026Updated 3 months ago
- Code to BraTS 2023 challenge.☆15May 5, 2025Updated 11 months ago
- Code for our ACL-2023 paper AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model☆23Dec 14, 2023Updated 2 years ago
- Implementation of the paper LIMITR: Leveraging Local Information for Medical Image-Text Representation☆17Feb 8, 2024Updated 2 years ago
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆22Sep 5, 2025Updated 7 months ago
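- File server built on a vanilla front end and a Python Flask back end; supports remote viewing, downloading, and uploading of files, and can be reached from the public internet when the LAN is paired with NAT traversal☆21Apr 9, 2023Updated 3 years ago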
- ☆12Nov 26, 2020Updated 5 years ago
- Algorithms of image enhancement☆10Mar 19, 2019Updated 7 years ago
- Code for offline processing and evaluation of depth processing algorithms for the Kinect v2☆12May 28, 2023Updated 2 years ago
- LDM-Morph: Latent diffusion model guided deformable image registration☆15Jan 19, 2025Updated last year