The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.
☆17Feb 19, 2025Updated last year
Alternatives and similar repositories for deepseek-r1-distill-llama-8b-lora
Users that are interested in deepseek-r1-distill-llama-8b-lora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于deepseek、qwen3大模型,lora sft 医疗行业数据☆15Apr 10, 2026Updated last month
- fine-tune deepseek r1☆124Feb 10, 2025Updated last year
- A repository dedicated to learning about ChatGPT training techniques and related knowledge. Contains study notes, code snippets, and reso…☆13Dec 14, 2024Updated last year
- nku(Nankai University)南开大学操作系统课程实验 2024Fall☆12Dec 18, 2024Updated last year
- ☆10Aug 16, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Constrained learning using boxes for event-event relation extraction☆12Aug 5, 2022Updated 3 years ago
- Personal course work for NKU-COSC0018-Computer Architecture. WARNING: only for references;☆24Aug 18, 2024Updated last year
- Code of the paper “A Fin-BERT-based Event Extraction Method for Chinese Financial Domain”☆12May 22, 2024Updated 2 years ago
- A benchmark for assessing the strength of causal relationships between real-world events (EMNLP 2023).☆15Nov 23, 2023Updated 2 years ago
- The final project of NKU 2022 Computer Architecture. 南开大学2022体系结构大作业。☆10Sep 25, 2023Updated 2 years ago
- ☆10Sep 7, 2022Updated 3 years ago
- vanna.ai demo☆31May 1, 2024Updated 2 years ago
- Fine-tune Qwen2.5-VL-7B on custom visual QA tasks using LoRA + Accelerate, supporting single/multi-GPU training on COCO 2014 dataset.☆29Apr 28, 2025Updated last year
- My solutions of the Titanic competition of Kaggle https://www.kaggle.com/c/titanic☆10May 8, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [EMNLP'22] Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset☆20Apr 4, 2023Updated 3 years ago
- 2023年南开大学操作系统课程实验指北(助教小木个人版)☆17Nov 1, 2023Updated 2 years ago
- 针对南开大学张春玲版《大学基础物理实验》的数据处理excel,欢迎contribute。☆38Mar 19, 2025Updated last year
- ☆16Sep 17, 2021Updated 4 years ago
- PPAT: Progressive Graph Pairwise Attention Network for Event Causality Identification☆16Jun 7, 2024Updated last year
- Label-Representative Graph Convolutional Network for Multi-Label Text Classification☆18Sep 20, 2022Updated 3 years ago
- This repository contains the source code related to the paper Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation☆11Jun 23, 2020Updated 5 years ago
- This GitHub repository provides an implementation of the paper "MAGNET: Multi-Label Text Classification using Attention-based Graph Neura…☆20Nov 2, 2023Updated 2 years ago
- A stateful multi-agent travel service system built on LangChain & LangGraph. Features intelligent task delegation, permission control, an…☆54Jan 10, 2026Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Multi-modal 3D ultrasound and CT in image-guided spinal surgery: public database and new registration algorithms☆13Mar 9, 2023Updated 3 years ago
- Llama3 Streaming Chat Sample☆22Apr 24, 2024Updated 2 years ago
- ☆12Sep 23, 2022Updated 3 years ago
- VesNet-RL: Simulation-based ReinforcementLearning for Real-World US Probe Navigation☆14Sep 27, 2023Updated 2 years ago
- 大模型推理压测☆48Jul 31, 2025Updated 9 months ago
- Code for our ACL-2022 paper "Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument Extraction".☆20Jun 24, 2024Updated last year
- Python3数据分析与挖掘建模实战 学习代码☆21Apr 14, 2018Updated 8 years ago
- ☆20Nov 21, 2019Updated 6 years ago
- Dual Quaternion implementation in python.☆11Nov 30, 2016Updated 9 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Pyramid Attention Network for Medical Image Registration (ISBI 2024)☆16Feb 6, 2025Updated last year
- Region growing for automatic spine segmentation☆11Apr 1, 2020Updated 6 years ago
- Code for our ACL-2023 paper AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model☆23Dec 14, 2023Updated 2 years ago
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆22Sep 5, 2025Updated 8 months ago
- Repository for Master thesis project investigating classification of 3D chest CT scans using Vision Transformer.☆15Aug 29, 2023Updated 2 years ago
- Official implementation of the paper, Revisiting Event Argument Extraction: Can EAE Models Learn Better When Being Aware of Event Co-occu…☆25Jul 18, 2023Updated 2 years ago
- SymTrans: A symmetric Transformer-based model for image registration.☆14May 27, 2022Updated 4 years ago