The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.
☆17Feb 19, 2025Updated last year
Alternatives and similar repositories for deepseek-r1-distill-llama-8b-lora
Users that are interested in deepseek-r1-distill-llama-8b-lora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 用于训练中文DeepSeek R1大模型 的Lora脚本☆13Mar 20, 2025Updated last year
- 知识图谱问答☆14Mar 11, 2021Updated 5 years ago
- Nonrigid Iterative Closest Point Algorithm☆10Feb 19, 2016Updated 10 years ago
- fine-tune deepseek r1☆125Feb 10, 2025Updated last year
- ☆10Aug 16, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A benchmark for assessing the strength of causal relationships between real-world events (EMNLP 2023).☆15Nov 23, 2023Updated 2 years ago
- ☆10Sep 7, 2022Updated 3 years ago
- The project is based on YoloV3 and PyTorch to detect the national flag in the picture.☆11Aug 3, 2021Updated 4 years ago
- 中文突发事件语料库(Chinese Emergency Corpus)-上海大学-语义智能实验室☆12Apr 21, 2021Updated 5 years ago
- Fine-tune Qwen2.5-VL-7B on custom visual QA tasks using LoRA + Accelerate, supporting single/multi-GPU training on COCO 2014 dataset.☆29Apr 28, 2025Updated last year
- ☆19Jan 13, 2022Updated 4 years ago
- ☆14Jun 26, 2023Updated 2 years ago
- Knowledge-enriched and Attention Guided Network, which is used for event causality identification.☆15Sep 14, 2022Updated 3 years ago
- ☆16Sep 17, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Label-Representative Graph Convolutional Network for Multi-Label Text Classification☆18Sep 20, 2022Updated 3 years ago
- We introduce a novel fine-grained causal reasoning dataset and present a series of novel tasks in NLP, from causality detection to event …☆15Apr 21, 2022Updated 4 years ago
- In-context Contrastive Learning for Event Causality Identification☆15Oct 15, 2024Updated last year
- Document-Level Multi-Event Extraction with Event Proxy Nodes and Hausdorff Distance Minimization☆17Nov 3, 2023Updated 2 years ago
- This GitHub repository provides an implementation of the paper "MAGNET: Multi-Label Text Classification using Attention-based Graph Neura…☆20Nov 2, 2023Updated 2 years ago
- Code for EMNLP 2023 long paper: An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extracti…☆19Feb 2, 2025Updated last year
- 用C++ qt库编写的日历小程序,可以查询1900-2100年的日期,农历,星座,以及生肖,可缩小至任务栏.☆17May 21, 2015Updated 10 years ago
- ☆16Nov 25, 2022Updated 3 years ago
- The GitHub repository for the paper "Reinforcement Learning-based Dialogue Guided Event Extraction to Exploit Argument Relations"☆23Oct 31, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Registration between 3d volume and 2d images.☆10Dec 21, 2018Updated 7 years ago
- Multi-modal 3D ultrasound and CT in image-guided spinal surgery: public database and new registration algorithms☆13Mar 9, 2023Updated 3 years ago
- Llama3 Streaming Chat Sample☆22Apr 24, 2024Updated 2 years ago
- VesNet-RL: Simulation-based ReinforcementLearning for Real-World US Probe Navigation☆14Sep 27, 2023Updated 2 years ago
- ☆10Mar 1, 2021Updated 5 years ago
- 大模型推理压测☆47Jul 31, 2025Updated 9 months ago
- Visual SLAM from RGB-D data using Microsoft Kinect☆10May 13, 2016Updated 9 years ago
- Parallelize the serial implementation of 3D scene reconstruction with input from kinect sensor and run it on NvidiaGPU using CUDA.☆12Nov 2, 2016Updated 9 years ago
- The code of CL4CTR☆48Jul 1, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Python3数据分析与挖掘建模实战 学习代码☆20Apr 14, 2018Updated 8 years ago
- Pyramid Attention Network for Medical Image Registration (ISBI 2024)☆16Feb 6, 2025Updated last year
- Code to BraTS 2023 challenge.☆15May 5, 2025Updated last year
- Code for our ACL-2023 paper AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model☆23Dec 14, 2023Updated 2 years ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 7 months ago
- Official implementation of the paper, Revisiting Event Argument Extraction: Can EAE Models Learn Better When Being Aware of Event Co-occu…☆25Jul 18, 2023Updated 2 years ago
- 基于原生前端和 Python Flask 后端的文件服务器,可远程查看、下载和上传文件,局域网搭配内网穿透可实现公网访问☆21Apr 9, 2023Updated 3 years ago