用于训练中文DeepSeek R1大模型的Lora脚本
☆13Mar 20, 2025Updated last year
Alternatives and similar repositories for DeepSeek_R1_LoraTrain
Users that are interested in DeepSeek_R1_LoraTrain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.☆17Feb 19, 2025Updated last year
- Constrained learning using boxes for event-event relation extraction☆12Aug 5, 2022Updated 3 years ago
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated last year
- Code of the paper “A Fin-BERT-based Event Extraction Method for Chinese Financial Domain”☆12May 22, 2024Updated last year
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Jun 21, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A benchmark for assessing the strength of causal relationships between real-world events (EMNLP 2023).☆15Nov 23, 2023Updated 2 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Apr 24, 2026Updated last week
- ☆10Sep 7, 2022Updated 3 years ago
- The project is based on YoloV3 and PyTorch to detect the national flag in the picture.☆11Aug 3, 2021Updated 4 years ago
- ☆19Mar 4, 2025Updated last year
- ROS 2 Packages for Testing LIO-SAM on a Robotic Vehicle with 3D LIDAR and 9-Axis IMU☆13Dec 2, 2023Updated 2 years ago
- ☆13Jun 29, 2024Updated last year
- Tutorials of Extending and importing TVM with CMAKE Include dependency.☆16Oct 11, 2024Updated last year
- My solutions of the Titanic competition of Kaggle https://www.kaggle.com/c/titanic☆10May 8, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆14Nov 17, 2025Updated 5 months ago
- [EMNLP'22] Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset☆20Apr 4, 2023Updated 3 years ago
- ☆11Nov 14, 2022Updated 3 years ago
- Deep Introspective SLAM: Deep Reinforcement Learning based Approach to Avoid Tracking Failure in Visual SLAM☆11Jul 31, 2021Updated 4 years ago
- Used LSTM and Graph Attention Mechanism to detect the causal relationship in a sentence☆12May 14, 2023Updated 2 years ago
- ☆36Oct 14, 2020Updated 5 years ago
- ☆19Jan 13, 2022Updated 4 years ago
- This is the proof-of-concept CPU implementation of ASPEN used for the NeurIPS'23 paper ASPEN: Breaking Operator Barriers for Efficient Pa…☆13Apr 4, 2024Updated 2 years ago
- ☆14Jun 26, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Python implementation of a Genetic Algorithm for the Resource-Constrained Project Scheduling Problem☆14May 29, 2023Updated 2 years ago
- Knowledge-enriched and Attention Guided Network, which is used for event causality identification.☆15Sep 14, 2022Updated 3 years ago
- ☆16Sep 17, 2021Updated 4 years ago
- PPAT: Progressive Graph Pairwise Attention Network for Event Causality Identification☆16Jun 7, 2024Updated last year
- ☆11Mar 28, 2024Updated 2 years ago
- We introduce a novel fine-grained causal reasoning dataset and present a series of novel tasks in NLP, from causality detection to event …☆15Apr 21, 2022Updated 4 years ago
- GoPTX: Fine-grained GPU Kernel Fusion by PTX-level Instruction Flow Weaving☆19Jul 30, 2025Updated 9 months ago
- This GitHub repository provides an implementation of the paper "MAGNET: Multi-Label Text Classification using Attention-based Graph Neura…☆20Nov 2, 2023Updated 2 years ago
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels☆51Mar 31, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for EMNLP 2023 long paper: An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extracti…☆19Feb 2, 2025Updated last year
- This repo contains our PyTorch implementation for the paper Selecting Optimal Context Sentences for Event-Event Relation Extraction.☆14Nov 25, 2023Updated 2 years ago
- Tencent Distribution of TVM☆16Apr 7, 2023Updated 3 years ago
- 用C++ qt库编写的日历小程序,可以查询1900-2100年的日期,农历,星座,以及生肖,可缩小至任务栏.☆17May 21, 2015Updated 10 years ago
- Region Proposal generation on images using clustering in Pointcloud - Currently only for Pedestrians☆11Jul 13, 2020Updated 5 years ago
- Using OpenVINO to accelerate HF-Net☆12Aug 7, 2025Updated 8 months ago
- ☆10Mar 22, 2024Updated 2 years ago