The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.
☆17Feb 19, 2025Updated last year
Alternatives and similar repositories for deepseek-r1-distill-llama-8b-lora
Users that are interested in deepseek-r1-distill-llama-8b-lora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于deepseek、qwen3大模型,lora sft 医疗行业数据☆15Apr 10, 2026Updated 2 months ago
- fine-tune deepseek r1☆123Feb 10, 2025Updated last year
- ☆10Aug 16, 2022Updated 3 years ago
- ☆18Feb 20, 2024Updated 2 years ago
- Constrained learning using boxes for event-event relation extraction☆12Aug 5, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code of the paper “A Fin-BERT-based Event Extraction Method for Chinese Financial Domain”☆12May 22, 2024Updated 2 years ago
- A benchmark for assessing the strength of causal relationships between real-world events (EMNLP 2023).