LLaMA3-SFT, Meta-Llama-3-8B/Meta-Llama-3-8B-Instruct fine-tuning (transformers)/LoRA (peft)/inference, with Chinese (zh) support
☆34May 17, 2024Updated 2 years ago
Alternatives and similar repositories for LLaMA3-SFT
Users that are interested in LLaMA3-SFT are comparing it to the libraries listed below.
- Fine-tuning of the Qwen1.5 large model with LoRA via the PEFT framework, applied to text classification on the HC3-Chinese dataset.☆12Jun 29, 2024Updated last year
- Gemma-SFT, gemma-2b/gemma-7b fine-tuning (transformers)/LoRA (peft)/inference☆32May 17, 2024Updated 2 years ago
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Oct 2, 2019Updated 6 years ago
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆15Jun 7, 2023Updated 2 years ago
- An open-source UNet-based pipeline for nuclei segmentation in histopathology images using the PanNuke dataset. It features an interactive…☆11Jan 9, 2025Updated last year
- ☆10Aug 16, 2022Updated 3 years ago
- The implementation of STAR-HiT.☆11Oct 18, 2023Updated 2 years ago
- ☆15Nov 12, 2025Updated 6 months ago
- Full-parameter, LoRA, and QLoRA fine-tuning of Llama 3.☆220Oct 4, 2024Updated last year
- The pytorch implementation of DisenPOI.☆24Oct 18, 2023Updated 2 years ago
- Solves the Vehicle Routing Problem (VRP) using Column Generation (CG). It is made as an inspiration to use CG in more projects, since it …☆10Nov 2, 2022Updated 3 years ago
- ☆19Sep 3, 2024Updated last year
- Emotional First Aid Raw Dataset, a raw corpus of psychological counseling Q&A☆22Mar 6, 2026Updated 2 months ago
- Multi-Object Tracking with Ultralytics YOLO11☆13Oct 5, 2024Updated last year
- ☆10Mar 19, 2024Updated 2 years ago
- Llama2-SFT, Llama-2-7B fine-tuning (transformers)/LoRA (peft)/inference☆27Jul 26, 2023Updated 2 years ago
- The official PyTorch implementation for 2024-ICASSP-Adaptive Spatial-Temporal Hypergraph Fusion Learning for Next POI Recommendation☆13Sep 8, 2024Updated last year
- A tool library for riichi mahjong written in Rust, made mostly to be used as a WASM component.☆12Aug 29, 2025Updated 9 months ago
- [ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"☆21Jul 23, 2025Updated 10 months ago
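- Fine-tunes the Qwen1.5-0.5B-Chat model on a general information extraction task, aiming to: evaluate the generative approach against extractive NER; give beginners a simple model fine-tuning workflow with as little code as possible; and handle data formatting for large-model training.☆14Sep 6, 2024Updated last year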
- ☆14Oct 8, 2024Updated last year
- Chinese text error correction using a pinyin tree and edit distance☆13Jul 19, 2019Updated 6 years ago
- Code for my own blog☆10Nov 7, 2013Updated 12 years ago
- Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)☆19Jun 12, 2024Updated last year
- Applying Deep Reinforcement Learning for dialogue generation. aka chatbot☆13Apr 30, 2017Updated 9 years ago
- The implementation of ROTAN: A Rotation-based Temporal Attention Network for Time-Specific Next POI Recommendation published in KDD 2024☆19Jun 12, 2024Updated last year
- Qwen1.5-SFT (Alibaba), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat fine-tuning (transformers)/LoRA (peft)/inference☆73May 17, 2024Updated 2 years ago
- ☆13Jul 14, 2021Updated 4 years ago
- Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"☆23Feb 17, 2025Updated last year
- ☆17May 13, 2023Updated 3 years ago
- A real-time diffusion MRI viewer for Linux and Windows using OpenGL 4.6.☆10Apr 23, 2025Updated last year
- Code for Engel, Grossmann & Ockenfels☆20Jan 2, 2026Updated 4 months ago
- Large Language-and-Vision Assistant for BioMedicine, built towards multimodal GPT-4 level capabilities.☆10Nov 29, 2023Updated 2 years ago
- ☆29Jul 17, 2025Updated 10 months ago
- Code for the paper "CoS: Enhancing Personalization and Mitigating Bias with Context Steering"☆20Dec 13, 2024Updated last year
- A simple, easy-to-hack GraphRAG implementation☆15Sep 21, 2024Updated last year
- ☆39Jan 19, 2026Updated 4 months ago
- Undergraduate graduation project: a VTK-based 3D visualization platform☆12Jun 13, 2019Updated 6 years ago
- DynaPlex is a software library for formulating and solving Markov Decision Problems, written primarily in C++20☆17Jun 19, 2025Updated 11 months ago