LLaMA3-SFT: Meta-Llama-3-8B/Meta-Llama-3-8B-Instruct fine-tuning (transformers), LoRA (peft), and inference, with Chinese (zh) support
☆34May 17, 2024Updated last year
Alternatives and similar repositories for LLaMA3-SFT
Users who are interested in LLaMA3-SFT are comparing it to the libraries listed below.
- Gemma-SFT: gemma-2b/gemma-7b fine-tuning (transformers), LoRA (peft), and inference☆32May 17, 2024Updated last year
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Oct 2, 2019Updated 6 years ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- ☆10Aug 16, 2022Updated 3 years ago
- The implementation of STAR-HiT.☆11Oct 18, 2023Updated 2 years ago
- The Detectron 2 MaskRCNN implementation for the PanNuke Dataset (https://jgamper.github.io/PanNukeDataset/).☆14Jun 25, 2021Updated 4 years ago
- Full-parameter fine-tuning, LoRA fine-tuning, and QLoRA fine-tuning of Llama 3.☆219Oct 4, 2024Updated last year
- Analysis of NBA player stats and salaries of the 2016-17 for the 17-18 season☆10Aug 10, 2017Updated 8 years ago
- The pytorch implementation of DisenPOI.☆24Oct 18, 2023Updated 2 years ago
- ☆19Sep 3, 2024Updated last year
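- Fine-tunes Alibaba's open-source text detection model, using OCR results returned by the 合合 (Hehe) recognition service as initial training data, so that the model better fits a specific scenario of 10,000 images and recognizes text more accurately.☆10Dec 9, 2024Updated last year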
- Emotional First Aid Raw Dataset, a raw corpus of psychological counseling questions and answers☆22Mar 6, 2026Updated 2 months ago
- A framework for Ranked List Truncation, including the implementation of multiple existing deep models, such as BiCut, Choopy and AttnCut. …☆14May 7, 2022Updated 4 years ago
- Multi-Object Tracking with Ultralytics YOLO11☆13Oct 5, 2024Updated last year
- Predicting the medal table of the Summer Games☆12Jul 6, 2023Updated 2 years ago
- Llama2-SFT: Llama-2-7B fine-tuning (transformers), LoRA (peft), and inference☆27Jul 26, 2023Updated 2 years ago
- The official PyTorch implementation for 2024-ICASSP-Adaptive Spatial-Temporal Hypergraph Fusion Learning for Next POI Recommendation☆13Sep 8, 2024Updated last year
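- Fine-tunes the Qwen1.5-0.5B-Chat model for general information extraction, aiming to: verify how a generative approach compares with extractive NER; give beginners a simple fine-tuning workflow with as little code as possible; and cover data format processing for LLM training.☆14Sep 6, 2024Updated last year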
- Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)☆19Jun 12, 2024Updated last year
- This repository hosts the official implementation of MAS4POI: A Multi-Agent System Collaboration for Next POI Recommendation, accepted by…☆16Apr 24, 2026Updated 2 weeks ago
- Applying Deep Reinforcement Learning for dialogue generation. aka chatbot☆13Apr 30, 2017Updated 9 years ago
- The implementation of ROTAN: A Rotation-based Temporal Attention Network for Time-Specific Next POI Recommendation published in KDD 2024☆17Jun 12, 2024Updated last year
- This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.☆14May 2, 2024Updated 2 years ago
- Qwen1.5-SFT (Alibaba): Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat fine-tuning (transformers), LoRA (peft), and inference☆73May 17, 2024Updated last year
- Langchain_CrewAI_Gemini - A Gemini AI-powered AI Agent (Multi-Agent) Project.☆14Mar 24, 2024Updated 2 years ago
- LBSN based on foursquare dataset☆14Apr 26, 2019Updated 7 years ago
- Patterns-of-Life simulation☆17Jul 23, 2023Updated 2 years ago
- ☆17May 13, 2023Updated 2 years ago
- upload a new programing,something like☆27Nov 30, 2019Updated 6 years ago
- The implementation for the NeurIPS 2022 paper Parameter-free Dynamic Graph Embedding for Link Prediction.☆16Dec 7, 2022Updated 3 years ago
- Large Language-and-Vision Assistant for BioMedicine, built towards multimodal GPT-4 level capabilities.☆10Nov 29, 2023Updated 2 years ago
- A simple, easy-to-hack GraphRAG implementation☆15Sep 21, 2024Updated last year
- Code for the paper "CoS: Enhancing Personalization and Mitigating Bias with Context Steering"☆20Dec 13, 2024Updated last year
- Portable TCP/UDP/ICMP traceroute tool, written in Python☆17Apr 18, 2020Updated 6 years ago
- [AAAI2025] FedCFA: Alleviating Simpson’s Paradox in Model Aggregation with Counterfactual Federated Learning☆24Jan 23, 2025Updated last year
- ☆25Feb 21, 2026Updated 2 months ago
- Code and data for "Medical Dialogue Generation via Dual Flow Modeling" (ACL 2023 Findings)☆14Nov 22, 2023Updated 2 years ago
- Nano-BERT is a straightforward, lightweight and comprehensible custom implementation of BERT, inspired by the foundational "Attention is …☆21Oct 19, 2023Updated 2 years ago
- A Python-based script for scheduled ticket grabbing on 12306☆23Mar 31, 2026Updated last month