llama2 finetuning with deepspeed and lora
☆176Jul 28, 2023Updated 2 years ago
Alternatives and similar repositories for llama2-lora-fine-tuning
Users that are interested in llama2-lora-fine-tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- llama fine-tuning with lora☆140May 8, 2024Updated 2 years ago
- The code of SKS☆15Mar 22, 2022Updated 4 years ago
- [Information Systems-2024] The official implemention of ACMR (Bert4XMR).☆11Sep 22, 2024Updated last year
- 简单易懂的LLaMA微调指南。☆414Jul 5, 2023Updated 2 years ago
- MSTI☆17Mar 6, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"☆15Jan 15, 2023Updated 3 years ago
- Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用☆14,712Apr 6, 2025Updated last year
- ☆83May 2, 2026Updated 3 weeks ago
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA☆239Aug 17, 2025Updated 9 months ago
- Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.☆97Feb 5, 2024Updated 2 years ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆221May 20, 2024Updated 2 years ago
- Chinese Word Segmentation task based on BERT and implemented in Pytorch☆14Aug 14, 2020Updated 5 years ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆163Oct 30, 2024Updated last year
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,337May 19, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆24Oct 10, 2025Updated 7 months ago
- Llama2-SFT, Llama-2-7B微调(transformers)/LORA(peft)/推理☆27Jul 26, 2023Updated 2 years ago
- Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on☆100Apr 24, 2024Updated 2 years ago
- Source code for the paper "A Medical Semantic-Assisted Transformer for Radiographic Report Generation"☆25Jun 23, 2023Updated 2 years ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- bert语言模型校验句子的通顺性☆15Aug 17, 2020Updated 5 years ago
- CIKM 2022: CorpusBrain: Pre-train a Generative Retrieval Model for Knowledge-Intensive Language Tasks☆34Aug 31, 2022Updated 3 years ago
- [ACL 2024] ValueBench: Towards Comprehensively Evaluating Value Orientations and Understanding of Large Language Models☆26Jan 11, 2025Updated last year
- The official code of ALLECS: A Lightweight Language Error Correction System☆11Mar 12, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆38Oct 2, 2024Updated last year
- rag base on langchain☆11Mar 1, 2024Updated 2 years ago
- 怎么训练一个LLM分词器☆152Jul 13, 2023Updated 2 years ago
- [EMNLP 2023] ALCUNA: Large Language Models Meet New Knowledge☆30Oct 30, 2023Updated 2 years ago
- ☆29Apr 30, 2024Updated 2 years ago
- ☆152Apr 8, 2026Updated last month
- Source code of paper "Alirector: Alignment-Enhanced Chinese Grammatical Error Corrector" (Findings of ACL 2024)☆14Mar 19, 2025Updated last year
- llama,chatglm 等模型的微调☆91Jul 18, 2024Updated last year
- Code for our EMNLP 2020 paper "Uncertainty-Aware Label Refinement for Sequence Labeling"☆22Oct 4, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Easy to Annotate Helitrons Unix-like command line.☆10Feb 14, 2024Updated 2 years ago
- Source code for ACL 2022 paper "Self-contrastive Decorrelation for Sentence Embeddings".☆26Mar 10, 2025Updated last year
- 中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)☆7,139Apr 19, 2026Updated last month
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆90Mar 23, 2025Updated last year
- youtube video recommendation(generation 4)☆21Oct 16, 2019Updated 6 years ago
- Handling long-running processes (like ML model predictions) inside a Flask app using Celery.☆12Jan 13, 2021Updated 5 years ago
- uni+BLE 开发的蓝牙调试工具(微信蓝牙调试工具)☆13Apr 27, 2021Updated 5 years ago