llama2 finetuning with deepspeed and lora
☆176Jul 28, 2023Updated 2 years ago
Alternatives and similar repositories for llama2-lora-fine-tuning
Users that are interested in llama2-lora-fine-tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Llama2 chinese finetuning☆38Aug 2, 2023Updated 2 years ago
- llama fine-tuning with lora☆140May 8, 2024Updated last year
- The code of SKS☆15Mar 22, 2022Updated 4 years ago
- Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.☆620Jan 24, 2025Updated last year
- [Information Systems-2024] The official implemention of ACMR (Bert4XMR).☆11Sep 22, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 简单易懂的LLaMA微调指南。☆414Jul 5, 2023Updated 2 years ago
- Implementation of SATA Tree-LSTM (Dynamic Compositionality in Recursive Neural Networks with Structure-aware Tag Representations, AAAI 20…☆10Jun 21, 2022Updated 3 years ago
- MSTI☆16Mar 6, 2024Updated 2 years ago
- Code for the ACL 2022 (Long paper): "New Intent Discovery with Pre-training and Contrastive Learning".☆14Jul 18, 2022Updated 3 years ago
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆27Mar 9, 2026Updated 3 weeks ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"☆15Jan 15, 2023Updated 3 years ago
- Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.☆97Feb 5, 2024Updated 2 years ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆221May 20, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Universal information extraction with instruction learning☆394Feb 28, 2025Updated last year
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆27Feb 10, 2026Updated last month
- ☆43Dec 15, 2023Updated 2 years ago
- Repo for ACL2023 paper "Won't Get Fooled Again: Answering Questions with False Premises"☆22Jun 11, 2023Updated 2 years ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆161Oct 30, 2024Updated last year
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,265Mar 3, 2026Updated 3 weeks ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆24Oct 10, 2025Updated 5 months ago
- Llama2-SFT, Llama-2-7B微调(transformers)/LORA(peft)/推理☆27Jul 26, 2023Updated 2 years ago
- Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"☆20Jun 12, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on☆101Apr 24, 2024Updated last year
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- [ACL 2024] ValueBench: Towards Comprehensively Evaluating Value Orientations and Understanding of Large Language Models☆26Jan 11, 2025Updated last year
- LLM, Fine Tuning, Llama 2, Gemma, Mixtral, vLLM, LangChain, RAG, ChromaDB, FAISS☆13Mar 5, 2024Updated 2 years ago
- 怎么训练一个LLM分词器☆152Jul 13, 2023Updated 2 years ago
- [EMNLP 2023] ALCUNA: Large Language Models Meet New Knowledge☆30Oct 30, 2023Updated 2 years ago
- ☆29Apr 30, 2024Updated last year
- Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024☆30Dec 19, 2024Updated last year
- ☆144Updated this week
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- llama,chatglm 等模型的微调☆91Jul 18, 2024Updated last year
- Code for our EMNLP 2020 paper "Uncertainty-Aware Label Refinement for Sequence Labeling"☆22Oct 4, 2020Updated 5 years ago
- Source code for ACL 2022 paper "Self-contrastive Decorrelation for Sentence Embeddings".☆26Mar 10, 2025Updated last year
- 中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)☆7,161Jul 15, 2025Updated 8 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆87Mar 23, 2025Updated last year
- youtube video recommendation(generation 4)☆21Oct 16, 2019Updated 6 years ago
- Official repository for ODQA experiments from Decomposed Prompting: A Modular Approach for Solving Complex Tasks, ICLR23☆12Jul 28, 2023Updated 2 years ago