llama2 finetuning with deepspeed and lora
☆176Jul 28, 2023Updated 2 years ago
Alternatives and similar repositories for llama2-lora-fine-tuning
Users that are interested in llama2-lora-fine-tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- llama fine-tuning with lora☆140May 8, 2024Updated last year
- The code of SKS☆15Mar 22, 2022Updated 4 years ago
- Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.☆621Jan 24, 2025Updated last year
- [Information Systems-2024] The official implemention of ACMR (Bert4XMR).☆11Sep 22, 2024Updated last year
- 简单易懂的LLaMA微调指南。☆413Jul 5, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Implementation of SATA Tree-LSTM (Dynamic Compositionality in Recursive Neural Networks with Structure-aware Tag Representations, AAAI 20…☆10Jun 21, 2022Updated 3 years ago
- MSTI☆16Mar 6, 2024Updated 2 years ago
- Documentation at☆14Mar 27, 2025Updated last year
- ☆77Mar 23, 2026Updated 3 weeks ago
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA☆240Aug 17, 2025Updated 8 months ago
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆31Updated this week
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆221May 20, 2024Updated last year
- Universal information extraction with instruction learning☆399Feb 28, 2025Updated last year
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆31Feb 10, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆43Dec 15, 2023Updated 2 years ago
- Repo for ACL2023 paper "Won't Get Fooled Again: Answering Questions with False Premises"☆22Jun 11, 2023Updated 2 years ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆161Oct 30, 2024Updated last year
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,286Apr 1, 2026Updated 2 weeks ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆24Oct 10, 2025Updated 6 months ago
- Llama2-SFT, Llama-2-7B微调(transformers)/LORA(peft)/推理☆27Jul 26, 2023Updated 2 years ago
- QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on …☆13Mar 25, 2024Updated 2 years ago
- Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on☆101Apr 24, 2024Updated last year
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- CIKM 2022: CorpusBrain: Pre-train a Generative Retrieval Model for Knowledge-Intensive Language Tasks☆34Aug 31, 2022Updated 3 years ago
- LLM, Fine Tuning, Llama 2, Gemma, Mixtral, vLLM, LangChain, RAG, ChromaDB, FAISS☆13Mar 5, 2024Updated 2 years ago
- ☆38Oct 2, 2024Updated last year
- 怎么训练一个LLM分词器☆152Jul 13, 2023Updated 2 years ago
- [EMNLP 2023] ALCUNA: Large Language Models Meet New Knowledge☆30Oct 30, 2023Updated 2 years ago
- ☆29Apr 30, 2024Updated last year
- Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024☆30Dec 19, 2024Updated last year
- ☆148Apr 8, 2026Updated last week
- Source code of paper "Alirector: Alignment-Enhanced Chinese Grammatical Error Corrector" (Findings of ACL 2024)☆13Mar 19, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- a multimodal retrieval dataset☆24Jul 8, 2023Updated 2 years ago
- Code for our EMNLP 2020 paper "Uncertainty-Aware Label Refinement for Sequence Labeling"☆22Oct 4, 2020Updated 5 years ago
- 中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)☆7,153Updated this week
- ☆15Aug 4, 2025Updated 8 months ago
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆32Apr 12, 2025Updated last year
- 爬取同花顺的股票(A股)信息☆10Nov 5, 2021Updated 4 years ago
- 基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等☆2,781Dec 12, 2023Updated 2 years ago