llama fine-tuning with lora
☆140May 8, 2024Updated last year
Alternatives and similar repositories for llama-lora-fine-tuning
Users that are interested in llama-lora-fine-tuning are comparing it to the libraries listed below.
- llama2 finetuning with deepspeed and lora☆176Jul 28, 2023Updated 2 years ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆221May 20, 2024Updated last year
- Original PyTorch Implementation for the EMNLP 2023 Paper "Beyond Detection: A Defend-and-Summarize Strategy for Robust and Interpretable …☆16Dec 14, 2023Updated 2 years ago
- This is the official implementation of the EMNLP Findings paper, VideoINSTA: Zero-shot Long-Form Video Understanding via Informative Spatia…☆25Apr 7, 2026Updated last week
- ☆19Dec 12, 2023Updated 2 years ago
- Chinese Word Segmentation task based on BERT and implemented in Pytorch☆14Aug 14, 2020Updated 5 years ago
- Prompt-based Chinese text classification.☆55May 6, 2023Updated 2 years ago
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.☆1,017Apr 27, 2024Updated last year
- Finetune LLaMA-7B with Chinese instruction datasets☆136May 8, 2023Updated 2 years ago
- ☆10Jul 8, 2021Updated 4 years ago
- [SIGIR 2025] Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph☆16Jun 6, 2025Updated 10 months ago
- APIBench is a benchmark for evaluating the performance of API recommendation approaches released in the paper "Revisiting, Benchmarking a…☆66Apr 3, 2023Updated 3 years ago
- ☆15Aug 4, 2025Updated 8 months ago
- PyTorch implementation of delayed-feedback-model (DFM)☆15Feb 7, 2022Updated 4 years ago
- Easy and efficient fine-tuning of LLMs (supports LLama, LLama2, LLama3, Qwen, Baichuan, GLM, Falcon). Efficient quantized training and deployment of large models.☆621Jan 24, 2025Updated last year
- A simple, easy-to-understand guide to fine-tuning LLaMA.☆413Jul 5, 2023Updated 2 years ago
- Pytorch re-implementation of R-BERT model☆66Apr 20, 2020Updated 6 years ago
- Instruct-tune LLaMA on consumer hardware☆18,945Jul 29, 2024Updated last year
- Experiments for our CLEAR benchmark of unlearning methods in a multimodal setup☆22Aug 6, 2025Updated 8 months ago
- ☆14Jun 3, 2023Updated 2 years ago
- 2016 Huawei CodeCraft algorithm competition (DFS + pruning)☆12Mar 6, 2017Updated 9 years ago
- A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors☆25Jul 30, 2025Updated 8 months ago
- Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations"☆29Jun 15, 2020Updated 5 years ago
- Recommender systems with large language models (Paper list)☆64Nov 20, 2023Updated 2 years ago
- onebot v11 adapter in plugin☆10Mar 5, 2023Updated 3 years ago
- Ensembling Hugging Face transformers made easy☆61Dec 24, 2022Updated 3 years ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆18Dec 17, 2025Updated 4 months ago
- Benchmark suite for large-scale socio-technical datasets in open collaboration☆12Oct 12, 2024Updated last year
- ☆14Jun 27, 2019Updated 6 years ago
- Thin wrapper for the AllenNLP's implementation of supervised open information extraction☆17Nov 19, 2019Updated 6 years ago
- Text Style Transfer: A Review☆13Jun 1, 2019Updated 6 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 3 years ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆103Oct 28, 2024Updated last year
- Implementation of Beyond Neural Scaling beating power laws for deep models and prototype-based models☆34Oct 30, 2025Updated 5 months ago
- Code repository for the WWW 2019 paper "Predicting ConceptNet Path Quality Using Crowdsourced Assessments of Naturalness"☆12Feb 1, 2019Updated 7 years ago
- Improving transparency of large language models' reasoning☆15Nov 25, 2025Updated 4 months ago
- A simple script for extracting plain text from arxiv dataset: https://www.kaggle.com/Cornell-University/arxiv☆15Dec 7, 2020Updated 5 years ago
- An all-in-one framework for Ad-hoc Information Retrieval.☆18Apr 3, 2024Updated 2 years ago
- Set up an MCP server in 60 seconds.☆13Dec 12, 2024Updated last year