Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on
☆99Apr 24, 2024Updated 2 years ago
Alternatives and similar repositories for llm_finetuning
Users that are interested in llm_finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- realize the reinforcement learning training for gpt2 llama bloom and so on llm model☆27Sep 19, 2023Updated 2 years ago
- demos based on PSpider☆17Mar 1, 2019Updated 7 years ago
- Alpaca Lora☆25Jul 25, 2023Updated 2 years ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- moss chat finetuning☆51Apr 23, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models☆21May 29, 2025Updated last year
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- ☆25Jul 20, 2025Updated 11 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆46Jun 25, 2024Updated 2 years ago
- basic algorithm☆11Nov 28, 2020Updated 5 years ago
- 对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF☆196May 23, 2023Updated 3 years ago
- QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on …☆13Mar 25, 2024Updated 2 years ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆46Aug 25, 2025Updated 10 months ago
- A competition on DataCastle which is about text keyword extraction ! Rank 6 / 622 !☆16Jan 27, 2019Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆55May 17, 2023Updated 3 years ago
- One Repository of AI Series: Collecting the useful technology articles, opensource tutorials and opensource books. 我的AI系列仓库之一:收集有用的技术文章、开…☆10Nov 14, 2024Updated last year
- 基于中文 GPT2 预训练模型的文本分类微调☆23Mar 29, 2023Updated 3 years ago
- MSTI☆16Mar 6, 2024Updated 2 years ago
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- Implementing BERT + CRF with PyTorch for Chinese NER.☆10Mar 7, 2022Updated 4 years ago
- ☆17Jul 18, 2022Updated 3 years ago
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"☆16Oct 2, 2025Updated 9 months ago
- A simple cycle-accurate DaDianNao simulator☆13Mar 27, 2019Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Simple Conversational Data Augmentation for Semi-supervised Abstractive Conversation Summarization☆10Mar 7, 2022Updated 4 years ago
- Fine-tuning GPT-2 to generate research paper abstracts☆12Apr 28, 2021Updated 5 years ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Mar 30, 2020Updated 6 years ago
- ☆11Aug 29, 2022Updated 3 years ago
- Neutral Network based Chinese Segment System☆19Nov 29, 2016Updated 9 years ago
- Official Implementation of Avoiding spurious correlations via logit correction☆17May 6, 2023Updated 3 years ago
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆16Nov 4, 2024Updated last year
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆24Jan 6, 2026Updated 5 months ago
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Sep 2, 2021Updated 4 years ago
- Contrastive Fact Verification☆73Sep 17, 2022Updated 3 years ago
- Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'☆14Aug 2, 2024Updated last year
- NTK scaled version of ALiBi position encoding in Transformer.☆69Aug 16, 2023Updated 2 years ago
- Parameter-efficient Fine Tuning for Clinical LLMs☆18Apr 23, 2024Updated 2 years ago
- KDD 2024 AQA competition 2nd place solution☆12Jul 21, 2024Updated last year
- [EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information☆11Oct 11, 2024Updated last year