chatglm_rlhf_finetuning
☆30Oct 10, 2023Updated 2 years ago
Alternatives and similar repositories for chatglm_rlhf
Users that are interested in chatglm_rlhf are comparing it to the libraries listed below
Sorting:
- A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Huma…☆140Apr 28, 2023Updated 2 years ago
- 对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF☆198May 23, 2023Updated 2 years ago
- realize the reinforcement learning training for gpt2 llama bloom and so on llm model☆27Sep 19, 2023Updated 2 years ago
- Evaluation for AI apps and agent☆44Jan 18, 2024Updated 2 years ago
- aigc_serving lightweight and efficient Language service model reasoning☆24Jun 12, 2024Updated last year
- Train LoRA using Microsoft's official implementation with Stable Diffusion models.☆33May 9, 2023Updated 2 years ago
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆110Jul 19, 2023Updated 2 years ago
- 马克思主义哲学:从《黑格尔法哲学批判》到《资本论》☆16Jul 13, 2024Updated last year
- 🛰️ 基于真实医疗对话数据在ChatGLM上进行LoRA、P-Tuning V2、Freeze、RLHF等微调,我们的眼光不止于医疗问答☆337Sep 2, 2023Updated 2 years ago
- deep learning☆150May 6, 2025Updated 10 months ago
- Quick hack job to allow use with Sillytavern. This works for me, some further updates are expected to expose more settings to sillytavern☆11May 30, 2024Updated last year
- Python script to fine tune Open source Video Vision Transformer (ViVit) using HuggingFace Trainer Library☆14Aug 1, 2024Updated last year
- Mixed reality communication system for remote meetings.☆14Feb 21, 2019Updated 7 years ago
- ☆10Sep 12, 2024Updated last year
- Latin texts annotated for named entities and NER tagger used for the Herodotos Project (Ohio State University / Ghent University)☆11Sep 26, 2022Updated 3 years ago
- A Challenge on Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG), Co-located with SLT2024 FutureDial-RAG Challenge☆11Aug 10, 2024Updated last year
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Nov 11, 2024Updated last year
- This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialgoue for under…☆10Mar 24, 2023Updated 2 years ago
- Auto party rave lights☆23Dec 3, 2025Updated 3 months ago
- A work-in-progress framework for building multi-platform, multi-user VR experiences for social learning spaces.☆20Apr 3, 2025Updated 11 months ago
- AI修仙☆11Jul 8, 2025Updated 7 months ago
- Prototype implementation of an architecture suggested in Robot Dream paper (http://arxiv.org/abs/1603.03007)☆12Jul 3, 2019Updated 6 years ago
- SimEc code relying on the theano library - check out the simec repo instead for keras based code!☆10Feb 28, 2018Updated 8 years ago
- A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed☆11Apr 14, 2023Updated 2 years ago
- Finetuning Stable Diffusion from Diffusers☆12Mar 11, 2024Updated last year
- MedicalGPT-zh:一个基于ChatGLM的在高质量指令数据集微调的中文医疗对话语言模型☆11Apr 9, 2023Updated 2 years ago
- An efficient distillation method for flow matching models☆22Feb 1, 2026Updated last month
- chatglm 6b finetuning and alpaca finetuning☆1,536Mar 9, 2025Updated 11 months ago
- 基于ChatGLM-6B,低成本实现类Instruction效果的角色扮演☆45May 15, 2023Updated 2 years ago
- ChatGLM2-6B 全参数微调,支持多轮对话的高效微调。☆402Aug 17, 2023Updated 2 years ago
- Mobile solution from KISSY☆67Dec 4, 2013Updated 12 years ago
- retired, features pushed to upstream, please using the upstream repo.☆11Sep 8, 2024Updated last year
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 202…☆29Jan 18, 2026Updated last month
- A Deep Learning Project about cats.☆11Aug 8, 2022Updated 3 years ago
- Adaptive Deep Learning Model Selection On Embedded Systems☆11May 6, 2018Updated 7 years ago
- ☆11Apr 29, 2019Updated 6 years ago
- ☆11Jul 19, 2017Updated 8 years ago
- 一个音视频封面、缩略图、多媒体信息加载工具☆11Jan 9, 2018Updated 8 years ago
- Textobject for evil based on indentation.☆15Aug 31, 2013Updated 12 years ago