chatglm_rlhf_finetuning
☆30Oct 10, 2023Updated 2 years ago
Alternatives and similar repositories for chatglm_rlhf
Users that are interested in chatglm_rlhf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Huma…☆138Apr 28, 2023Updated 3 years ago
- realize the reinforcement learning training for gpt2 llama bloom and so on llm model☆27Sep 19, 2023Updated 2 years ago
- 对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF☆198May 23, 2023Updated 2 years ago
- aigc_serving lightweight and efficient Language service model reasoning☆24Jun 12, 2024Updated last year
- share data, prompt data , pretraining data☆36Nov 30, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 2 years ago
- Solution of team funny in WSDM2020☆13Jan 17, 2020Updated 6 years ago
- deep learning☆150May 6, 2025Updated last year
- moss chat finetuning☆51Apr 23, 2024Updated 2 years ago
- LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset☆14Feb 2, 2025Updated last year
- Evaluation for AI apps and agent☆45Jan 18, 2024Updated 2 years ago
- Implementation of RLHF (Reinforcement Learning with Human Feedback) and GAN (Generative Adversarial Network) on top of the T5 architectur…☆17Jan 2, 2023Updated 3 years ago
- 3gpp协议26073里面的vad的移植☆14Feb 14, 2019Updated 7 years ago
- Intrinsic Time-Scale Decomposition☆17Feb 20, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 基于bert的中文实体链接☆30Nov 24, 2021Updated 4 years ago
- Train LoRA using Microsoft's official implementation with Stable Diffusion models.☆34May 9, 2023Updated 2 years ago
- This repository contains the data used for the paper "Entity Recognition at First Sight: Improving NER with Eye Movement Information" by …☆11Jan 22, 2020Updated 6 years ago
- Go语言开发的HTML和Markdown转JSON工具,将HTML和Markdown内容转换为符合各种小程序`rich-text`组件内容渲染所需格式的`JSON`☆12May 5, 2023Updated 3 years ago
- ☆11Oct 13, 2024Updated last year
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆109Jul 19, 2023Updated 2 years ago
- a website designed to help you manage your tasks and stay organized.☆11Mar 16, 2023Updated 3 years ago
- ☆11Aug 10, 2022Updated 3 years ago
- clue chatyuan finetuning☆17Mar 10, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- DiT for VAE (and Video Generation)☆35Sep 2, 2024Updated last year
- chatglm 6b finetuning and alpaca finetuning☆1,537Mar 9, 2025Updated last year
- Code for KDD 2023 long paper: MetricPrompt: Prompting Model as a Relevance Metric for Few-Shot Text Classification☆19Aug 10, 2024Updated last year
- Code for the paper "Closing the Curious Case of Neural Text Degeneration"☆12Apr 9, 2025Updated last year
- Latin texts annotated for named entities and NER tagger used for the Herodotos Project (Ohio State University / Ghent University)☆12Sep 26, 2022Updated 3 years ago
- Demo telematics app for Flutter. The application walks you through the telematics SDK integration. The technology is suitable for UBI (Us…☆24Apr 1, 2026Updated last month
- A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed☆11Apr 14, 2023Updated 3 years ago
- Cluster Images using Perceptual Hash☆13Apr 22, 2016Updated 10 years ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Nov 11, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Straightforward Pytorch Implementation of Gated Feedback RNNs☆12May 8, 2017Updated 8 years ago
- 用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.☆256Aug 1, 2023Updated 2 years ago
- Pytorch implementation of vision models.☆12Dec 8, 2022Updated 3 years ago
- ChatGLM2-6B 全参数微调,支持多轮对话的高效微调。☆402Aug 17, 2023Updated 2 years ago
- Baselines of metric learning method using PyTorch.☆11Oct 18, 2021Updated 4 years ago
- use chatGLM to perform text embedding☆45Apr 9, 2023Updated 3 years ago
- ☆21Mar 3, 2026Updated 2 months ago