Miraclemarvel55/LLaMA-MOSS-RLHF-LoRA

Miraclemarvel55 / LLaMA-MOSS-RLHF-LoRA

Training LLaMA or MOSS with RLHF, optionally with LoRA

☆21

Alternatives and similar repositories for LLaMA-MOSS-RLHF-LoRA

Users who are interested in LLaMA-MOSS-RLHF-LoRA are comparing it to the repositories listed below.

alexrs / herd
Mixture of Experts (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.
☆12 · Feb 11, 2024 · Updated 2 years ago
CreaLabs / Enhanced-BGE-M3-with-CLP-and-MoE
This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…
☆11 · Dec 27, 2024 · Updated last year
CLUEbenchmark / Math24o
Math24o: High School Olympiad Mathematics Chinese Benchmark
☆14 · Mar 27, 2025 · Updated last year
shuttie / embed-benchmark
☆16 · Nov 10, 2023 · Updated 2 years ago
wisnunugroho21 / reinforcement_learning_truly_ppo
Deep Reinforcement Learning using Truly Proximal Policy Optimization in TensorFlow 2 and PyTorch
☆22 · Nov 9, 2025 · Updated 8 months ago
sdunlp / nlp_Chinese
Application for processing Chinese text: sentiment, keywords, abstract
☆10 · Apr 13, 2017 · Updated 9 years ago
v-i-s-h / lte-rl
A fork of the ns3 LTE module for reinforcement learning experiments
☆13 · Feb 20, 2017 · Updated 9 years ago
christosbampis / Psychopy_Software_Demo_LIVE_NFLX_II
Demo for the subjective interface
☆14 · Mar 4, 2018 · Updated 8 years ago
winkidney / wei-dev
A graphical debugging tool for WeChat that saves you the trouble of deploying your program to a server.
☆35 · Jul 4, 2017 · Updated 9 years ago
guoyongcs / RSPC
Code for "Improving Robustness of Vision Transformers by Reducing Sensitivity to Patch Corruptions"
☆14 · Sep 3, 2023 · Updated 2 years ago
Li-ChangHao / CoNav
☆12 · Jul 16, 2024 · Updated last year
ycchen218 / EDA-DRC-Prediction
A deep-learning-based model for Electronic Design Automation (EDA) that predicts the locations of Design Rule Check (DRC) violations.
☆13 · Jun 24, 2023 · Updated 3 years ago
jiah-li / magic
The repo for the paper "Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models".
☆15 · Dec 16, 2024 · Updated last year
thu-coai / LongSafety
[ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models
☆16 · Jun 18, 2025 · Updated last year
xubingna / HCD
☆11 · Aug 31, 2023 · Updated 2 years ago
PeihaoChen / ActiveCamera
Official implementation of the NeurIPS 2022 paper "Learning Active Camera for Multi-Object Navigation"
☆14 · Apr 23, 2023 · Updated 3 years ago
yoonholee / reinforcement-learning-papers
My notes on reinforcement learning papers
☆15 · Jun 14, 2018 · Updated 8 years ago
Bitbol-Lab / Phylogeny-MSA-Transformer
Supporting repository for "Protein language models trained on multiple sequence alignments learn phylogenetic relationships" (https://www…
☆17 · Jan 17, 2025 · Updated last year
jimmy15923 / wspss_mil_transformer
☆11 · Jun 30, 2023 · Updated 3 years ago
owenliang / asyncio-threadpool-demo
How FastAPI async I/O works together with a thread pool
☆18 · Feb 12, 2024 · Updated 2 years ago
limenlp / safer-instruct
This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17 · Feb 22, 2024 · Updated 2 years ago
lizhaoliu-Lec / DAS
This is the official repo for Densely-Anchored Sampling for Deep Metric Learning (ECCV 22).
☆16 · May 24, 2024 · Updated 2 years ago
YuejiangLIU / prioritized_option_critic
Implementation of the Prioritized Option-Critic on the Four-Rooms Environment
☆17 · Dec 24, 2017 · Updated 8 years ago
teffland / ner-expected-entity-ratio
Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022
☆14 · Nov 7, 2022 · Updated 3 years ago
Htallone / beamerBUAA
A Beihang University (BUAA) themed LaTeX beamer template
☆21 · Jun 9, 2018 · Updated 8 years ago
veronicachelu / temporal_abstraction
Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…
☆24 · Nov 29, 2018 · Updated 7 years ago
alon-albalak / online-data-mixing
An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.
☆14 · Jan 9, 2024 · Updated 2 years ago
mishajw / repeng
Experiments with representation engineering
☆14 · Feb 28, 2024 · Updated 2 years ago
yutuer21 / quantumzero
☆15 · Feb 24, 2022 · Updated 4 years ago
azharlabs / large-models
☆15 · Feb 7, 2024 · Updated 2 years ago
zjunlp / NLPCC2024_RegulatingLLM
[NLPCC 2024] Shared Task 10: Regulating Large Language Models
☆14 · Jun 12, 2024 · Updated 2 years ago
shadowkiller33 / Language_attack
A repo for LLM jailbreak
☆14 · Sep 5, 2023 · Updated 2 years ago
AlonMendelson / SGVL
☆17 · Dec 13, 2023 · Updated 2 years ago
a-nagrani / CVPR2020_Poster
Speech2Action CVPR Poster Source Code
☆20 · Apr 29, 2020 · Updated 6 years ago
Liadrinz / RLlib-Common-Paramters
A detailed explanation of RLlib hyperparameters (in Chinese)
☆18 · Jan 24, 2022 · Updated 4 years ago
ivanalberico / Probabilistic-Artificial-Intelligence-ETH
Graded projects of the course "Probabilistic Artificial Intelligence", ETH Zürich (Fall 2020). Topics: Gaussian Process Regression, Bayes…
☆12 · Nov 3, 2021 · Updated 4 years ago
ChengTsang / HMM-For-NER
A program that solves NER with an HMM. For the principles and details, refer to my blog: https://blog.csdn.net/weixin_41679411/article/d…
☆11 · Nov 20, 2018 · Updated 7 years ago
lizhaoliu-Lec / CG-VLM
This is the official repo for Contrastive Vision-Language Alignment Makes Efficient Instruction Learner.
☆20 · Dec 1, 2023 · Updated 2 years ago
yongchanghao / multi-task-nat
☆11 · Jul 17, 2021 · Updated 4 years ago