wzhwzhwzh0921/Awesome_LRM_with_Entropy

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wzhwzhwzh0921/Awesome_LRM_with_Entropy)

wzhwzhwzh0921 / Awesome_LRM_with_Entropy

Introduction about AWESOME_ENTROPY+LRM_PAPERS

☆32

Alternatives and similar repositories for Awesome_LRM_with_Entropy

Users that are interested in Awesome_LRM_with_Entropy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

linjh1118 / AwesomeRM
View on GitHub
☆32Jan 11, 2026Updated 6 months ago
linjh1118 / survey_agent
View on GitHub
☆17Jan 14, 2026Updated 6 months ago
misonsky / HiFT
View on GitHub
memory-efficient fine-tuning; support 24G GPU memory fine-tuning 7B
☆21May 26, 2024Updated 2 years ago
sci-m-wang / Spy-Game
View on GitHub
利用大语言模型进行卧底游戏，包括谁是卧底及衍生的发现AI卧底游戏等。
☆11Sep 6, 2024Updated last year
sci-m-wang / NEU-Thesis
View on GitHub
东北大学学位论文LaTex版 (本硕博通用)，可直接导入Overleaf。LaTex version of Northeastern University's thesis, which can be imported directly into Overleaf.
☆127Jun 14, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
linjh1118 / Llama3-Chinese-ORPO
View on GitHub
基于Llama3，通过进一步CPT，SFT，ORPO得到的中文版Llama3
☆16Apr 24, 2024Updated 2 years ago
latentcraft / replay
View on GitHub
[CVPR 2026] Boosting Reasoning in Large Multimodal Models via Activation Replay
☆24May 7, 2026Updated 2 months ago
xlyu0106 / MACT
View on GitHub
☆19Jul 31, 2025Updated 11 months ago
liziliao / MMConv
View on GitHub
Official repository for "MMConv: An Environment for Multimodal Conversational Search across Multiple Domains"
☆34Jul 15, 2021Updated 5 years ago
Yuan-Hou / Human-MME
View on GitHub
Official repository for "Human-MME: A Holistic Evaluation Benchmark for Human-Centric Multimodal Large Language Models"
☆22Dec 2, 2025Updated 7 months ago
declare-lab / MM-InstructEval
View on GitHub
This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimoda…
☆32Mar 9, 2025Updated last year
linjh1118 / Chinese_Awesome_CV
View on GitHub
Awesome_CV的中文版本，clone本项目到overleaf即可轻松愉快编写自己的CV
☆18May 24, 2024Updated 2 years ago
linjh1118 / LLM-Research
View on GitHub
A LLM Paper note list.
☆19Apr 6, 2024Updated 2 years ago
YinBo0927 / FeRA
View on GitHub
[ICML 2026] The official code of FeRA: Frequency–Energy Constrained Routing for Effective Diffusion Adaptation Fine-Tuning
☆29Dec 27, 2025Updated 7 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
BoyuanJiang / FluxFit
View on GitHub
Virtual Try-on based on the powerful Flux model
☆27Dec 4, 2024Updated last year
RH-Lin / abbreviate_pub_names_in_bib
View on GitHub
Automatically replace full publication names in a bibtex database file into official abbreviated names, or reverse. (Support IEEE/ACM/Sci…
☆14Jul 30, 2024Updated last year
icip-cas / Verifier-Engineering
View on GitHub
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
☆63Dec 5, 2024Updated last year
NEU-DataMining / awesome-affective-computing
View on GitHub
A comprehensive overview of affective computing research in the era of large language models (LLMs).
☆33Aug 7, 2024Updated last year
circle-hit / CauAIN
View on GitHub
Code for IJCAI 2022 accepted paper titled "CauAIN: Causal Aware Interaction Network for Emotion Recognition in Conversations"
☆24Jun 11, 2023Updated 3 years ago
d-f / llm-summarization
View on GitHub
LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset
☆14Feb 2, 2025Updated last year
Jiachen-T-Wang / GREATS
View on GitHub
☆20Jun 27, 2026Updated last month
nku-shengzheliu / SER30K
View on GitHub
[ACM MM 2022 Oral] This is the official implementation of "SER30K: A Large-Scale Dataset for Sticker Emotion Recognition"
☆32Oct 18, 2022Updated 3 years ago
Evanwu1125 / LiteCoT
View on GitHub
☆17Jun 10, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Ravoxsg / efficient_unified_crs
View on GitHub
Source code for PECRS (EACL 2024)
☆12Feb 3, 2024Updated 2 years ago
zhliu0106 / learning-to-refuse
View on GitHub
Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"
☆10Dec 13, 2024Updated last year
Orion-zhen / roleplay-dataset
View on GitHub
收集优质的角色扮演聊天数据 | Collection of roleplay conversations of high quality
☆16Dec 1, 2024Updated last year
Libo-Xu / DRIVE--Digital-Retinal-Images-for-Vessel-Extraction
View on GitHub
☆12Sep 8, 2020Updated 5 years ago
simplelifetime / TIVE
View on GitHub
Less is More: High-value Data Selection for Visual Instruction Tuning
☆20Jan 18, 2025Updated last year
maximek3 / MIMIC-NLE
View on GitHub
☆21Jul 25, 2022Updated 4 years ago
zxd-octopus / ECR
View on GitHub
The implementation for the Recsys paper: Towards Empathetic Conversational Recommender System
☆26Sep 3, 2024Updated last year
sylvain-wei / 24-Game-Reasoning
View on GitHub
超简单复现Deepseek-R1-Zero和Deepseek-R1，以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL，以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…
☆35Apr 5, 2025Updated last year
tmlr-group / TriMem
View on GitHub
[arXiv:2605.19952] "Rethinking How to Remember: Beyond Atomic Facts in Lifelong LLM Agent Memory"
☆16May 20, 2026Updated 2 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
wang8740 / MAP
View on GitHub
Documentation at
☆14Mar 27, 2025Updated last year
wzhwzhwzh0921 / S-D-Mamba
View on GitHub
Code for "Is Mamba Effective for Time Series Forecasting?"
☆392May 20, 2025Updated last year
shaohao011 / MedCCO
View on GitHub
[ACM MM2026] This is the official implementation of MedCCO
☆17Jul 12, 2026Updated 2 weeks ago
ZhangYiqun018 / Avengers
View on GitHub
[AAAI 2026] The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants
☆46Dec 11, 2025Updated 7 months ago
BaohaoLiao / frac-cot
View on GitHub
[COLM 2026] An efficient 3D sampling method for long-CoT LLM.
☆16May 25, 2025Updated last year
ZhangYiqun018 / StickerConv
View on GitHub
[ACL 2024]
☆60Jun 20, 2024Updated 2 years ago
CyberAgentAILab / filtered-dpo
View on GitHub
[EMNLP 2024] Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by …
☆16Nov 27, 2024Updated last year