A Survey of LLM Alignment (SFT & RLHF), and A Survey of RLHF methods (2023~2024)
☆21May 21, 2024Updated last year
Alternatives and similar repositories for LLM-Alignment
Users who are interested in LLM-Alignment are comparing it to the libraries listed below.
- A python implementation of discrete optimal transport with a Tsallis entropy regularization.☆14Oct 23, 2023Updated 2 years ago
- Experimental code for the paper 'Finding Convincing Arguments Using Scalable Bayesian Preference Learning'☆13Dec 8, 2022Updated 3 years ago
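- Inspired by self-instruct: beyond general-purpose LLMs, small domain-specific LLMs can also be built for customized results, using GPT to obtain questions and answers as training data☆18May 12, 2023Updated 2 years ago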
- The source code of the paper "SHGNN: Structure-Aware Heterogeneous Graph Neural Network"☆14Dec 14, 2021Updated 4 years ago
- ☆12Nov 28, 2022Updated 3 years ago
- a model zoo☆11Jul 19, 2017Updated 8 years ago
- Knarc, a Halo blog theme based on Anatole☆13Apr 6, 2023Updated 3 years ago
- OpenLLMDE: An open source data engineering framework for LLMs☆18Sep 9, 2023Updated 2 years ago
- ☆12Aug 25, 2017Updated 8 years ago
- MARL: the model of the IJCAI 2020 paper 'Retrieve, Program, Repeat: Complex Knowledge Base Question Answering via Alternate Meta-learning…☆13Oct 8, 2020Updated 5 years ago
- Sayers Engine is a simple visualization engine based on openGL, MikuMikuDance and MikuMiku Editor.☆13Feb 27, 2016Updated 10 years ago
- ☆15Oct 25, 2021Updated 4 years ago
- Code and data for EMNLP 2023 research track paper "MarkQA: A large scale KBQA dataset with numerical reasoning"☆12Jan 2, 2024Updated 2 years ago
- We propose the Flowmind2digital method and the hdFlowmind dataset in this paper.☆14Nov 17, 2025Updated 5 months ago
- Faithfully Explainable Recommendation via Neural Logic Reasoning☆16May 3, 2021Updated 4 years ago
- Code for MME-SID accepted to CIKM 2025 Full Research track.☆29Oct 29, 2025Updated 6 months ago
- Accepted to ICLR 2025. MetaMetrics is a calibrated meta-metric designed to evaluate generation tasks across different modalities aligned …☆14Dec 30, 2024Updated last year
- ☆14Jun 28, 2022Updated 3 years ago
- Code for "Function Space Particle Optimization for Bayesian Neural Networks"☆18Oct 26, 2022Updated 3 years ago
- Create your own toolset, build for fun🎉☆17May 13, 2023Updated 2 years ago
- Official code for the SIGIR 2025 accepted paper "CDC: Causal Domain Clustering for Multi-Domain Recommendation".☆14Aug 27, 2025Updated 8 months ago
- EarthVL: A Progressive Earth Vision-Language Understanding and Generation Framework☆42Jan 22, 2026Updated 3 months ago
- ☆59Oct 6, 2023Updated 2 years ago
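- 2025.01: A multimodal large model implemented from scratch and named Reyes (睿视; R: 睿, eyes: 眼). Reyes has 8B parameters, uses InternViT-300M-448px-V2_5 as its vision encoder and Qwen2.5-7B-Instruct on the language model side; Reyes also, through a two…☆33Feb 10, 2026Updated 2 months ago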
- Dataset citeulike-t for 'Collaborative Topic Regression with Social Regularization' (CTRSR)☆18Jul 13, 2021Updated 4 years ago
- [TACL] Code for "Red Teaming Language Model Detectors with Language Models"☆24Nov 24, 2023Updated 2 years ago
- Code for AAAI 2023 research track paper "Question Decomposition Tree for Answering Complex Questions over Knowledge Bases"☆18Jan 3, 2024Updated 2 years ago
- Implementation of a recommender system using RQ-VAE Semantic IDs☆16Apr 15, 2026Updated 2 weeks ago
- The code of our paper "Beyond Matching: Modeling Two-Sided Multi-Behavioral Sequences For Dynamic Person-Job Fit" (implements more than ten person-job fit models and dynamic person-job fit models…☆16Aug 10, 2023Updated 2 years ago
- RecommenderSystems: from 0 to practice. Includes two parts: recommender systems in practice and deep recommender systems☆17Dec 22, 2021Updated 4 years ago
- Perspective Transformation for Indoor Image Aesthetic Enhancement☆12Jan 8, 2020Updated 6 years ago
- ☆15Mar 25, 2022Updated 4 years ago
- [EMNLP 2023 (Findings)] Schema-adaptable Knowledge Graph Construction☆22Jan 28, 2024Updated 2 years ago
- [CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation☆18Jul 2, 2023Updated 2 years ago
- A tool for easily drawing pictures of flowers using deep learning☆18May 23, 2018Updated 7 years ago
- Fastest CPU (AVX/SSE) SIFT or other 128-float vector matcher for computer vision☆14Mar 23, 2021Updated 5 years ago
- [ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"☆26Feb 7, 2026Updated 2 months ago
- [CVPRW 2023] The Winner's Solution of CVPR2023-ABAW5 Emotional Reaction Intensity (ERI) Estimation Challenge☆27Mar 19, 2023Updated 3 years ago
- Empirical Study of Recent Face Alignment Methods☆13May 19, 2017Updated 8 years ago