☆76Nov 13, 2023Updated 2 years ago
Alternatives and similar repositories for Simple_LLM_DPO
Users that are interested in Simple_LLM_DPO are comparing it to the libraries listed below
Sorting:
- ☆19Aug 9, 2024Updated last year
- 手搓Llama,个人学习用☆16May 21, 2024Updated last year
- 基于youtube、bilibili等视频平台、webpage网页等,利用零一万物大模型或ollama本地小模型构建大语言模型高质量训练数据集(计划支持可自定义输出的训练数据格式)☆19May 2, 2024Updated last year
- OpenLLMDE: An open source data engineering framework for LLMs☆18Sep 9, 2023Updated 2 years ago
- 通义千问的DPO训练☆63Sep 21, 2024Updated last year
- E2E NLG Challenge submission☆23Feb 14, 2021Updated 5 years ago
- ☆23Apr 16, 2024Updated last year
- Detecting car parking slot on Open car park space☆13Oct 21, 2019Updated 6 years ago
- 2019中国高校计算机大赛,rank5代码+ppt☆29Aug 27, 2019Updated 6 years ago
- ☆33Aug 7, 2024Updated last year
- ☆17Feb 6, 2025Updated last year
- The classic movies redux with machine learning using TensorFlow and Keras.☆11Feb 12, 2019Updated 7 years ago
- [ACL 2024]Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs☆40Sep 24, 2024Updated last year
- FOBIE dataset and code for Semi-Open Relation Extraction, applied to Biology for Computer-Aided Biomimetics.☆35Jun 14, 2020Updated 5 years ago
- Codes for our paper "JointGT: Graph-Text Joint Representation Learning for Text Generation from Knowledge Graphs" (ACL 2021 Findings)☆75Jul 27, 2021Updated 4 years ago
- graphrag的基础架构☆46Oct 17, 2024Updated last year
- A collection of some awesome public projects about LLM-based Web Agents and Tools.☆12Apr 25, 2024Updated last year
- 在NLP领域中一些任务的Demo☆13Sep 11, 2023Updated 2 years ago
- Application of deep generative model discovers novel and diverse functional peptides against microbial resistance☆11Dec 22, 2022Updated 3 years ago
- ☆13Sep 23, 2025Updated 5 months ago
- [NeurIPS 2025] Official Implementation for "Glocal Information Bottleneck for Time Series Imputation"☆14Nov 4, 2025Updated 4 months ago
- ☆10Dec 17, 2019Updated 6 years ago
- Code for paper "ToxIBTL: prediction of peptide toxicity based on information bottleneck and transfer learning"☆13Jan 24, 2022Updated 4 years ago
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated last year
- This script is designed for evaluating the binding affinity between a protein target and a small molecule, by calculating the binding fre…☆14May 1, 2023Updated 2 years ago
- Conversational AI based on Rasa☆40Feb 11, 2022Updated 4 years ago
- A pytorch implementation of LCGNN☆11Jun 1, 2020Updated 5 years ago
- Classification of tamil news headlines - experimental☆13Feb 21, 2019Updated 7 years ago
- ☆12Feb 20, 2021Updated 5 years ago
- js逆向练习记录☆11Nov 30, 2023Updated 2 years ago
- ☆11May 24, 2024Updated last year
- Using single image per person to train face recognition model☆11Oct 11, 2019Updated 6 years ago
- An Introductory Jupyter Notebook to Manipulate Ontologies with Owlready2☆11Jan 10, 2020Updated 6 years ago
- This project utilizes deep reinforcement learning techniques to train a robot, which combines a mobile platform and a Panda robotic arm, …☆10Jun 7, 2023Updated 2 years ago
- AbationGraph® is a time-series knowledge graph database for real-time data analysis☆16Nov 23, 2023Updated 2 years ago
- ☆15Oct 18, 2020Updated 5 years ago
- 基于Python+Flask+MySQL的数据微中台,支持数据库管理、数据收集(某乎爬虫等)等功能☆10Sep 4, 2020Updated 5 years ago
- 本项目对Deepseek-R1-Distill-Qwen-7B进行心理咨询CoT数据的LoRA微调,以进一步提升Deepseek-R1-Distill-Qwen-7B在心理咨询领域的慢思考能力。☆12Mar 11, 2025Updated 11 months ago
- ☆11Nov 23, 2025Updated 3 months ago