Fine-tune large language models with the DPO algorithm; simple and easy to get started.
☆52Jul 3, 2024Updated last year
Alternatives and similar repositories for Simple-Trl-Training
Users that are interested in Simple-Trl-Training are comparing it to the libraries listed below.
- ☆19Aug 9, 2024Updated last year
- Mining, pre-processing and embedding over 1 million Amazon Movie & T.V. reviews to build a multi-class Naive Bayes model and later a CNN…☆11Jan 10, 2020Updated 6 years ago
- Attentive Knowledge-aware Graph Convolutional Networks with Collaborative Guidance for Personalized Recommendation☆12Sep 22, 2022Updated 3 years ago
- ZJU Compiler Principles course project☆17Jun 11, 2022Updated 3 years ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆24Nov 1, 2025Updated 7 months ago
- BAAI/bge-large-zh could not be cloned from Hugging Face, so it was downloaded manually for easier use☆11Sep 16, 2023Updated 2 years ago
- Zeta implementation of a reusable, plug-and-play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- Unofficial implementation of WebFormer: The Web-page Transformer for Structure Information Extraction☆13Apr 20, 2023Updated 3 years ago
- PyTorch implementations of various neural network models☆43Dec 25, 2022Updated 3 years ago
- ☆12May 13, 2023Updated 3 years ago
- Chinese keyword extraction☆14Aug 7, 2023Updated 2 years ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆15Jun 21, 2024Updated last year
- Software relating to relational empirical risk minimization☆16Jun 12, 2021Updated 4 years ago
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
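- An admin dashboard system based on Vue 2, including a login page (with slider verification), a change-password page, and a 404 page. It wraps axios and stores the API address in environment variables. Access control generates the accessible routes, and the sidebar navigation is generated from those routes. For any questions, contact me at chenzhipeng709@163.com. If you like it, please give it a…☆15Jul 26, 2024Updated last year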
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts☆16Feb 26, 2024Updated 2 years ago
- Fixed version of https://github.com/tomguluson92/PRNet_PyTorch☆10Mar 30, 2020Updated 6 years ago
- A simple Rasa UI☆14Jul 13, 2020Updated 5 years ago
- ☆122Jun 30, 2024Updated last year
- Gradually Updated Neural Networks for Large-Scale Image Recognition at ICML 2018☆10Jun 25, 2018Updated 7 years ago
- UniVid: The Open-Source Unified Video Model☆32Oct 13, 2025Updated 7 months ago
- Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily☆10Jun 1, 2026Updated last week
- Translations and interpretations of deep learning papers☆18Feb 25, 2022Updated 4 years ago
- An unsupervised text summarization and information retrieval library under the hood using natural language processing models☆15Dec 11, 2020Updated 5 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- ☆15Jul 14, 2022Updated 3 years ago
- This is the official Python implementation repository for a paper entitled "Resolving Camera Position for a Practical Application of Gaz…☆12Jan 11, 2022Updated 4 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- [AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615☆65Nov 8, 2025Updated 7 months ago
- LLM Tokenizer with BPE algorithm☆49May 7, 2024Updated 2 years ago
- XGBoost reimplementation☆15Oct 6, 2024Updated last year
- This paper was accepted by WSDM 2023☆13Mar 13, 2023Updated 3 years ago
- MeloTTS demo on Axera☆13Nov 18, 2025Updated 6 months ago
- ☆432Feb 10, 2025Updated last year
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆23Oct 14, 2025Updated 7 months ago
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- Java implementation of the BERT tokenizer; supports outputting ONNX tensors for ONNX model inference☆13Sep 4, 2023Updated 2 years ago
- Basic Tools☆13Dec 18, 2021Updated 4 years ago