基于DPO算法微调语言大模型,简单好上手。
☆51Jul 3, 2024Updated last year
Alternatives and similar repositories for Simple-Trl-Training
Users that are interested in Simple-Trl-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- code for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation☆18Dec 7, 2024Updated last year
- ☆19Aug 9, 2024Updated last year
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆379Mar 23, 2026Updated last week
- ☆32Aug 11, 2025Updated 7 months ago
- ☆29Nov 29, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- demo backend with flask and neo4j☆12Jul 3, 2017Updated 8 years ago
- 目前各大高校领域将各种信息分布在不同的部门信息门户下,存在典型的信息孤岛问题,各个部门信息没有形成互通。当前,老师和学生存在很多有关本校相关文件、政策和活动等众多方面智能问答的统一入口的需求,例如财务处、人事处、学工处、教务处、图书馆等存在各种政策和文件规定,目前在校师生都…☆36Aug 5, 2024Updated last year
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆28Mar 18, 2026Updated last week
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration☆20Mar 21, 2025Updated last year
- ☆12May 13, 2023Updated 2 years ago
- 中文关键词提取☆14Aug 7, 2023Updated 2 years ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆14Jun 21, 2024Updated last year
- Software relating to relational empirical risk minimization☆16Jun 12, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 知识表示和推理项目,收集知识表示和推理算法,部分算法给出了应用案例。☆13Apr 26, 2022Updated 3 years ago
- [ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensi…☆22Jun 12, 2025Updated 9 months ago
- This is the official repository of Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities☆39Updated this week
- ☆18Jul 25, 2025Updated 8 months ago
- This is the formal code implementation of the CVPR 2024 paper 'Traceable Federated Continual Learning'.☆18May 31, 2024Updated last year
- Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"☆11May 27, 2025Updated 10 months ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆40Sep 30, 2025Updated 6 months ago
- UniVid: The Open-Source Unified Video Model☆30Oct 13, 2025Updated 5 months ago
- ☆122Jun 30, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for "Position-aware Structure Learning for Graph Topology-imbalance by Relieving Under-reaching and Over-squashing"☆15Oct 1, 2023Updated 2 years ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆39Dec 31, 2024Updated last year
- 深度学习领域论文翻译+理解☆17Feb 25, 2022Updated 4 years ago
- An unsupervised text summarization and information retrieval library under the hood using natural language processing models☆15Dec 11, 2020Updated 5 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆55Oct 29, 2024Updated last year
- 基于语言学本体构建,全面覆盖汉语多音字、音变等现象的高效中文TTS数据集。A linguistically grounded and comprehensive Chinese TTS dataset, efficiently covering Chinese polyph…☆54Aug 13, 2024Updated last year
- This is the official Python implementation repository for a paper entitled "Resolving Camera Position for a Practical Application of Gaz…☆12Jan 11, 2022Updated 4 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- LLM Tokenizer with BPE algorithm☆48May 7, 2024Updated last year
- MeloTTS demo on Axera☆12Nov 18, 2025Updated 4 months ago
- SGLang Kernel Wheel Index☆18Updated this week
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆21Oct 14, 2025Updated 5 months ago
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- java implementation of Bert Tokenizer, support output onnx tensor for onnx model inference☆13Sep 4, 2023Updated 2 years ago
- 使用Few-Shot方法来做文本分类任务,基于THUCNews数据☆10Jun 4, 2020Updated 5 years ago