基于DPO算法微调语言大模型,简单好上手。
☆52Jul 3, 2024Updated last year
Alternatives and similar repositories for Simple-Trl-Training
Users that are interested in Simple-Trl-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Aug 9, 2024Updated last year
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆432Jun 2, 2026Updated 3 weeks ago
- Attentive Knowledge-aware Graph Convolutional Networks with Collaborative Guidance for Personalized Recommendation☆11Sep 22, 2022Updated 3 years ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆24Nov 1, 2025Updated 8 months ago
- 把自己期末考前整理的一些上课资料以及作业放上GitHub,希望后面的学弟学妹们有参考价值(笑☆14Nov 3, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆29Sep 4, 2025Updated 9 months ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- 神经网络各种模型PyTorch实现☆43Dec 25, 2022Updated 3 years ago
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration☆21Mar 21, 2025Updated last year
- ☆12May 13, 2023Updated 3 years ago
- 中文关键词提取☆14Aug 7, 2023Updated 2 years ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆15Jun 21, 2024Updated 2 years ago
- Software relating to relational empirical risk minimization☆16Jun 12, 2021Updated 5 years ago
- 知识表示和推理项目,收集知识表示和推理算法,部分算法给出了应用案例。☆13Apr 26, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- [ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensi…☆22Jun 12, 2025Updated last year
- This is the official repository of Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities☆42Apr 28, 2026Updated 2 months ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"☆12May 27, 2025Updated last year
- Fixed version of https://github.com/tomguluson92/PRNet_PyTorch☆10Mar 30, 2020Updated 6 years ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆40Sep 30, 2025Updated 9 months ago
- ☆122Jun 30, 2024Updated 2 years ago
- Official code for "KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation"☆72Jun 13, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Gradually Updated Neural Networks for Large-Scale Image Recognition at ICML 2018☆10Jun 25, 2018Updated 8 years ago
- An unsupervised text summarization and information retrieval library under the hood using natural language processing models☆15Dec 11, 2020Updated 5 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆56Oct 29, 2024Updated last year
- A basic deep learning library, comparable to a very minimal version of PyTorch.☆19Mar 1, 2023Updated 3 years ago
- Gaze decomposition for appearance-based gaze estimation☆12Mar 15, 2020Updated 6 years ago
- This is the official Python implementation repository for a paper entitled "Resolving Camera Position for a Practical Application of Gaz…☆12Jan 11, 2022Updated 4 years ago
- [AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615☆67Nov 8, 2025Updated 7 months ago
- LLM Tokenizer with BPE algorithm☆49May 7, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- This paper is accpeted by WSDM 2023☆13Mar 13, 2023Updated 3 years ago
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆23Oct 14, 2025Updated 8 months ago
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- 使用Few-Shot方法来做文本分类任务,基于THUCNews数据☆10Jun 4, 2020Updated 6 years ago
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"☆11Mar 24, 2023Updated 3 years ago
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models☆56May 5, 2026Updated last month
- Basic Tools☆13Dec 18, 2021Updated 4 years ago