y-trainerY-Trainer 是一个LLM模型微调训练框架。 📊 核心优势: 📉 精准对抗过拟合: 专门优化,有效解决SFT中的过拟合难题。 🧩 突破遗忘瓶颈: 无需依赖通用语料,即可卓越地保留模型的泛化能力,守住核心竞争力的同时实现专项提升!🏆
☆43Mar 3, 2026Updated last month
Alternatives and similar repositories for y-trainer
Users that are interested in y-trainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Y-Agent Studio 是一个面向 企业级应用 的Agent开发套,Y-Agent是其中的核心模块。 包含了:支持智能体编排、RAG、流程日志、单元测试、流程测试、语料生产等垂直领域非常需要的功能。 智能体编排可以在同一个流程中,同时支持多智能体协作和流程混合编排…☆26Oct 4, 2025Updated 6 months ago
- 英文文献的《中国图书馆分类法》自动标注小程序☆12Oct 29, 2024Updated last year
- The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)☆16Nov 12, 2024Updated last year
- 人文历史知识图谱 三元组涵盖历史/文学/地理/军事/政治/艺术/科学技术史等学科领域 人物关系网络☆19Sep 4, 2025Updated 7 months ago
- An evaluation bentchmark for classical Chinese☆19Dec 13, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆12Feb 18, 2021Updated 5 years ago
- LMTuner: Make the LLM Better for Everyone☆38Sep 21, 2023Updated 2 years ago
- A collection of datafiles created from the library of congress open data dump☆20May 19, 2017Updated 8 years ago
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- 中国知网论文数据集,24000+篇论文信息。自然语言处理、信息管理、文本分类、文本摘要、关键词抽取、研究热点分析、数据挖掘、数据分析☆53Mar 4, 2025Updated last year
- ☆23Jun 2, 2019Updated 6 years ago
- [Findings of EMNLP 2022] Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks☆13Feb 26, 2023Updated 3 years ago
- Semantic Scaffolds for Pseudocode-to-Code Generation (accepted by ACL 2020)☆14Jun 7, 2021Updated 4 years ago
- Information about the CodedotAI reading group sessions.☆12Aug 16, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 5 years ago
- Python3入门机器学习 经典算法与应用 学习☆11Nov 9, 2018Updated 7 years ago
- Source code for ISSTA'24 paper "AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation"☆12Oct 21, 2024Updated last year
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation☆14May 10, 2025Updated 11 months ago
- https://www.kaggle.com/c/bengaliai-cv19/leaderboard☆12Oct 3, 2023Updated 2 years ago
- The repo contains source code of sampling-based LTL (linear temporal logic) path planning project.☆11Sep 19, 2023Updated 2 years ago
- ☆11Sep 29, 2021Updated 4 years ago
- 术语词典数据集/分词词典/专业词表语料库/词汇知识库/领域词表下载/主题词表/词库/自然语言处理/数据挖掘/深度学习☆30Mar 4, 2025Updated last year
- Adversarial Attack for Pre-trained Code Models☆10Jul 19, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Generating Sentences from Disentangled Syntactic and Semantic Spaces☆11Jun 24, 2019Updated 6 years ago
- ☆11Jul 28, 2021Updated 4 years ago
- Python ESPIRiT implementation☆11Feb 24, 2017Updated 9 years ago
- 备份:人人影视字幕元数据,字幕数据<https://huggingface.co/datasets/qundao/yyets-subtitles>☆12Feb 6, 2025Updated last year
- Quick-start tutorial for specifying a new processor in ghidra☆14Nov 24, 2021Updated 4 years ago
- bamboo是一个中文语言处理系统。☆14Aug 9, 2011Updated 14 years ago
- Last place solutioin to fastMRI Image Reconstruction Challenge 2019 (Single coil track).☆10Dec 8, 2022Updated 3 years ago
- Visualizing ingredient pairings and properties as described in the Flavor Bible☆14Jul 30, 2020Updated 5 years ago
- Code for the ICML 2020 publication "Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continu…☆14Jul 3, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [COLING25] CodeJudge Eval: Can Large Language Models be Good Judges in Code Understanding?☆12Dec 3, 2024Updated last year
- add a Arg: label_smoothing for torch.nn.CrossEntropyLoss()☆14Jan 13, 2021Updated 5 years ago
- Compares two images using Siamese Network (machine learning) trained from a Pytorch Implementation☆10Jul 27, 2021Updated 4 years ago
- Official repository for MalKG☆24Feb 12, 2021Updated 5 years ago
- An Abstractive Summarization(for Datasets in English format) Implementation with Transformer and Pointer-generator☆12Dec 31, 2020Updated 5 years ago
- This repo is the artifact of FUEL☆15Apr 8, 2026Updated last week
- DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment☆16Jan 23, 2024Updated 2 years ago