y-trainerY-Trainer 是一个LLM模型微调训练框架。 📊 核心优势: 📉 精准对抗过拟合: 专门优化,有效解决SFT中的过拟合难题。 🧩 突破遗忘瓶颈: 无需依赖通用语料,即可卓越地保留模型的泛化能力,守住核心竞争力的同时实现专项提升!🏆
☆44Mar 3, 2026Updated 2 months ago
Alternatives and similar repositories for y-trainer
Users that are interested in y-trainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Y-Agent Studio 是一个面向 企业级应用 的Agent开发套,Y-Agent是其中的核心模块。 包含了:支持智能体编排、RAG、流程日志、单元测试、流程测试、语料生产等垂直领域非常需要的功能。 智能体编排可以在同一个流程中,同时支持多智能体协作和流程混合编排…☆27Oct 4, 2025Updated 7 months ago
- 英文文献的《中国图书馆分类法》自动标注小程序☆12Oct 29, 2024Updated last year
- The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)☆16Nov 12, 2024Updated last year
- 人文历史知识图谱 三元组涵盖历史/文学/地理/军事/政治/艺术/科学技术史等学科领域 人物关系网络☆19Sep 4, 2025Updated 8 months ago
- ☆12Feb 18, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- LMTuner: Make the LLM Better for Everyone☆38Sep 21, 2023Updated 2 years ago
- A collection of datafiles created from the library of congress open data dump☆20May 19, 2017Updated 8 years ago
- 中文恶意网页检测数据集与检测方法☆21Mar 4, 2025Updated last year
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- Sequence Generation with Label Augmentation for Relation Extraction, AAAI2023☆18Feb 13, 2023Updated 3 years ago
- 中国知网论文数据集,24000+篇论文信息。自然语言处理、信息管理、文本分类、文本摘要、关键词抽取、研究热点分析、数据挖掘、数据分析☆53Mar 4, 2025Updated last year
- ☆23Jun 2, 2019Updated 6 years ago
- [Findings of EMNLP 2022] Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks☆13Feb 26, 2023Updated 3 years ago
- Semantic Scaffolds for Pseudocode-to-Code Generation (accepted by ACL 2020)☆14Jun 7, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 5 years ago
- Information about the CodedotAI reading group sessions.☆12Aug 16, 2021Updated 4 years ago
- Source code for ISSTA'24 paper "AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation"☆12Oct 21, 2024Updated last year
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation☆14May 10, 2025Updated last year
- ☆11Sep 29, 2021Updated 4 years ago
- CodeRepoQA dataset☆15Feb 19, 2025Updated last year
- 术语词典数据集/分词词典/专业词表语料库/词汇知识库/领域词表下载/主题词表/词库/自然语言处理/数据挖掘/深度学习☆30Mar 4, 2025Updated last year
- Adversarial Attack for Pre-trained Code Models☆10Jul 19, 2022Updated 3 years ago
- Generating Sentences from Disentangled Syntactic and Semantic Spaces☆11Jun 24, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Jul 28, 2021Updated 4 years ago
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.☆14Oct 12, 2024Updated last year
- 备份:人人影视字幕元数据,字幕数据<https://huggingface.co/datasets/qundao/yyets-subtitles>☆12Feb 6, 2025Updated last year
- Offical implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".☆21Oct 29, 2025Updated 6 months ago
- bamboo是一个中文语言处理系统。☆14Aug 9, 2011Updated 14 years ago
- Last place solutioin to fastMRI Image Reconstruction Challenge 2019 (Single coil track).☆10Dec 8, 2022Updated 3 years ago
- Citrus-V: Advancing Medical Foundation Models with Unified Medical Image Grounding for Clinical Reasoning☆19Sep 26, 2025Updated 7 months ago
- Code for the ICML 2020 publication "Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continu…☆14Jul 3, 2020Updated 5 years ago
- [COLING25] CodeJudge Eval: Can Large Language Models be Good Judges in Code Understanding?☆12Dec 3, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- add a Arg: label_smoothing for torch.nn.CrossEntropyLoss()☆14Jan 13, 2021Updated 5 years ago
- Official repository for MalKG☆24Feb 12, 2021Updated 5 years ago
- Compares two images using Siamese Network (machine learning) trained from a Pytorch Implementation☆10Jul 27, 2021Updated 4 years ago
- An Abstractive Summarization(for Datasets in English format) Implementation with Transformer and Pointer-generator☆12Dec 31, 2020Updated 5 years ago
- This repo is the artifact of FUEL☆16Apr 24, 2026Updated 2 weeks ago
- DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment☆16Jan 23, 2024Updated 2 years ago
- ☆11Jul 5, 2020Updated 5 years ago