The trainer for HF to record losses of different tasks and objectives.
☆54Mar 12, 2025Updated last year
Alternatives and similar repositories for hf-multitask-trainer
Users that are interested in hf-multitask-trainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆16Apr 24, 2024Updated last year
- 百度地图坐标拾取工具☆12Jan 27, 2018Updated 8 years ago
- ☆36Feb 28, 2026Updated last month
- [WACV2025] source code of StrDA: https://arxiv.org/abs/2410.09913☆18Apr 15, 2025Updated 11 months ago
- Safe OS process execution for Elixir. Zero zombie processes, NIF-based backpressure, PTY support, and cgroup isolation.☆46Mar 21, 2026Updated last week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- WisdoMentor - Series: A LLM for undergraduates | 博导智言(辅助大学生 学习)☆13May 9, 2024Updated last year
- Simplifies Chrome DevTools MCP setup for WSL and Docker by automating Chrome remote debugging and network proxy bridging.☆44Mar 19, 2026Updated last week
- An interactive terminal application for streaming and downloading anime from various streaming sources.☆54Updated this week
- Argy: Command-line parsing library for modern C++ — simple, intuitive, and header-only with no dependencies.☆31Aug 31, 2025Updated 6 months ago
- DAG-based workflow engine for Android and low-resource environments, built for mobile-first automation..☆33Mar 17, 2026Updated last week
- Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grain…☆112Aug 21, 2025Updated 7 months ago
- A python implementation of wego☆15Oct 15, 2016Updated 9 years ago
- A KV storage engine based on LSM Tree, supporting Redis RESP☆32Sep 14, 2025Updated 6 months ago
- Multi-agent investing agent using Claude Agent SDK☆13Oct 3, 2025Updated 5 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- grbl porting for stc mcu☆14Aug 19, 2024Updated last year
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆68Jul 22, 2025Updated 8 months ago
- 放弃幻想、时刻准备、随时面试☆14Dec 17, 2025Updated 3 months ago
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆18May 28, 2025Updated 10 months ago
- [ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM…☆68Oct 27, 2024Updated last year
- Introduction about AWESOME_ENTROPY+LRM_PAPERS☆30Dec 16, 2025Updated 3 months ago
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 6 months ago
- A self-made NeurIPS poster template, infused with the unique design style of ShanghaiTech.☆15Dec 26, 2023Updated 2 years ago
- ☆23Mar 4, 2026Updated 3 weeks ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection"☆15Feb 15, 2025Updated last year
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering☆63Dec 5, 2024Updated last year
- [CVPR 2024] GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding☆18Jun 10, 2024Updated last year
- PyTorch implementation of QKAN "Quantum-inspired Kolmogorov-Arnold Network" https://arxiv.org/abs/2509.14026☆21Mar 9, 2026Updated 3 weeks ago
- Unsupervised muti-metric fusion for Full-Reference (FR) Image Quality Assessment (IQA)☆11Jul 11, 2014Updated 11 years ago
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Nov 21, 2021Updated 4 years ago
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)☆21Mar 10, 2026Updated 2 weeks ago
- TypeScript 컴시간알리미 파서☆14Mar 14, 2026Updated 2 weeks ago
- TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models☆19Jan 2, 2025Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"☆12Dec 6, 2024Updated last year
- 前后端分离的代码发布系统, 前端vue, 后端go☆13Nov 2, 2023Updated 2 years ago
- ☆30Jan 11, 2026Updated 2 months ago
- 从零搭建大语言模型/神经网络框架,以达到深入理解大模型底层运行机制的目的☆19Sep 16, 2025Updated 6 months ago
- PyTorch implementation for our paper "Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation"☆13Apr 19, 2023Updated 2 years ago
- vue-sequence-diagram☆12Dec 10, 2022Updated 3 years ago
- For our ISSTA'23 paper ACETest: Automated Constraint Extraction for Testing Deep Learning Operators☆17Mar 30, 2024Updated 2 years ago