The trainer for HF to record losses of different tasks and objectives.
☆54Mar 12, 2025Updated last year
Alternatives and similar repositories for hf-multitask-trainer
Users that are interested in hf-multitask-trainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆16Apr 24, 2024Updated 2 years ago
- lightweight and scalable whole-body teleoperation framework for humanoid robots☆93Apr 21, 2026Updated 2 weeks ago
- WisdoMentor - Series: A LLM for undergraduates | 博导智言(辅助大学生 学习)☆13May 9, 2024Updated 2 years ago
- Safe OS process execution for Elixir. Zero zombie processes, NIF-based backpressure, PTY support, and cgroup isolation.☆50Apr 17, 2026Updated 3 weeks ago
- An interactive terminal application for streaming and downloading anime from various streaming sources.☆54Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- DAG-based workflow engine for Android and low-resource environments, built for mobile-first automation..☆33Mar 17, 2026Updated last month
- Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grain…☆113Aug 21, 2025Updated 8 months ago
- ☆26Apr 22, 2026Updated 2 weeks ago
- A KV storage engine based on LSM Tree, supporting Redis RESP☆34Sep 14, 2025Updated 7 months ago
- grbl porting for stc mcu☆14Apr 1, 2026Updated last month
- HEtero-Assists Distillation for Heterogeneous Object Detectors☆10Jul 3, 2023Updated 2 years ago
- [ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM…☆68Oct 27, 2024Updated last year
- [ACM MM 2025] Multi-Object Sketch Animation with Grouping and Motion Trajectory Priors☆43Aug 14, 2025Updated 8 months ago
- Introduction about AWESOME_ENTROPY+LRM_PAPERS☆30Dec 16, 2025Updated 4 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 7 months ago
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- Code for the paper "Closing the Curious Case of Neural Text Degeneration"☆12Apr 9, 2025Updated last year
- A rust implementation of Andrej Karpathy's Micrograd☆15Apr 28, 2025Updated last year
- A lightweight, high-performance deep learning inference framework built in Rust. Zen-Infer provides a clean, modular architecture for dep…☆20Jul 31, 2025Updated 9 months ago
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering☆63Dec 5, 2024Updated last year
- Unsupervised muti-metric fusion for Full-Reference (FR) Image Quality Assessment (IQA)☆11Jul 11, 2014Updated 11 years ago
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)☆22Mar 29, 2026Updated last month
- A modern Craft CMS starter kit for agencies and developers — featuring Vite, Tailwind, Datastar, DDEV, MCP, LLM Ready.☆29Apr 30, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Survey of Learning To Rank☆15Nov 13, 2025Updated 5 months ago
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"☆12Dec 6, 2024Updated last year
- ☆31Jan 11, 2026Updated 3 months ago
- [NeurIPS D&B'24]Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection☆24Mar 25, 2026Updated last month
- 前后端分离的代码发布系统, 前端vue, 后端go☆12Apr 22, 2026Updated 2 weeks ago
- Minimalist LLM Grammar Checker for macOS☆21Feb 22, 2026Updated 2 months ago
- Official code for Generative Fractional Diffusion Models☆17Jan 16, 2025Updated last year
- PyTorch implementation for our paper "Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation"☆13Apr 19, 2023Updated 3 years ago
- 从零搭建大语言模型/神经网络框架,以达到深入理解大模型底层运行机制的目的☆19Sep 16, 2025Updated 7 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- vue-sequence-diagram☆12Dec 10, 2022Updated 3 years ago
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆32Dec 9, 2025Updated 5 months ago
- ☆10Mar 28, 2022Updated 4 years ago
- rsbuild svg loader☆13Nov 11, 2024Updated last year
- 从socket开始实现pop3和smtp客户端,实现邮件编写、发送、接收、阅读、删除等基本功能。并实现简单界面(PyQt5)Start from socket to implement pop3 and smtp clients, to realize the basic …☆12Dec 24, 2023Updated 2 years ago
- This is the official repo for Contrastive Vision-Language Alignment Makes Efficient Instruction Learner.☆20Dec 1, 2023Updated 2 years ago
- [TOG 2025] Order Matters: Learning Element Ordering for Graphic Design Generation☆24Aug 5, 2025Updated 9 months ago