The trainer for HF to record losses of different tasks and objectives.
☆54Mar 12, 2025Updated last year
Alternatives and similar repositories for hf-multitask-trainer
Users that are interested in hf-multitask-trainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆16Apr 24, 2024Updated last year
- Lightweight and scalable whole-body teleoperation framework for humanoid robots☆53Apr 10, 2026Updated last week
- [WACV2025] source code of StrDA: https://arxiv.org/abs/2410.09913☆18Apr 15, 2025Updated last year
- WisdoMentor - Series: A LLM for undergraduates | 博导智言(辅助大学生 学习)☆13May 9, 2024Updated last year
- An interactive terminal application for streaming and downloading anime from various streaming sources.☆54Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Mar 28, 2025Updated last year
- [Spotlight ICLR 2023 paper] Continual evaluation for lifelong learning with neural networks, identifying the stability gap.☆35Apr 2, 2023Updated 3 years ago
- ☆13Apr 5, 2023Updated 3 years ago
- Easy modernBERT fine-tuning and multi-task learning☆65Mar 13, 2026Updated last month
- ☆23Mar 15, 2026Updated last month
- A python implementation of wego☆15Oct 15, 2016Updated 9 years ago
- A KV storage engine based on LSM Tree, supporting Redis RESP☆33Sep 14, 2025Updated 7 months ago
- Multi-agent investing agent using Claude Agent SDK☆13Oct 3, 2025Updated 6 months ago
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆68Jul 22, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 放弃幻 想、时刻准备、随时面试☆14Dec 17, 2025Updated 4 months ago
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆18May 28, 2025Updated 10 months ago
- [ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM…☆68Oct 27, 2024Updated last year
- 🧩 Features: AI assistant with image analysis, Smart RSS feed, Weather forecasts, Reminders, Currency converter, Chinese command aliases,…☆21May 6, 2025Updated 11 months ago
- Survey on Data-centric Large Language Models☆94Jul 8, 2024Updated last year
- ☆25Mar 4, 2026Updated last month
- [NeurIPS D&B'24]Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection☆22Mar 25, 2026Updated 3 weeks ago
- Point Cloud Annotation Tool. Built with PPTK and PyQt.☆15May 18, 2023Updated 2 years ago
- A rust implementation of Andrej Karpathy's Micrograd☆15Apr 28, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A lightweight, high-performance deep learning inference framework built in Rust. Zen-Infer provides a clean, modular architecture for dep…☆20Jul 31, 2025Updated 8 months ago
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)☆11Jun 12, 2020Updated 5 years ago
- Unsupervised muti-metric fusion for Full-Reference (FR) Image Quality Assessment (IQA)☆11Jul 11, 2014Updated 11 years ago
- PyTorch implementation of QKAN "Quantum-inspired Kolmogorov-Arnold Network" https://arxiv.org/abs/2509.14026☆23Apr 7, 2026Updated last week
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)☆22Mar 29, 2026Updated 3 weeks ago
- A modern Craft CMS starter kit for agencies and developers — featuring Vite, Tailwind, Datastar, DDEV and Claude Code MCP.☆26Updated this week
- TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models☆19Jan 2, 2025Updated last year
- TypeScript 컴시간알리미 파서☆14Mar 31, 2026Updated 2 weeks ago
- Rime 配置可视化工具 ✨☆138Jan 15, 2026Updated 3 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆30Jan 11, 2026Updated 3 months ago
- 一个(包含)纯真IP库的单一可执行文件☆17Nov 13, 2025Updated 5 months ago
- 从零搭建大语言模型/神经网络框架,以达到深入理解大模型底层运行机制的目的☆19Sep 16, 2025Updated 7 months ago
- vue-sequence-diagram☆12Dec 10, 2022Updated 3 years ago
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆30Dec 9, 2025Updated 4 months ago
- For our ISSTA'23 paper ACETest: Automated Constraint Extraction for Testing Deep Learning Operators☆17Mar 30, 2024Updated 2 years ago
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆41Dec 13, 2024Updated last year