A Hugging Face (HF) trainer that records the losses of different tasks and objectives.
☆54Mar 12, 2025Updated last year
Alternatives and similar repositories for hf-multitask-trainer
Users who are interested in hf-multitask-trainer are comparing it to the libraries listed below.
- A Chinese version of Llama3, obtained from Llama3 through further CPT, SFT, and ORPO☆16Apr 24, 2024Updated 2 years ago
- WisdoMentor (博导智言) - Series: An LLM for undergraduates (a study aid for university students)☆13May 9, 2024Updated 2 years ago
- Safe OS process execution for Elixir. Zero zombie processes, NIF-based backpressure, PTY support, and cgroup isolation.☆51Apr 17, 2026Updated last month
- An interactive terminal application for streaming and downloading anime from various streaming sources.☆54May 14, 2026Updated 2 weeks ago
- ☆13Mar 28, 2025Updated last year
- ☆22Feb 8, 2026Updated 3 months ago
- ☆35Apr 28, 2025Updated last year
- ☆13Apr 5, 2023Updated 3 years ago
- [CVPR' 26] MajutsuCity: Language-driven Aesthetic-adaptive City Generation with Controllable 3D Assets and Layouts☆44Apr 27, 2026Updated last month
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆70Jul 22, 2025Updated 10 months ago
- Abandon illusions, stay prepared, be ready to interview at any time☆14Dec 17, 2025Updated 5 months ago
- [ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM…☆68Oct 27, 2024Updated last year
- Introduction about AWESOME_ENTROPY+LRM_PAPERS☆30Dec 16, 2025Updated 5 months ago
- Survey on Data-centric Large Language Models☆92Jul 8, 2024Updated last year
- [ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection"☆15Feb 15, 2025Updated last year
- Point Cloud Annotation Tool. Built with PPTK and PyQt.☆15May 18, 2023Updated 3 years ago
- A Rust implementation of Andrej Karpathy's Micrograd☆15Apr 28, 2025Updated last year
- A lightweight, high-performance deep learning inference framework built in Rust. Zen-Infer provides a clean, modular architecture for dep…☆20Jul 31, 2025Updated 9 months ago
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering☆63Dec 5, 2024Updated last year
- GPS-Denied Indoor Navigation System for Drones. Autonomous drone navigation indoors without GPS using optical flow, IMU, and lidar sensor …☆28Nov 20, 2025Updated 6 months ago
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)☆11Jun 12, 2020Updated 5 years ago
- Unsupervised multi-metric fusion for Full-Reference (FR) Image Quality Assessment (IQA)☆11Jul 11, 2014Updated 11 years ago
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021)☆13Nov 21, 2021Updated 4 years ago
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)☆22Mar 29, 2026Updated 2 months ago
- ☆26Apr 26, 2025Updated last year
- PyTorch implementation of QKAN "Quantum-inspired Kolmogorov-Arnold Network" https://arxiv.org/abs/2509.14026☆24May 1, 2026Updated 3 weeks ago
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"☆12Dec 6, 2024Updated last year
- A toolkit to automatically crawl the paper list and download paper PDFs from the ACL Anthology.☆11Nov 12, 2025Updated 6 months ago
- PyTorch implementation for our paper "Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation"☆13Apr 19, 2023Updated 3 years ago
- vue-sequence-diagram☆12Dec 10, 2022Updated 3 years ago
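- Build a large language model / neural network framework from scratch, in order to gain a deep understanding of how large models work under the hood☆19Sep 16, 2025Updated 8 months ago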
- MLLM @ Game☆16May 12, 2025Updated last year
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆40Dec 13, 2024Updated last year
- A visual configuration tool for Rime ✨☆143Jan 15, 2026Updated 4 months ago
- ☆18Apr 17, 2023Updated 3 years ago
- ☆10Mar 28, 2022Updated 4 years ago
- POP3 and SMTP clients implemented starting from sockets, with basic functions such as composing, sending, receiving, reading, and deleting mail, plus a simple interface (PyQt5).☆12Dec 24, 2023Updated 2 years ago
- Golang standards Of Fundamental Astronomy☆19Dec 12, 2023Updated 2 years ago
- Simple Python script that converts qmake .pro project files to CMakeLists.txt. Supports Qt5.☆21Oct 13, 2022Updated 3 years ago