zipzou / hf-multitask-trainer
The trainer for HF to record losses of different tasks and objectives.
☆37 · Updated 2 weeks ago
Alternatives and similar repositories for hf-multitask-trainer:
Users who are interested in hf-multitask-trainer are comparing it to the libraries listed below.
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)" ☆170 · Updated 3 weeks ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs ☆98 · Updated 2 weeks ago
- LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training ☆75 · Updated 3 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning ☆161 · Updated last year
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models. ☆69 · Updated 4 months ago
- ☆131 · Updated 8 months ago
- M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆56 · Updated 3 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing". ☆73 · Updated 2 months ago
- ☆83 · Updated 2 weeks ago
- The official code repository for PRMBench. ☆68 · Updated last month
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆107 · Updated 11 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆65 · Updated this week
- ☆27 · Updated 3 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati… ☆33 · Updated 8 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs ☆71 · Updated last year
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context" ☆31 · Updated 8 months ago
- A Survey on the Honesty of Large Language Models ☆56 · Updated 3 months ago
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same… ☆44 · Updated 4 months ago
- The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark" ☆47 · Updated this week
- ☆166 · Updated last month
- ☆64 · Updated 9 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models ☆158 · Updated 9 months ago
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings ☆153 · Updated 9 months ago
- ☆48 · Updated last month
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆104 · Updated last week
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024) ☆44 · Updated 5 months ago
- A Survey on Efficient Reasoning for LLMs ☆116 · Updated this week
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379) ☆34 · Updated 11 months ago
- A method of ensemble learning for heterogeneous large language models. ☆42 · Updated 7 months ago
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models" ☆28 · Updated last month