Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models
☆68Feb 18, 2025Updated last year
Alternatives and similar repositories for LLaMA-MiLe-Loss
Users that are interested in LLaMA-MiLe-Loss are comparing it to the libraries listed below
Sorting:
- This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting☆24Jul 30, 2024Updated last year
- 🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL☆60Aug 24, 2025Updated 6 months ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆26Aug 25, 2024Updated last year
- List of papers about Large Multimodal model☆31May 31, 2025Updated 9 months ago
- Structured TRIZ prompt engineering for LLMs in an open, portable XML format – MIT licensed.☆16Nov 11, 2025Updated 3 months ago
- [ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models☆42Oct 28, 2025Updated 4 months ago
- ☆11Jun 11, 2024Updated last year
- ☆11Oct 25, 2024Updated last year
- ☆10Dec 10, 2023Updated 2 years ago
- 足球比赛预测☆10Mar 9, 2021Updated 5 years ago
- 记录有用的Git repos☆12Jul 28, 2024Updated last year
- ☆11May 24, 2024Updated last year
- AAAI2025☆11Apr 18, 2025Updated 10 months ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆81Dec 25, 2025Updated 2 months ago
- Unofficial implementation of Meta's MovieGen models☆16Nov 25, 2025Updated 3 months ago
- Motion-sensing game control system based on bone point recognition☆10Dec 1, 2023Updated 2 years ago
- Official PyTorch code for "Vector Quantization Prompting for Continual Learning (NeurIPS2024)".☆10Oct 16, 2024Updated last year
- Accelerating GOT-OCRv2 with VLLM☆11Nov 15, 2024Updated last year
- QuTrunk is free, open source, cross platform quantum computing programming framework, including quantum programming API, quantum command …☆17Dec 18, 2023Updated 2 years ago
- [CVPR 2023] "TrojViT: Trojan Insertion in Vision Transformers" by Mengxin Zheng, Qian Lou, Lei Jiang☆14Jan 5, 2024Updated 2 years ago
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆33Nov 1, 2025Updated 4 months ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- The code implementation of MuScleLoRA (Accepted in ACL 2024)☆10Dec 1, 2024Updated last year
- ☆34Jan 9, 2026Updated 2 months ago
- Experiments with reasoning models, training techniques, papers☆25Updated this week
- R package for sparse VAR estimation☆12Feb 5, 2026Updated last month
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation☆12Jan 6, 2023Updated 3 years ago
- Finetuning LLaMA with DeepSpeed☆10Apr 14, 2023Updated 2 years ago
- Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights☆32Jan 9, 2026Updated 2 months ago
- ☆13Jul 8, 2020Updated 5 years ago
- A Structured Grammar for Chart Annotation☆15May 8, 2025Updated 10 months ago
- ☆10Apr 22, 2021Updated 4 years ago
- Un-official implementation of the Transformer Index for GEnerative Recommenders (TIGER) framework.☆13Jun 6, 2023Updated 2 years ago
- Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"☆11Sep 13, 2024Updated last year
- ☆105May 30, 2023Updated 2 years ago
- LeetCode sources☆13Sep 1, 2013Updated 12 years ago
- Boosting Natural Language Generation from Instructions with Meta-Learning☆11Dec 20, 2022Updated 3 years ago
- 使用指令微调对大模型进行微调。☆11Jun 28, 2023Updated 2 years ago
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Sep 12, 2023Updated 2 years ago