suu990901/LLaMA-MiLe-Loss

Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models

☆68

Alternatives and similar repositories for LLaMA-MiLe-Loss

Users who are interested in LLaMA-MiLe-Loss are comparing it to the libraries listed below.

suu990901 / KlearReasoner
Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization
☆82 · Dec 25, 2025 · Updated 6 months ago
THU-MIG / PrefixKV
PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation [NeurIPS 2025]
☆19 · Oct 11, 2025 · Updated 9 months ago
joker-star-l / ai_lab5
Artificial Intelligence Lab 5: Multimodal Sentiment Classification
☆16 · Jul 14, 2022 · Updated 4 years ago
lukahhcm / Awesome_Environment_Scaling
Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …
☆71 · Jan 28, 2026 · Updated 5 months ago
keikeiqi / MGTTA
AAAI2025
☆13 · Apr 18, 2025 · Updated last year
jiahe7ay / MiniCharacterLLM
A one-click project for getting small-parameter large language models to role-play; it covers everything from data construction to training.
☆27 · Mar 31, 2024 · Updated 2 years ago
pldlgb / nuggets
☆89 · Dec 29, 2023 · Updated 2 years ago
ongdb-contrib / graph-qabot-demo
Graph QABot Demo | Knowledge graph question-answering example
☆14 · Apr 11, 2023 · Updated 3 years ago
zxiang30 / DLFS-Rec
☆10 · Dec 10, 2023 · Updated 2 years ago
chenllliang / ATP-AMR
Source code for paper "ATP: AMRize Than Parse! Enhancing AMR Parsing with PseudoAMRs" @NAACL-2022
☆15 · Mar 31, 2023 · Updated 3 years ago
tanganke / subspace_fusion
Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"
☆14 · Mar 28, 2024 · Updated 2 years ago
caskcsg / lightretriever
Code for LightRetriever: A LLM-based Text Retrieval Architecture with Extremely Faster Query Inference
☆19 · Oct 19, 2025 · Updated 9 months ago
ruz048 / AutoLoRA
☆10 · Apr 16, 2024 · Updated 2 years ago
caskcsg / longcontext
Long Context Research
☆37 · Jan 26, 2026 · Updated 5 months ago
sail-sg / regmix
[ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)
☆194 · Feb 17, 2025 · Updated last year
AmbientTalk / wePoker
wePoker is a multi-player poker game for Android
☆12 · Mar 20, 2013 · Updated 13 years ago
shankarp8 / knowledge_distillation
Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).
☆27 · Aug 25, 2024 · Updated last year
tanganke / pareto_set_learning
Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"
☆11 · Sep 13, 2024 · Updated last year
adriaan-vd-graaf / genome_integration
MR-link and genome integration. genome_integration is a repository for the analysis of genomic data. Specifically, the repository impleme…
☆11 · Jul 8, 2022 · Updated 4 years ago
Gary-code / Awesome-LVLM-paper
List of papers about Large Multimodal model
☆30 · May 31, 2025 · Updated last year
lqtrung1998 / mwp_cot_design
☆14 · Oct 11, 2023 · Updated 2 years ago
liushulinle / UloRL
An Ultra-Long Output Reinforcement Learning Approach
☆23 · Jul 31, 2025 · Updated 11 months ago
google-deepmind / scaling_laws_for_routing
☆14 · Jul 21, 2022 · Updated 4 years ago
yataoz / face_reenact_GDPW
Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation
☆12 · Jan 6, 2023 · Updated 3 years ago
ZrW00 / MuScleLoRA
The code implementation of MuScleLoRA (Accepted in ACL 2024)
☆10 · Dec 1, 2024 · Updated last year
pingponglabs / FaceAnime
☆10 · Apr 22, 2021 · Updated 5 years ago
bigai-nlco / RuleReasoner
[ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
☆39 · Feb 25, 2026 · Updated 4 months ago
shibing624 / text2vec-service
Service for BERT model to vector. An efficient text-to-vector service that supports multi-GPU, multi-worker, and multi-client calls, ready to use out of the box.
☆12 · May 24, 2022 · Updated 4 years ago
psunlpgroup / FoVer
This repository includes code and materials for the paper "Efficient PRM Training Data Synthesis via Formal Verification" (ACL 2026 Findi…
☆18 · Apr 7, 2026 · Updated 3 months ago
YujieLu10 / Seeker
☆11 · May 24, 2024 · Updated 2 years ago
EIDOSLAB / unbiased-contrastive-learning
Code for the paper "Unbiased Supervised Contrastive Learning" | ICLR 2023 https://openreview.net/forum?id=Ph5cJSfD2XN
☆12 · Sep 22, 2023 · Updated 2 years ago
GuoTianYu2000 / Active-Dormant-Attention
codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"
☆11 · Dec 30, 2024 · Updated last year
benbates30 / tiger_implementation
Un-official implementation of the Transformer Index for GEnerative Recommenders (TIGER) framework.
☆13 · Jun 6, 2023 · Updated 3 years ago
falonss703 / Awesome-Uncertainty-based-Reinforcement-Learning
🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL
☆58 · Aug 24, 2025 · Updated 10 months ago
Tlntin / booking_simulator
☆11 · Jan 6, 2024 · Updated 2 years ago
nju-websoft / KnowLA
KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation, NAACL 2024
☆16 · Jul 29, 2024 · Updated last year
Vannch16 / DeepLearning_ECG_rec
DeepLearning Project on ECG recognition
☆17 · Dec 15, 2020 · Updated 5 years ago
rdpackages / rddensity
Manipulation Testing Using Local Polynomial Density Methods
☆12 · Jul 9, 2026 · Updated last week
fchest / Speech-Transformer-multi-GPUs
A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…
☆10 · Dec 25, 2019 · Updated 6 years ago