hongzhouyu / FineMedLinks

The codebase and some introductions of FineMed.

☆23

Alternatives and similar repositories for FineMed

Users that are interested in FineMed are comparing it to the libraries listed below

Sorting:

waltonfuture / Diff-eRank
[NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models
☆50Updated last month
liangyuwang / zo2
ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory
☆151Updated this week
microsoft / x-reasoner
X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains
☆46Updated 2 months ago
Dereck0602 / Awesome_Test_Time_LLMs
☆114Updated 4 months ago
THUDM / Awesome-Parameter-Efficient-Fine-Tuning-for-Foundation-Models
Parameter-Efficient Fine-Tuning for Foundation Models
☆75Updated 3 months ago
testtimescaling / testtimescaling.github.io
"what, how, where, and how well? a survey on test-time scaling in large language models" repository
☆52Updated last week
MingyuJ666 / Disentangling-Memory-and-Reasoning
[ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.
☆67Updated 2 months ago
UCSC-VLAA / m1
m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models
☆37Updated 3 months ago
Quinn777 / AtomThink-preview
☆54Updated 4 months ago
RUCAIBox / Virgo
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆105Updated last month
GuanghaoYe / Emergence-of-Thinking
☆52Updated 5 months ago
TUDB-Labs / MoE-PEFT
An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT
☆106Updated 4 months ago
LINs-lab / DynMoE
[ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
☆118Updated last week
TUDB-Labs / MixLoRA
State-of-the-art Parameter-Efficient MoE Fine-tuning Method
☆169Updated 10 months ago
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆77Updated 5 months ago
MingLiiii / Layer_Gradient
[ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
☆70Updated 3 weeks ago
RUCKBReasoning / CoT-based-Synthesizer
Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'
☆27Updated 2 months ago
Chen-GX / C-3PO
[ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…
☆36Updated 2 months ago
WillDreamer / Awesome-MLLM-Reasoning
Recent Advances on MLLM's Reasoning Ability
☆24Updated 3 months ago
dongxiangjue / Awesome-LLM-Self-Improvement
A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …
☆85Updated 6 months ago
yafuly / TPO
Test-time preferenece optimization (ICML 2025).
☆147Updated 2 months ago
shiqichen17 / VLM_Merging
Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)
☆64Updated last month
Raibows / CREAM
Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.
☆22Updated 5 months ago
Ahren09 / AgentReview
Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."
☆77Updated 8 months ago
UCSC-VLAA / VLAA-Thinking
SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
☆124Updated 2 months ago
maple-research-lab / SLOT
☆96Updated last month
waltonfuture / MM-UPT
Unsupervised GRPO
☆39Updated last month
deepglint / UniME
[ACM MM25] The official code of "Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs"
☆81Updated 2 weeks ago
TEAM-ARM / arm
ARM: Adaptive Reasoning Model
☆44Updated last month
THU-KEG / AdaptThink
☆136Updated last month