AIRI-Institute / LLM-Microscope
☆32Updated 3 weeks ago
Alternatives and similar repositories for LLM-Microscope:
Users that are interested in LLM-Microscope are comparing it to the libraries listed below
- ☆60Updated last month
- Aioli: A unified optimization framework for language model data mixing☆22Updated 2 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 6 months ago
- ☆36Updated 6 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆28Updated last month
- ☆30Updated 2 months ago
- ☆42Updated last month
- ☆16Updated 2 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 7 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆31Updated last month
- ☆13Updated 3 months ago
- A repository for research on medium sized language models.☆76Updated 10 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆24Updated 4 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆84Updated 4 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆80Updated last month
- ☆24Updated 6 months ago
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models☆18Updated 2 weeks ago
- This repo is based on https://github.com/jiaweizzhao/GaLore☆26Updated 6 months ago
- ☆76Updated 2 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆32Updated 5 months ago
- Exploration of automated dataset selection approaches at large scales.☆33Updated 3 weeks ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆24Updated 5 months ago
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆54Updated 4 months ago
- This is the official repository for Inheritune.☆109Updated last month
- The first dense retrieval model that can be prompted like an LM☆67Updated 6 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆41Updated last month
- Lottery Ticket Adaptation☆38Updated 4 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆75Updated 2 weeks ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆83Updated last week
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆36Updated last year