LMTuner: Make the LLM Better for Everyone
☆38Sep 21, 2023Updated 2 years ago
Alternatives and similar repositories for LMTuner
Users that are interested in LMTuner are comparing it to the libraries listed below
Sorting:
- Source code for "An Empirical Study of Code Smells in Transformer-based Code Generation Techniques".☆11Oct 4, 2022Updated 3 years ago
- Answering Ambiguous Questions via Iterative Prompting☆14May 25, 2024Updated last year
- sigma-MoE layer☆21Jan 5, 2024Updated 2 years ago
- ☆15Oct 26, 2021Updated 4 years ago
- Fork of Flame repo for training of some new stuff in development☆19Feb 20, 2026Updated last week
- Transmute AI Lab Model Efficiency Toolkit☆19Oct 2, 2023Updated 2 years ago
- ☆27Nov 25, 2025Updated 3 months ago
- Complexity Based Prompting for Multi-Step Reasoning☆17Mar 10, 2023Updated 2 years ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Oct 3, 2025Updated 5 months ago
- Public repository for "Think Twice: Perspective-Taking Improves Large Language Models’ Theory-of-Mind Capabilities".☆23Aug 16, 2023Updated 2 years ago
- ☆23Mar 25, 2023Updated 2 years ago
- Official code implementation for our paper -- Direct Inversion: Optimization-Free Text-Driven Real Image Editing with Diffusion Models.☆27Nov 18, 2022Updated 3 years ago
- ☆29May 4, 2024Updated last year
- ☆32Jan 1, 2024Updated 2 years ago
- [ACL 2023 Findings] Emergent Modularity in Pre-trained Transformers☆26Jun 7, 2023Updated 2 years ago
- ☆26Nov 23, 2023Updated 2 years ago
- Masked Structural Growth for 2x Faster Language Model Pre-training☆25Apr 28, 2024Updated last year
- ☆24Nov 16, 2023Updated 2 years ago
- An Experiment on Dynamic NTK Scaling RoPE☆64Nov 26, 2023Updated 2 years ago
- ☆62Dec 8, 2023Updated 2 years ago
- Longitudinal Evaluation of LLMs via Data Compression☆33May 29, 2024Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆148Nov 9, 2024Updated last year
- ⚡ Instantly turn your python function into web app.☆40May 25, 2023Updated 2 years ago
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆207Feb 18, 2026Updated last week
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer☆30Dec 6, 2023Updated 2 years ago
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Aug 19, 2023Updated 2 years ago
- HeadlessPivot☆29Updated this week
- Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models☆341Feb 23, 2025Updated last year
- ☆84Nov 10, 2025Updated 3 months ago
- Codes of Approximated Oracle Filter Pruning☆32Sep 8, 2022Updated 3 years ago
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance…☆156Apr 7, 2025Updated 10 months ago
- The official implementation of EMNLP 2021 paper "#HowYouTagTweets: Learning User Hashtagging Preferences via Personalized Topic Attention…☆11Feb 21, 2023Updated 3 years ago
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks☆11Jun 19, 2025Updated 8 months ago
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!☆40Feb 1, 2024Updated 2 years ago
- [CVPR 2024] The official pytorch implementation of "A General and Efficient Training for Transformer via Token Expansion".☆47Apr 22, 2024Updated last year
- ☆13Aug 3, 2024Updated last year
- Big Data and Machine Intelligence, Spring 2021.☆12Jul 2, 2021Updated 4 years ago
- ☆10Apr 7, 2025Updated 10 months ago