Source code for Activated LoRA
☆24Nov 22, 2025Updated 4 months ago
Alternatives and similar repositories for activated-lora
Users that are interested in activated-lora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Mar 7, 2025Updated last year
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated last year
- Evaluating GPT-OSS on BrowseComp-Plus with Native Browsering Tools☆18Oct 17, 2025Updated 5 months ago
- A user-friendly interface built on top of Thinking Machines Tinker API that lets you fine-tune LLMs, chat with your trained model, and de…☆28Jan 31, 2026Updated last month
- codebase release for EMNLP2023 paper publication☆19Sep 18, 2025Updated 6 months ago
- ☆24Jan 29, 2026Updated last month
- Finetune Sesame's CSM 1B model, for fun and profit☆17Mar 24, 2025Updated last year
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆28Dec 18, 2024Updated last year
- ☆19Jul 25, 2025Updated 7 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆47Mar 20, 2025Updated last year
- ☆23Feb 16, 2022Updated 4 years ago
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.☆56Updated this week
- From Llama to Deepseek, grpo/mtp implemented. With pt/sft/lora/qlora included☆30Apr 21, 2025Updated 11 months ago
- FormulaOne: A dataset of algorithmic problems based on MSO formulas.☆25Mar 1, 2026Updated 3 weeks ago
- [NeurIPS 2025 Oral] Official Code for Exploring Diffusion Transformer Designs via Grafting☆72Jan 9, 2026Updated 2 months ago
- A 7B parameter model for mathematical reasoning☆42Feb 17, 2025Updated last year
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 3 months ago
- ☆12May 18, 2022Updated 3 years ago
- Evaluation metrics to compare AMR graphs based on Smatch☆29Feb 10, 2020Updated 6 years ago
- The official baseline implementations for Chronocept☆10Dec 21, 2025Updated 3 months ago
- Conversion scripts for coreference☆29Sep 30, 2024Updated last year
- Faster and Lighter LoRA Implementations☆13Nov 21, 2024Updated last year
- training models at home☆34Updated this week
- ☆18Oct 6, 2022Updated 3 years ago
- Tutorial to implement Liquid Time-Constant Neural Network from scratch (eng\rus)☆44May 22, 2024Updated last year
- cursor logs with gpt-4o using litellm proxy☆14Sep 9, 2025Updated 6 months ago
- ☆33Nov 5, 2025Updated 4 months ago
- Set of scripts to finetune LLMs☆38Mar 30, 2024Updated last year
- Telegram bot protecting student chats☆22Jan 23, 2026Updated 2 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆63Oct 7, 2024Updated last year
- An in-memory compressed cache for gigabytes of data written in Go.☆19Feb 6, 2023Updated 3 years ago
- Dialogue Act classification☆18Jan 15, 2024Updated 2 years ago
- ☆16Nov 24, 2025Updated 4 months ago
- Think of it as an open-source alternative to expensive solutions like the MouthPad, eye-trackers, or even complex systems like Neuralink.…☆38Updated this week
- [ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference☆286May 1, 2025Updated 10 months ago
- Defeating the Training-Inference Mismatch via FP16☆183Nov 14, 2025Updated 4 months ago
- EPOCH Input System Version 2☆10Jun 5, 2020Updated 5 years ago
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆61Sep 26, 2023Updated 2 years ago
- ☆22Oct 4, 2023Updated 2 years ago