A Mechanistic‑Interpretability study that finds the structural dynamics of Large Language Models under fine‑tuning.
☆16May 30, 2025Updated 10 months ago
Alternatives and similar repositories for FinetuneCircuits
Users that are interested in FinetuneCircuits are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Mixture of Lora Experts☆10Apr 7, 2024Updated 2 years ago
- Official repository for Activation-Informed Merging (AIM) of Large Language Models☆22Feb 10, 2025Updated last year
- Official implementation of "Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent".☆22May 23, 2025Updated 10 months ago
- Zynq 7020 移植Threadx 官方例程 (来自expresslogic.sharefile.com)☆11Mar 11, 2022Updated 4 years ago
- ☆22Feb 13, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- robotic mower software☆14May 19, 2015Updated 10 years ago
- ☆27Oct 12, 2024Updated last year
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated last year
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- HKU DASC 7606 Assignment 1 (Computer Vision), 2023-24 Spring☆18Feb 25, 2024Updated 2 years ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆17Feb 13, 2025Updated last year
- ☆19Jul 10, 2023Updated 2 years ago
- The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""☆49Oct 1, 2025Updated 6 months ago
- [ICML 2025] EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning☆16May 24, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Python A novel multimodal approach for hybridbrain-computer interface☆20May 25, 2020Updated 5 years ago
- [FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi…☆28Feb 7, 2023Updated 3 years ago
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling☆36Jul 12, 2024Updated last year
- ☆17Feb 4, 2025Updated last year
- LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration☆11Mar 11, 2024Updated 2 years ago
- Multi-dimensional analysis of orthogonal safety directions in LLM alignment☆22Mar 20, 2025Updated last year
- SemEval2026 Task 3 DimABSA☆31Mar 13, 2026Updated last month
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆15Jul 24, 2023Updated 2 years ago
- ☆12Oct 20, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Apr 22, 2024Updated last year
- Official code release for BOLD5000 Release 2.0☆34Oct 31, 2021Updated 4 years ago
- CORDIC-SNN, followed with "Unsupervised learning of digital recognition using STDP" published in 2015, frontiers☆25Feb 9, 2020Updated 6 years ago
- [ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models☆61Mar 9, 2026Updated last month
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)☆23Oct 2, 2025Updated 6 months ago
- Research on the Construction and Application of Paraphrase Parallel Corpus☆11Oct 26, 2020Updated 5 years ago
- 解压缩<时光印记>软件中的数据☆17Sep 24, 2021Updated 4 years ago
- 面向对象与多线程课程 Database of Documents☆14Nov 27, 2020Updated 5 years ago
- ☆10May 26, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Syphus: Automatic Instruction-Response Generation Pipeline☆14Dec 14, 2023Updated 2 years ago
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304☆14Oct 11, 2022Updated 3 years ago
- ☆34May 12, 2024Updated last year
- GeneSis is the first generative approach for lexical substitution (EMNLP 2021).☆13Jul 25, 2023Updated 2 years ago
- 多语言降噪预训练模型MBart的中文生成任务☆11May 27, 2021Updated 4 years ago
- Re-implementation of the StylEx paper, training a GAN to explain a classifier in StyleSpace, paper by Lang et al. (2021).☆38Dec 2, 2023Updated 2 years ago
- A simple network library powered by epoll and proactor pattern.☆12May 28, 2022Updated 3 years ago