Low memory full parameter finetuning of LLMs
☆54Jul 18, 2025Updated 11 months ago
Alternatives and similar repositories for lowmem_finetuning
Users that are interested in lowmem_finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Project code for training LLMs to write better unit tests + code☆22May 19, 2025Updated last year
- ☆35Nov 11, 2025Updated 7 months ago
- Implementation of <Model Merging with Functional Dual Anchors>☆47Nov 23, 2025Updated 7 months ago
- RWKV-LM-V7(https://github.com/BlinkDL/RWKV-LM) Under Lightning Framework☆61May 13, 2026Updated last month
- ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture☆27Feb 3, 2026Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆49Apr 2, 2026Updated 2 months ago
- Aplicação em Python para Optical Character Recognition (OCR), uma técnica para extrair textos em imagens. Adicionalmente, o programa tent…☆12Aug 13, 2021Updated 4 years ago
- Convert MathML to Latex for OneNote to Markdown☆13Mar 17, 2026Updated 3 months ago
- Codebase from our first release.☆58Feb 17, 2026Updated 4 months ago
- Aulas de conceitos básicos de Processamento de Linguagem Natural oferecida no Discord aberto no Turing USP☆10Jul 30, 2021Updated 4 years ago
- Implementation of the Adaptive Resonance Theory (ART) architectures - Fuzzy ART and Fuzzy ARTMAP - for pattern recognition☆11Jan 6, 2019Updated 7 years ago
- Interactive brokers integration for live trading using Rob Carver's pysystem trade backtester.☆10May 15, 2018Updated 8 years ago
- ComfyUI custom nodes for Suno — generate, remix, extend, and shape AI music inside ComfyUI via the muapi.ai API.☆17Apr 30, 2026Updated 2 months ago
- ☆49Mar 31, 2026Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Python scripts to read a Portuguese Wikipedia XML dump file, parse it and generate plain text files.☆14Mar 12, 2014Updated 12 years ago
- ☆92Jun 15, 2026Updated 2 weeks ago
- Simple repository for training small reasoning models☆52Feb 17, 2026Updated 4 months ago
- Hands-On Data Analytics for Beginners with Google Colaboratory [Video], published by Packt☆18Jan 15, 2021Updated 5 years ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated 2 months ago
- we have ai at home☆115Jun 18, 2026Updated last week
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge☆17Nov 16, 2021Updated 4 years ago
- Coding with ChatGPT and other LLMs, published by Packt☆16Dec 9, 2024Updated last year
- ☆10Sep 7, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Quickest way to share everything about your research within a single app☆16Feb 1, 2024Updated 2 years ago
- ☆27Sep 10, 2025Updated 9 months ago
- Source code for the AI2 Reasoning Challenge (ARC) submission.☆16Dec 8, 2022Updated 3 years ago
- This course is published by Packt Publishing☆23Aug 2, 2023Updated 2 years ago
- Proposed plumbing commands for cargo☆25Jun 1, 2026Updated last month
- [Re-implementation] FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence☆15Jun 29, 2020Updated 6 years ago
- ☆18Nov 18, 2021Updated 4 years ago
- Inference code for LLaMA models☆21Apr 3, 2025Updated last year
- Brazilian Tertiary Care Dataset☆17Dec 14, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Leveraging☆13Dec 7, 2023Updated 2 years ago
- Verifiers for LLM Reinforcement Learning☆80Apr 15, 2025Updated last year
- Get insights from your research papers with LlamaExtract☆29Aug 8, 2025Updated 10 months ago
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO☆12Feb 27, 2025Updated last year
- A Test Collection of Computer Science Papers for Faceted Query by Example☆23Nov 28, 2021Updated 4 years ago
- Nexusflow function call, tool use, and agent benchmarks.☆29Dec 13, 2024Updated last year
- Charlson Comorbidity Index Regression using Clinical Notes☆10Jul 26, 2018Updated 7 years ago