☆30Sep 11, 2024Updated last year
Alternatives and similar repositories for LLM-finetuning
Users that are interested in LLM-finetuning are comparing it to the libraries listed below.
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated 11 months ago
- Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectu…☆26Aug 7, 2024Updated last year
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed☆21May 27, 2024Updated last year
- Official implementation of the paper "Neural Honeytrace: A Robust Plug-and-Play Watermarking Framework against Model Extraction Attacks"☆20Jun 9, 2025Updated 10 months ago
- ☆11Apr 6, 2024Updated 2 years ago
- ☆156Updated this week
- Python parser for generating descriptive graphs from Natural Bond Orbital data ready for use in Graph Neural Networks.☆13Feb 23, 2026Updated last month
- Falcon is a powerful, interpreted programming language.☆17Jan 22, 2023Updated 3 years ago
- An automated data pipeline scaling RL to pretraining levels☆75Oct 11, 2025Updated 6 months ago
- Unofficial Implementation of Evolutionary Model Merging☆41Mar 28, 2024Updated 2 years ago
- ☆20Feb 3, 2026Updated 2 months ago
- ☆12Feb 22, 2023Updated 3 years ago
- Python library for building and running distributed data pipelines using Ray☆61Mar 12, 2026Updated last month
- An end to end ML project. Using MLflow for experiment tracking and model registry. Prefect for workflow orchestration. S3 for artifacts s…☆12Sep 11, 2022Updated 3 years ago
- Chrome browser extension that adds customizable keyboard shortcuts to the Overleaf online latex editor.☆13Jun 22, 2024Updated last year
- [🎖️1st place (Minister's Award) solution] 2022 National Institute of Korean Language AI Language Proficiency Evaluation (aspect-based sentiment analysis of shopping mall review data)☆11Jun 6, 2023Updated 2 years ago
- A PyTorch native library for large model training☆25Apr 1, 2026Updated 2 weeks ago
- Toonification of real face images using PyTorch, Stylegan2 and Image-to-Image translation☆13Jun 14, 2022Updated 3 years ago
- ☆27Feb 10, 2024Updated 2 years ago
- Open source agentic AI CAD generation built on OpenSCAD☆18Jun 5, 2024Updated last year
- ☆29Feb 3, 2026Updated 2 months ago
- Expanded KR-BERT by adding more training data☆13Apr 23, 2021Updated 4 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆23Mar 15, 2024Updated 2 years ago
- Workshop for making a Telegram bot and using APIs.☆11May 20, 2025Updated 10 months ago
- Provides a metric for easily finding sentences with similar meanings, using a pretrained language model (from Hugging Face)☆13Jul 6, 2023Updated 2 years ago
- ☆10Apr 30, 2025Updated 11 months ago
- Fast and memory efficient PyTorch implementation of the Perceiver with FlashAttention.☆33Nov 4, 2024Updated last year
- Image Segmentation On Custom Dataset Using YOLOv8☆19Jan 12, 2023Updated 3 years ago
- Official Repo for python-vcon and py-vcon-server Python packages☆16Apr 9, 2026Updated last week
- ☆44Oct 13, 2023Updated 2 years ago
- The project aims to detect ships in million pixels satellite images using different object detection algorithms. This makes use of variou…☆15Jun 28, 2020Updated 5 years ago
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆32Nov 16, 2024Updated last year
- Medical records you can copy and paste☆12Mar 3, 2023Updated 3 years ago
- ☆15Apr 1, 2024Updated 2 years ago
- Extract structure-functions from data using XAI and LLMs☆27Jan 20, 2025Updated last year
- [WIP] Code for LangToMo☆20Mar 19, 2026Updated 3 weeks ago
- ☆124Apr 7, 2026Updated last week
- ☆14Feb 28, 2025Updated last year
- A Django app to capture OAuth2 tokens for non-authentication purposes, enabling your application to act on behalf of users across externa…☆13Feb 23, 2026Updated last month