Supervised instruction finetuning for LLM with HF trainer and Deepspeed
☆37Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for Funtuner
Users that are interested in Funtuner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reward Model framework for LLM RLHF☆63Jun 7, 2023Updated 3 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Aug 27, 2023Updated 2 years ago
- covid question answering datasets and fine tuned models☆18Apr 27, 2021Updated 5 years ago
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆16Aug 23, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- LLM Building Blocks for Python Course☆17Nov 17, 2025Updated 6 months ago
- A Multi-Model AI Assistant - Chatbot☆11Jul 14, 2023Updated 2 years ago
- Research notes and extra resources for all the work at explodinggradients.com☆27Mar 11, 2025Updated last year
- ☆74Sep 5, 2023Updated 2 years ago
- Layout Analysis Dataset with Segmonto (LADaS)☆25May 29, 2026Updated 2 weeks ago
- Adversarial Training and SFT for Bot Safety Models☆41Apr 18, 2023Updated 3 years ago
- ☆32Jan 1, 2024Updated 2 years ago
- Multipack distributed sampler for fast padding-free training of LLMs☆207Aug 10, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Modular task agnostic training pipeline using LFM2 from Liquid AI with unsloth.☆16Sep 13, 2025Updated 9 months ago
- Scripts, notebooks, and articles about data science in general.☆55Jun 17, 2023Updated 2 years ago
- A OpenAI GPT3 based QnA agent for documents and links☆12Jul 11, 2023Updated 2 years ago
- Community Eventing and Scripting examples☆19Aug 11, 2025Updated 10 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Dec 30, 2023Updated 2 years ago
- ☆19Feb 20, 2023Updated 3 years ago
- ☆30Mar 10, 2024Updated 2 years ago
- Pandas Training © MetaSnake 2022, CC BY-NC☆18Mar 20, 2022Updated 4 years ago
- Set of scripts to finetune LLMs☆38Mar 30, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Genetics for Language Models☆17Jul 1, 2024Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- LobotoMl is a set of scripts and tools to assess production deployments of ML services☆10May 16, 2022Updated 4 years ago
- API and CLI tool to fetch and query Chome DevTools heap snapshots (Python & Playwright)☆16May 16, 2024Updated 2 years ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆37Oct 9, 2025Updated 8 months ago
- A Kubernetes operator for managing Prefect servers and work pools☆17Jun 8, 2026Updated last week
- Fast model deployment on AWS Lambda☆15Feb 25, 2024Updated 2 years ago
- Temporal and Causal Reasoning (dataset)☆10Apr 19, 2022Updated 4 years ago
- An HTTP proxy that naively injects NTLM data for the current user into outgoing requests☆14Nov 14, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Lightweight Gleam library for working with streams☆13Jan 3, 2026Updated 5 months ago
- Python client for the Open eXecution Protocol (OXP)☆17May 16, 2025Updated last year
- Just little bits.☆10Aug 5, 2025Updated 10 months ago
- Pure Pony Postgres client☆18May 30, 2026Updated 2 weeks ago
- Example Fabulous app that uses MSAL to authenticate a user on Azure Active Directory☆11Dec 8, 2022Updated 3 years ago
- Documentation about how to write and maintain a Django reusable app.☆49Dec 29, 2008Updated 17 years ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆51Sep 29, 2024Updated last year