A set of scripts and notebooks on LLM fine-tuning and dataset creation
☆119Sep 27, 2024Updated last year
Alternatives and similar repositories for llm_recipes
Users that are interested in llm_recipes are comparing it to the libraries listed below.
- ☆45Oct 13, 2023Updated 2 years ago
- A miniature AI training framework for PyTorch☆43Feb 1, 2025Updated last year
- Various transformers for FSDP research☆38Nov 11, 2022Updated 3 years ago
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆161Sep 26, 2023Updated 2 years ago
- Build modern UIs in Jupyter with Python☆12Dec 28, 2022Updated 3 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆80May 30, 2026Updated 2 weeks ago
- A chat implementation for FastHTML☆12Sep 14, 2025Updated 9 months ago
- Useful LLM contexts ready to be used in AIMagic☆32Apr 6, 2026Updated 2 months ago
- Writing Blog Posts with Generative Feedback Loops!☆51Mar 19, 2024Updated 2 years ago
- ☆25Jun 2, 2026Updated last week
- Weights & Biases Addons is a repository consisting of additional utilities and community contributions for supercharging your Weights &…☆23Jan 2, 2024Updated 2 years ago
- a local chatbot API dockerized for CPU deployment☆22Aug 28, 2025Updated 9 months ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- A curated list of resources to help with computational research.☆21Jun 11, 2022Updated 4 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Sep 22, 2024Updated last year
- Datasets and code from our paper, where we use machine learning to predict if ChatGPT will refuse a given prompt.☆38Sep 23, 2023Updated 2 years ago
- ☆15Nov 3, 2022Updated 3 years ago
- Score LLM pretraining data with classifiers☆55Nov 2, 2023Updated 2 years ago
- LLM Workshop by Sourab Mangrulkar☆401Jun 16, 2024Updated last year
- Samples for use with MLOps☆13Jul 6, 2023Updated 2 years ago
- A collection of optimizers, some arcane others well known, for Flax.☆29Aug 6, 2021Updated 4 years ago
- Fastai community entry to 2020 Reproducibility Challenge☆17Oct 20, 2022Updated 3 years ago
- LoRA and DoRA from Scratch Implementations☆222Mar 5, 2024Updated 2 years ago
- Zunda: Japanese Enhanced Modality Analyzer client for Python.☆10Nov 30, 2019Updated 6 years ago
- Automatically evaluate your LLMs in Google Colab☆686May 7, 2024Updated 2 years ago
- A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆17Dec 1, 2023Updated 2 years ago
- An advanced distributed knowledge fabric for intelligent document processing, featuring multi-document agents, optimized query handling, …☆52Mar 25, 2026Updated 2 months ago
- ☆11Oct 2, 2024Updated last year
- Manifold-Mixup implementation for fastai V2☆17Oct 1, 2020Updated 5 years ago
- Comprehensive analysis of the difference in performance of QLoRA, LoRA, and full fine-tunes.☆83Sep 10, 2023Updated 2 years ago
- A collection of utilities for FastHTML projects.☆14Oct 23, 2024Updated last year
- Estimate similarity of medical concepts based on Unified Medical Language System (UMLS)☆16Jan 17, 2022Updated 4 years ago
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling☆14Sep 27, 2025Updated 8 months ago
- Library to make MongoDB aggregation framework and pipelines easy to use in python.☆22Oct 31, 2025Updated 7 months ago
- course.fast.ai 2022 part 2☆522Apr 28, 2024Updated 2 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆57Jan 1, 2021Updated 5 years ago
- A knowledge base bridging theory with real-world applications.☆22Mar 23, 2026Updated 2 months ago
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Nov 1, 2023Updated 2 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Jun 24, 2022Updated 3 years ago