Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.
β78Oct 20, 2024Updated last year
Alternatives and similar repositories for sft-demos
Users that are interested in sft-demos are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Are foundation LMs multilingual knowledge bases? (EMNLP 2023)β19Dec 8, 2023Updated 2 years ago
- Official code of our work, Representation Learning for Resource-Constrained Keyphrase Generation.β11May 26, 2022Updated 4 years ago
- Lightweight piece tokenization libraryβ12Apr 15, 2024Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challengeβ61Apr 9, 2024Updated 2 years ago
- Repo for the Complete Agentic AI Engineering Courseβ19May 16, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Reference code base for ML Engineering in Action, Manning Publications Author: Ben Wilsonβ21Oct 22, 2023Updated 2 years ago
- β17Apr 10, 2024Updated 2 years ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.β12Jan 2, 2025Updated last year
- Multi-Agent LLM System for Digital Scam Protectionβ15Dec 19, 2024Updated last year
- β11Nov 15, 2020Updated 5 years ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" givenβ¦β15Oct 16, 2023Updated 2 years ago
- Image Search Engine with HuggingFace Sentence Transformerβ12Aug 31, 2023Updated 2 years ago
- Code for our 2023 IEEE S&P Paper "The Leaky Web: Automated Discovery of Cross-Site Information Leaks in Browsers and the Web"β16Dec 12, 2024Updated last year
- β14Dec 25, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official repository for the paper "Flow Equivariant Recurrent Neural Networks"β35Jul 2, 2025Updated 11 months ago
- β12Apr 14, 2023Updated 3 years ago
- BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions [TMLR2025]β33Jan 13, 2026Updated 5 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)β12Mar 27, 2024Updated 2 years ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.β129Sep 3, 2023Updated 2 years ago
- This repository is to collect keyphrase prediction resource.β29Oct 27, 2023Updated 2 years ago
- β16Apr 14, 2026Updated 2 months ago
- β11Oct 29, 2022Updated 3 years ago
- This repository contains resources, documentation and artifacts describing LLM agentsβ15Jan 22, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [SDM24] Official code for "Time-Transformer"β18Sep 30, 2025Updated 8 months ago
- A bagel, with everything.β326Apr 11, 2024Updated 2 years ago
- Learn to code for NLPβ10Jul 20, 2020Updated 5 years ago
- In this course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy custβ¦β15Feb 13, 2024Updated 2 years ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Modelsβ73Feb 25, 2025Updated last year
- Portable TLauncher Minecraft Launcherβ14Dec 29, 2023Updated 2 years ago
- β14Jun 16, 2020Updated 6 years ago
- Source Code for "Computer Vision Projects with PyTorch" by Akshay Kulkarni, Adarsha Shivananda, and Nitin Ranjan Sharmaβ26Jul 6, 2022Updated 3 years ago
- Memoria is a human-inspired memory architecture for neural networks.β88Oct 20, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimodaβ¦β32Mar 9, 2025Updated last year
- β11Aug 8, 2018Updated 7 years ago
- QLoRA: Efficient Finetuning of Quantized LLMsβ11Jul 22, 2023Updated 2 years ago
- an end-to-end data pipeline extracting music listening habits and producing an insightful dashboardβ18Mar 31, 2024Updated 2 years ago
- This repository contains the implementation of evaluation metrics for recommendation systems. We have compared similarity, candidate geneβ¦β27Feb 21, 2025Updated last year
- [NAACL'22-Findings] Dataset for "Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training"β18Sep 21, 2022Updated 3 years ago
- ICCV 2019 Tutorial: Global Optimization for Geometric Understanding with Provable Guaranteesβ15Oct 20, 2022Updated 3 years ago