Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.
β78Oct 20, 2024Updated last year
Alternatives and similar repositories for sft-demos
Users that are interested in sft-demos are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Text clustering: HDBSCAN is probably all you need.β21Sep 5, 2023Updated 2 years ago
- Official code of our work, Representation Learning for Resource-Constrained Keyphrase Generation.β11May 26, 2022Updated 4 years ago
- β14Apr 22, 2024Updated 2 years ago
- [ICML 2024] Temporal Spiking Neural Networks with Synaptic Delay for Graph Reasoningβ11Jun 1, 2024Updated last year
- β13May 15, 2021Updated 5 years ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Lightweight piece tokenization libraryβ12Apr 15, 2024Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challengeβ60Apr 9, 2024Updated 2 years ago
- Reference code base for ML Engineering in Action, Manning Publications Author: Ben Wilsonβ21Oct 22, 2023Updated 2 years ago
- β17Apr 10, 2024Updated 2 years ago
- Locality Sensitive Hashing for semantic similarity (Python 3.x)β15Jun 8, 2018Updated 7 years ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.β12Jan 2, 2025Updated last year
- Multi-Agent LLM System for Digital Scam Protectionβ15Dec 19, 2024Updated last year
- Code for our 2023 IEEE S&P Paper "The Leaky Web: Automated Discovery of Cross-Site Information Leaks in Browsers and the Web"β15Dec 12, 2024Updated last year
- IBM Quantum Challenge Fall 2023β10May 23, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- β16Oct 15, 2024Updated last year
- β12Apr 14, 2023Updated 3 years ago
- A walk through HuggingFace smolagentsβ49Mar 7, 2025Updated last year
- BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions [TMLR2025]β33Jan 13, 2026Updated 4 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)β12Mar 27, 2024Updated 2 years ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.β129Sep 3, 2023Updated 2 years ago
- β15Apr 14, 2026Updated last month
- β25Apr 28, 2025Updated last year
- Twitter auto account report bot using selenium with pythonβ13Apr 19, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- β11Oct 29, 2022Updated 3 years ago
- Draftify is a simple note app to write passing thought and ideas without distractions.β14Mar 9, 2023Updated 3 years ago
- This repository contains resources, documentation and artifacts describing LLM agentsβ15Jan 22, 2025Updated last year
- [SDM24] Official code for "Time-Transformer"β18Sep 30, 2025Updated 7 months ago
- A bagel, with everything.β326Apr 11, 2024Updated 2 years ago
- Learn to code for NLPβ10Jul 20, 2020Updated 5 years ago
- The code, citation & note companion to the AICamp Webinar "Understanding Graph Convolutional Networks" hosted on September 7th, 2021.β11Sep 21, 2021Updated 4 years ago
- In this course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy custβ¦β15Feb 13, 2024Updated 2 years ago
- β14Jul 26, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Modelsβ72Feb 25, 2025Updated last year
- β14Jun 16, 2020Updated 5 years ago
- π€ Telegram chatbot frontend for Searx.β16Nov 25, 2018Updated 7 years ago
- This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimodaβ¦β32Mar 9, 2025Updated last year
- [ACL 2024] On the Multi-turn Instruction Following for Conversational Web Agentsβ17Oct 12, 2024Updated last year
- β20Feb 18, 2025Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMsβ11Jul 22, 2023Updated 2 years ago