Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆78 · Oct 20, 2024 · Updated last year
Alternatives and similar repositories for sft-demos
Users that are interested in sft-demos are comparing it to the libraries listed below.
- Are foundation LMs multilingual knowledge bases? (EMNLP 2023) · ☆19 · Dec 8, 2023 · Updated 2 years ago
- Text clustering: HDBSCAN is probably all you need. · ☆21 · Sep 5, 2023 · Updated 2 years ago
- Official code of our work, Representation Learning for Resource-Constrained Keyphrase Generation. · ☆11 · May 26, 2022 · Updated 3 years ago
- ☆13 · May 15, 2021 · Updated 4 years ago
- Lightweight piece tokenization library · ☆12 · Apr 15, 2024 · Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challenge · ☆60 · Apr 9, 2024 · Updated 2 years ago
- Repo for the Complete Agentic AI Engineering Course · ☆19 · May 16, 2025 · Updated 11 months ago
- Langchain_CrewAI_Gemini - A Gemini-powered AI agent (multi-agent) project. · ☆14 · Mar 24, 2024 · Updated 2 years ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions. · ☆46 · Apr 3, 2025 · Updated last year
- ☆17 · Apr 10, 2024 · Updated 2 years ago
- Locality Sensitive Hashing for semantic similarity (Python 3.x) · ☆15 · Jun 8, 2018 · Updated 7 years ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI. · ☆12 · Jan 2, 2025 · Updated last year
- Multi-Agent LLM System for Digital Scam Protection · ☆15 · Dec 19, 2024 · Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given… · ☆15 · Oct 16, 2023 · Updated 2 years ago
- Image Search Engine with HuggingFace Sentence Transformer · ☆12 · Aug 31, 2023 · Updated 2 years ago
- ☆16 · Oct 15, 2024 · Updated last year
- A walk through HuggingFace smolagents · ☆49 · Mar 7, 2025 · Updated last year
- BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions [TMLR2025] · ☆32 · Jan 13, 2026 · Updated 3 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. · ☆129 · Sep 3, 2023 · Updated 2 years ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs) · ☆12 · Mar 27, 2024 · Updated 2 years ago
- This repository collects keyphrase prediction resources. · ☆29 · Oct 27, 2023 · Updated 2 years ago
- ☆25 · Apr 28, 2025 · Updated last year
- This repository contains resources, documentation and artifacts describing LLM agents · ☆15 · Jan 22, 2025 · Updated last year
- A bagel, with everything. · ☆326 · Apr 11, 2024 · Updated 2 years ago
- This course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy cust… · ☆14 · Feb 13, 2024 · Updated 2 years ago
- Replication package for ESEC/FSE-2019 submission titled Diversity Web Test Generation · ☆15 · Feb 13, 2025 · Updated last year
- Source Code for "Computer Vision Projects with PyTorch" by Akshay Kulkarni, Adarsha Shivananda, and Nitin Ranjan Sharma · ☆26 · Jul 6, 2022 · Updated 3 years ago
- [ACL 2024] On the Multi-turn Instruction Following for Conversational Web Agents · ☆17 · Oct 12, 2024 · Updated last year
- This repository contains the implementation of evaluation metrics for recommendation systems. We have compared similarity, candidate gene… · ☆27 · Feb 21, 2025 · Updated last year
- Helloworld for agentic frameworks, minimal but runnable! LangGraph, Agno, AutoGen, Smolagents, OpenAI Agents, etc. · ☆60 · Aug 8, 2025 · Updated 9 months ago
- ICCV 2019 Tutorial: Global Optimization for Geometric Understanding with Provable Guarantees · ☆15 · Oct 20, 2022 · Updated 3 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. · ☆42 · Dec 15, 2023 · Updated 2 years ago
- A brand-new AI series [Exploring Explainable Artificial Intelligence] · ☆16 · Nov 23, 2025 · Updated 5 months ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes · ☆23 · Jun 15, 2025 · Updated 10 months ago
- ☆49 · Mar 2, 2025 · Updated last year
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training. · ☆12 · Aug 15, 2020 · Updated 5 years ago
- A Seq2Seq with attention and copy mechanism for sentence summarization · ☆13 · Mar 11, 2019 · Updated 7 years ago
- Building reliable Retrieval Augmented Generation (RAG) AI Architecture · ☆13 · Jul 30, 2024 · Updated last year
- An open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) · ☆111 · Mar 7, 2025 · Updated last year