Training and Fine-tuning an llm in Python and PyTorch.
☆43Aug 30, 2023Updated 2 years ago
Alternatives and similar repositories for instruct_storyteller_tinyllama2
Users that are interested in instruct_storyteller_tinyllama2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- generative models on toys☆12Sep 10, 2024Updated last year
- This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)…☆74Oct 1, 2023Updated 2 years ago
- Code and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models …☆11Apr 27, 2024Updated last year
- ☆10Nov 7, 2022Updated 3 years ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆26Aug 25, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.☆29Oct 18, 2024Updated last year
- This Repo Contains Script To Fine Tune Open Source Models Using Unsloth by using UI with simple click and progress☆11Oct 3, 2024Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆11Sep 4, 2025Updated 6 months ago
- All code related to medium articles☆20Mar 11, 2026Updated 2 weeks ago
- ☆25Jul 10, 2023Updated 2 years ago
- Template repo for Python projects, especially those focusing on machine learning and/or deep learning.☆15Jan 14, 2026Updated 2 months ago
- ☆17Jun 19, 2023Updated 2 years ago
- Unofficial reimplementation of ViR: Vision Retention Networks by Hatamizadeh et. al. (https://arxiv.org/abs/2310.19731)☆18Jul 26, 2024Updated last year
- A copy of the DirectX Headers from MinGW-64.☆14Sep 7, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Vite + Mantine + Vanilla extract template☆12Mar 14, 2026Updated 2 weeks ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- [EMNLP 2025 Main] The official repo of MMLU-ProX benchmark.☆27Aug 26, 2025Updated 7 months ago
- First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)!☆13Oct 22, 2023Updated 2 years ago
- PySOM - The Simple Object Machine Smalltalk implemented in Python☆19Aug 19, 2025Updated 7 months ago
- ☆13Mar 11, 2018Updated 8 years ago
- Download TikTok videos online with TikTok Video Downloader. Completely free.☆13Sep 17, 2025Updated 6 months ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- 랭체인 튜토리얼☆34Dec 25, 2025Updated 3 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Minimized version of the Orchis server hosted at https://orchis.cherrymint.live☆10Nov 27, 2023Updated 2 years ago
- Eval LLMs☆11May 12, 2024Updated last year
- Convert datasets from Hugging Face to FiftyOne for Visualization☆11Mar 15, 2024Updated 2 years ago
- This is a C++ implementation of cocoapi bbox evaluation code.☆11Dec 9, 2021Updated 4 years ago
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- A clone of Twitter made using React, Firebase☆13May 17, 2021Updated 4 years ago
- Llama from scratch, or How to implement a paper without crying☆582May 29, 2024Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆73Jan 27, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- An application that brings together several anime streaming platforms☆11Mar 1, 2025Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- A collection of notebooks aiding the understanding of machine-learning papers.☆10Apr 5, 2021Updated 4 years ago
- This project sets up a real-time data pipeline utilizing Change Data Capture (CDC) to stream changes from a PostgreSQL database to a Clic…☆12May 9, 2024Updated last year
- Train your own small bitnet model☆78Oct 20, 2024Updated last year
- Inference Llama 2 in one file of pure Python☆426Nov 21, 2025Updated 4 months ago
- FastKit Core is a lightweight toolkit that adds structure and common patterns to FastAPI.☆35Feb 12, 2026Updated last month