Training and fine-tuning an LLM in Python and PyTorch.
☆43 · Aug 30, 2023 · Updated 2 years ago
Alternatives and similar repositories for instruct_storyteller_tinyllama2
Users that are interested in instruct_storyteller_tinyllama2 are comparing it to the libraries listed below.
- ☆10 · Nov 7, 2022 · Updated 3 years ago
- A frontend for large language models like 🐨 Koala or 🦙 Vicuna running on CPU with llama.cpp, using the API server library provided by l… ☆15 · May 30, 2023 · Updated 3 years ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023). ☆27 · Aug 25, 2024 · Updated last year
- Supplementary support repository for the book "Learning LLM AI Agent Development Through 7 Projects" ☆17 · Apr 1, 2025 · Updated last year
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip. ☆17 · Feb 5, 2024 · Updated 2 years ago
- PaliGemma Inference and Fine Tuning ☆13 · May 15, 2024 · Updated 2 years ago
- [EMNLP 2024] Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by … ☆16 · Nov 27, 2024 · Updated last year
- ☆17 · Jun 19, 2023 · Updated 3 years ago
- Creates a CMM script that can be executed directly on Kaggle from an easy merge script ☆14 · Mar 6, 2026 · Updated 3 months ago
- Tree-Invent: A novel molecular generative model constrained with topological tree ☆14 · Jul 26, 2023 · Updated 2 years ago
- Qwen2-VL for OCR & VQA ☆19 · Sep 3, 2024 · Updated last year
- Contains source code for the winning solution of the xView3 challenge https://iuu.xview.us/. ☆73 · Mar 5, 2022 · Updated 4 years ago
- Vite + Mantine + Vanilla extract template ☆12 · Jun 10, 2026 · Updated last week
- A Model Agnostic function to directly remove specified layers from the LLM ☆10 · May 23, 2024 · Updated 2 years ago
- [EMNLP 2025 Main] The official repo of MMLU-ProX benchmark. ☆29 · Aug 26, 2025 · Updated 9 months ago
- Example ML projects that use the Determined library. ☆34 · Sep 11, 2024 · Updated last year
- Example Code for OpenAI DevDay Updates ☆18 · Nov 10, 2023 · Updated 2 years ago
- ☆53 · Feb 29, 2024 · Updated 2 years ago
- ☆36 · Feb 21, 2025 · Updated last year
- Script to convert from GGUF format to safetensors ☆40 · May 13, 2025 · Updated last year
- Multi Stopwatch for Python ☆12 · Sep 28, 2019 · Updated 6 years ago
- Build text-to-image generative AI models from scratch with Python and PyTorch. Focus on two methods: Diffusion models, which iteratively … ☆66 · Oct 13, 2025 · Updated 8 months ago
- First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)! ☆13 · Oct 22, 2023 · Updated 2 years ago
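- Chinese large language model evaluation benchmark: six major categories covering more than twenty tasks, with tiered grading; Chinese domestic models reach grade A ☆11 · May 6, 2024 · Updated 2 years ago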
- LangChain tutorial ☆38 · Dec 25, 2025 · Updated 5 months ago
- Minimized version of the Orchis server hosted at https://orchis.cherrymint.live ☆10 · Nov 27, 2023 · Updated 2 years ago
- Eval LLMs ☆11 · May 12, 2024 · Updated 2 years ago
- HEAD-QA: A Healthcare Dataset for Complex Reasoning ☆33 · Feb 15, 2021 · Updated 5 years ago
- This is a C++ implementation of cocoapi bbox evaluation code. ☆11 · Dec 9, 2021 · Updated 4 years ago
- ☆58 · Feb 13, 2022 · Updated 4 years ago
- ☆35 · Mar 25, 2025 · Updated last year
- This repository shows how to implement a basic model for multimodal entailment. ☆10 · Aug 17, 2021 · Updated 4 years ago
- Variational autoencoder implementation in tensorflow following the classic paper by Kingma and Welling. ☆13 · Jul 12, 2017 · Updated 8 years ago
- A clone of Twitter made using React, Firebase ☆13 · May 17, 2021 · Updated 5 years ago
- Llama from scratch, or How to implement a paper without crying ☆579 · May 29, 2024 · Updated 2 years ago
- Small and Efficient Mathematical Reasoning LLMs ☆73 · Jan 27, 2024 · Updated 2 years ago
- An application that brings together several anime streaming platforms ☆12 · Mar 1, 2025 · Updated last year
- Train your own small bitnet model ☆84 · Oct 20, 2024 · Updated last year
- Inference Llama 2 in one file of pure Python ☆424 · Nov 21, 2025 · Updated 6 months ago