Smol but mighty language model
☆65Apr 4, 2023Updated 2 years ago
Alternatives and similar repositories for smol-gpt
Users that are interested in smol-gpt are comparing it to the libraries listed below
Sorting:
- SoTA Transformers with C-backend for fast inference on your CPU.☆311Dec 9, 2023Updated 2 years ago
- Formalization of Statement of Local Langlands Correspondence for Tori☆12Dec 18, 2018Updated 7 years ago
- Hands-free companionship on demand.☆77Mar 23, 2023Updated 2 years ago
- A multi-agent mind implemented using LLMs engaged in ongoing conversation☆25Mar 1, 2023Updated 3 years ago
- Implementation in the framework of my bachelor thesis: Generative Modelling using Capsule Generative Adversarial Networks☆12Feb 20, 2026Updated 2 weeks ago
- ChatGPT Participates in a Computer Science Exam (2023)☆31Mar 21, 2023Updated 2 years ago
- Finetuning InstructLLaMA on consumer hardware (copy from https://github.com/tloen/alpaca-lora)☆11Mar 17, 2023Updated 2 years ago
- A fully autonomous AI artist☆19Jun 19, 2023Updated 2 years ago
- convert pytorch model to ncnn☆13Dec 5, 2018Updated 7 years ago
- ☆15Feb 5, 2019Updated 7 years ago
- ☆13Jun 18, 2023Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper☆119Apr 4, 2023Updated 2 years ago
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll…☆17May 3, 2024Updated last year
- ☆21Mar 13, 2023Updated 2 years ago
- Techniques used to run BLOOM at inference in parallel☆37Oct 21, 2022Updated 3 years ago
- Experiments to assess SPADE on different LLM pipelines.☆17Apr 7, 2024Updated last year
- NHS England PhD Internship Projects Pages☆19Oct 3, 2025Updated 5 months ago
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development☆21Jul 24, 2023Updated 2 years ago
- Generation of images and bios using StyleGAN and RNN architectures☆19May 15, 2019Updated 6 years ago
- ☆20Jul 11, 2024Updated last year
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆22Nov 26, 2022Updated 3 years ago
- C++ implementation for 💫StarCoder☆459Sep 9, 2023Updated 2 years ago
- GitHub "AI-Brain-of-Brains" created from (11,400+) hand picked GitHub Repos, Providing advanced search capability for Repos with specific…☆23Oct 4, 2018Updated 7 years ago
- My personal web page☆11Feb 17, 2026Updated 2 weeks ago
- ☆26Jul 11, 2022Updated 3 years ago
- I clearly unravel how I came to invent the supermanifold hypothesis in deep learning, (a part of a system called 'thought curvature') in …☆20Mar 12, 2023Updated 2 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Nov 29, 2021Updated 4 years ago
- AI that dreams☆22Apr 10, 2023Updated 2 years ago
- ☆23Sep 27, 2024Updated last year
- The World's Most Difficult video game☆32Dec 24, 2025Updated 2 months ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Mar 30, 2023Updated 2 years ago
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆23Oct 12, 2024Updated last year
- Drift detection module for machine learning pipelines.☆24Jun 21, 2023Updated 2 years ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development.☆739Sep 18, 2025Updated 5 months ago
- ☆102Mar 18, 2024Updated last year
- mlctl is the control plane for MLOps. It provides a CLI and a Python SDK for supporting key operations related to MLOps, such as "model t…☆25Aug 23, 2021Updated 4 years ago
- A paper describing the implementation of PySR and SymbolicRegression.jl☆66Feb 5, 2024Updated 2 years ago
- experiments with inference on llama☆103Jun 6, 2024Updated last year
- A tool for benchmarking LLMs on Modal☆49Aug 29, 2025Updated 6 months ago