Smol but mighty language model
☆65Apr 4, 2023Updated 3 years ago
Alternatives and similar repositories for smol-gpt
Users who are interested in smol-gpt are comparing it to the libraries listed below.
- SoTA Transformers with C-backend for fast inference on your CPU.☆312Dec 9, 2023Updated 2 years ago
- Hands-free companionship on demand.☆77Mar 23, 2023Updated 3 years ago
- A fully autonomous AI artist☆19Jun 19, 2023Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper☆119Apr 4, 2023Updated 3 years ago
- Web platform for blackboard-video explanations☆18May 22, 2026Updated 3 weeks ago
- ☆13Jun 18, 2023Updated 2 years ago
- Implementation in the framework of my bachelor thesis: Generative Modelling using Capsule Generative Adversarial Networks☆12Feb 20, 2026Updated 3 months ago
- convert pytorch model to ncnn☆13Dec 5, 2018Updated 7 years ago
- [ICLR 2024 Spotlight] Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Communi…☆12Mar 29, 2024Updated 2 years ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆39Jun 5, 2023Updated 3 years ago
- LLM-powered autonomous agent with hierarchical task management☆50Apr 12, 2023Updated 3 years ago
- Formalization of Statement of Local Langlands Correspondence for Tori☆12Dec 18, 2018Updated 7 years ago
- a nextjs app to implement reading documents using openai (embeddings and chat model), pinecone for vectors store and langchain.☆50Apr 22, 2023Updated 3 years ago
- C++ implementation for 💫StarCoder☆458Sep 9, 2023Updated 2 years ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Apr 9, 2025Updated last year
- Recurrent versus Recursive Approaches Towards Compositionality in Semantic Vector Spaces.☆13Sep 22, 2021Updated 4 years ago
- Code for the paper "Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages" (N…☆17Apr 13, 2025Updated last year
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆32Jun 1, 2023Updated 3 years ago
- The official Languini Kitchen repository☆14May 6, 2024Updated 2 years ago
- Finetuning InstructLLaMA on consumer hardware (copy from https://github.com/tloen/alpaca-lora)☆11Mar 17, 2023Updated 3 years ago
- ☆103Mar 18, 2024Updated 2 years ago
- Sample implementation accompanying the NeurIPS 2019 paper 'Powerset Convolutional Neural Networks' by Chris Wendler, Dan Alistarh, and Ma…☆10Oct 26, 2020Updated 5 years ago
- ☆58Mar 13, 2026Updated 2 months ago
- Coefficient of Variation (CV) and Coefficient of Quartile Variation (CQV) with Confidence Intervals (CI)☆10May 31, 2026Updated last week
- ☆16Jul 20, 2023Updated 2 years ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆22Nov 26, 2022Updated 3 years ago
- Experiments to assess SPADE on different LLM pipelines.☆17Apr 7, 2024Updated 2 years ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development.☆740Sep 18, 2025Updated 8 months ago
- Models and code from Learning to Predict Denotational Probabilities For Modeling Entailment☆14Feb 1, 2018Updated 8 years ago
- A tiny implementation of an autonomous agent powered by LLMs (OpenAI GPT-4)☆441Apr 4, 2023Updated 3 years ago
- Legal Entity Name Understanding☆22Sep 25, 2025Updated 8 months ago
- This description covers: 🏗️ Infrastructure: Terraform + GCP 🔒 Security: private VPC 🌐 Network: GCP Gateway, firewall 🎯 Components: Obs…☆38Oct 21, 2025Updated 7 months ago
- ☆45Jun 2, 2023Updated 3 years ago
- AI that dreams☆22Apr 10, 2023Updated 3 years ago
- Movie Reviews Sentiment Analysis☆13Jun 28, 2018Updated 7 years ago
- Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper.☆18Nov 21, 2025Updated 6 months ago
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll…☆17May 3, 2024Updated 2 years ago
- Tutorial Apps for Learning R☆18Dec 28, 2017Updated 8 years ago
- I clearly unravel how I came to invent the supermanifold hypothesis in deep learning, (a part of a system called 'thought curvature') in …☆21Mar 12, 2023Updated 3 years ago