Experimenting with small language models
☆74Jan 16, 2024Updated 2 years ago
Alternatives and similar repositories for slms
Users that are interested in slms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆18Sep 13, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆12Jan 19, 2024Updated 2 years ago
- 4G GPU & 10 Minutes for train☆12Aug 9, 2023Updated 2 years ago
- Exploring the minimal architecture required for coherent English language generation.☆12Mar 5, 2025Updated last year
- 1.58-bit LLaMa model☆83Apr 3, 2024Updated last year
- A curated collection of prompts for Grok Imagine by xAI☆25Oct 19, 2025Updated 5 months ago
- A python script which convert arff to csv .☆10Jan 17, 2018Updated 8 years ago
- ☆125Jan 9, 2026Updated 2 months ago
- This content gallery loads content progressively using AJAX and uses CSS3 transitions for animation and positioning. The optional web ser…☆13May 7, 2024Updated last year
- A guide to testing different runpod (and other linux VMs) configurations. Specifically the speed of LLM outputs☆17Jan 12, 2024Updated 2 years ago
- A python package for developing AI applications with local LLMs.☆150Jan 4, 2025Updated last year
- Machine learning-powered music generation. Full-featured tokenizer, customization options, and high-quality output files.☆15Feb 3, 2025Updated last year
- Locally running LLM with internet access☆97Jun 30, 2025Updated 8 months ago
- Code for the ACL 2021 paper "Structural Guidance for Transformer Language Models"☆13Sep 17, 2025Updated 6 months ago
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆54Apr 12, 2024Updated last year
- flow-merge is a powerful Python library that enables seamless merging of multiple transformer-based language models using the most popula…☆20Feb 12, 2025Updated last year
- ☆12Dec 14, 2024Updated last year
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 4 years ago
- Simple LLM inference server☆20Jun 13, 2024Updated last year
- A demo of cluade computer use playing minecraft☆23Oct 25, 2024Updated last year
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆27Jun 3, 2025Updated 9 months ago
- A machine learning solution for extracting key entity values (weight, volume, dimensions) from product images.☆18Sep 17, 2024Updated last year
- Gives tone functionality with 10-bit volume control with no extra parts!☆40Nov 25, 2019Updated 6 years ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated last year
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Mar 6, 2026Updated 2 weeks ago
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago
- Source codes of F-HMTC (IJCAI'20)☆17Feb 22, 2021Updated 5 years ago
- A simple message plugin for jQuery☆146Jun 25, 2020Updated 5 years ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆159Feb 9, 2024Updated 2 years ago
- A CNN model that classifies fruit images into 80 different types of fruits.☆12Jan 24, 2020Updated 6 years ago
- Repository of useful 'stuff' for the MineRL BASALT Challenge☆16Mar 21, 2023Updated 3 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆41Aug 29, 2024Updated last year
- Simple Streamlit UI for Ollama☆21May 13, 2024Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago
- A copy of the DirectX Headers from MinGW-64.☆14Sep 7, 2023Updated 2 years ago
- A PyTorch implementation of Vector Quantized Variational Autoencoder (VQ-VAE) with EMA updates, pretrained encoder, and K-means initializ…☆21Dec 31, 2024Updated last year
- Minecraft AI "city" generator using a tensorflow GAN.☆36Sep 22, 2020Updated 5 years ago
- ☆11Jan 28, 2024Updated 2 years ago