CohleM / lilLMLinks

A little(lil) Language Model (LM). A tiny reproduction of LLaMA 3's model architecture.

☆52

Alternatives and similar repositories for lilLM

Users that are interested in lilLM are comparing it to the libraries listed below

Sorting:

ideaweaver-ai / DeepSeek-Children-Stories-15M-model
☆107Updated 5 months ago
fairydreaming / farel-bench
Testing LLM reasoning abilities with family relationship quizzes.
☆63Updated 10 months ago
nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆68Updated last year
chigkim / Ollama-MMLU-Pro
☆107Updated 3 months ago
TC-Zheng / ActuosusAI
AI management tool
☆121Updated last year
janhq / ReZero
☆158Updated 7 months ago
Nikityyy / lille
A powerful 130-million-parameter model trained from scratch as part of a truly open-source stack, including a custom tokenizer, dataset, …
☆68Updated 2 months ago
severian42 / Vodalus-Expert-LLM-Forge
Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …
☆190Updated last year
pranavjad / tinyllama-bitnet
Train your own small bitnet model
☆75Updated last year
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆240Updated last year
adriancable / qwen3.c
Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.
☆146Updated 4 months ago
TheProxyCompany / proxy-structuring-engine
Guaranteed Structured Output from any Language Model via Hierarchical State Machines
☆145Updated last month
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆146Updated 9 months ago
rafacelente / bllama
1.58-bit LLaMa model
☆83Updated last year
sam-paech / antislop-sampler
☆329Updated 4 months ago
QuixiAI / grokadamw
☆136Updated last year
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆179Updated last year
av / klmbr
klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs
☆85Updated last year
arcee-ai / PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
☆257Updated last year
matt-c1 / llama-3-quant-comparison
Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.
☆165Updated last year
VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆202Updated last year
Vaibhavs10 / optimise-my-whisper
☆207Updated last year
anhvth / opensloth
☆229Updated 2 months ago
jd-3d / SOLOBench
☆135Updated 6 months ago
lechmazur / step_game
Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLM…
☆77Updated 3 months ago
jukofyork / transplant-vocab
Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆46Updated last month
rombodawg / Easy_training
☆51Updated 9 months ago
epolewski / EricLLM
A fast batching API to serve LLM models
☆189Updated last year
leafspark / AutoGGUF
automatically quant GGUF models
☆217Updated last month
ideaweaver-ai / Tiny-Children-Stories-30M-model
☆120Updated 5 months ago