CohleM / lilLM
A little(lil) Language Model (LM)
☆48Updated 2 weeks ago
Alternatives and similar repositories for lilLM:
Users that are interested in lilLM are comparing it to the libraries listed below
- Testing LLM reasoning abilities with family relationship quizzes.☆62Updated 3 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated 2 months ago
- Video+code lecture on building nanoGPT from scratch☆67Updated 10 months ago
- ☆129Updated 8 months ago
- ☆47Updated 2 months ago
- Low-Rank adapter extraction for fine-tuned transformers models☆173Updated last year
- AI management tool☆114Updated 6 months ago
- Train your own small bitnet model☆70Updated 6 months ago
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆341Updated this week
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆152Updated 11 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆199Updated 9 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 6 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆287Updated this week
- ☆288Updated last month
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆237Updated last year
- ☆135Updated 3 weeks ago
- Experimenting with small language models☆67Updated last year
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆154Updated 6 months ago
- 1.58-bit LLaMa model☆81Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆183Updated 9 months ago
- Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget☆150Updated last year
- ☆89Updated 4 months ago
- A pipeline parallel training script for LLMs.☆143Updated last week
- ☆151Updated this week
- ☆69Updated this week
- idea: https://github.com/nyxkrage/ebook-groupchat/☆86Updated 8 months ago
- A fast batching API to serve LLM models☆182Updated last year
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆128Updated last week
- ☆204Updated 11 months ago
- ☆117Updated last month