Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆233Oct 31, 2024Updated last year
Alternatives and similar repositories for TPU-Alignment
Users that are interested in TPU-Alignment are comparing it to the libraries listed below
Sorting:
- Low-Rank adapter extraction for fine-tuned transformers models☆180May 2, 2024Updated last year
- alternative way to calculating self attention☆18May 25, 2024Updated last year
- ☆50Mar 14, 2024Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction'☆240May 26, 2024Updated last year
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- Collection of autoregressive model implementation☆85Updated this week
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆23May 6, 2025Updated 9 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆270Jan 10, 2026Updated last month
- An unsupervised model merging algorithm for Transformers-based language models.☆108Apr 29, 2024Updated last year
- An innovative library for efficient LLM inference via low-bit quantization☆352Aug 30, 2024Updated last year
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆317Jul 13, 2025Updated 7 months ago
- ☆167Aug 8, 2025Updated 6 months ago
- ☆338Jul 28, 2025Updated 7 months ago
- ☆596Aug 23, 2024Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,100Feb 16, 2026Updated last week
- A bagel, with everything.☆326Apr 11, 2024Updated last year
- [WIP] Transformer to embed Danbooru labelsets☆13Mar 31, 2024Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Sep 10, 2023Updated 2 years ago
- Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.☆81Feb 5, 2024Updated 2 years ago
- Training LLMs with QLoRA + FSDP☆1,536Nov 9, 2024Updated last year
- Modeling code for a BitNet b1.58 Llama-style model.☆25Apr 30, 2024Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Jan 7, 2026Updated last month
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,903Updated this week
- Large-scale LLM inference engine☆1,658Feb 17, 2026Updated last week
- A comfyui typescript client for the bun runtime☆16Nov 24, 2025Updated 3 months ago
- ☆53Jan 9, 2024Updated 2 years ago
- batched loras☆349Sep 6, 2023Updated 2 years ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆91Feb 27, 2024Updated 2 years ago
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆355Jul 29, 2024Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100s☆724Oct 11, 2023Updated 2 years ago
- Entropy Based Sampling and Parallel CoT Decoding☆3,431Nov 13, 2024Updated last year
- Robust recipes to align language models with human and AI preferences☆5,506Sep 8, 2025Updated 5 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- Go ahead and axolotl questions☆11,335Updated this week
- Entropy Based Sampling and Parallel CoT Decoding☆17Oct 9, 2024Updated last year
- Extract a single expert from a Mixture Of Experts model using slerp interpolation.☆19May 26, 2024Updated last year
- a version of baby agi using dspy and typed predictors☆16Mar 9, 2024Updated last year
- Full stack advanced chatbot over LlamaIndex.TS documentation with preview feature using Multi-documents-agents, bootstrapped with create-…☆156Mar 10, 2024Updated last year
- ☆17Feb 16, 2024Updated 2 years ago