ahmed-moubtahij / TokenHealer
☆21Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for TokenHealer
- entropix style sampling + GUI☆25Updated last week
- ☆34Updated last year
- ☆49Updated 7 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated 10 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆42Updated 7 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆40Updated 8 months ago
- ☆31Updated 10 months ago
- Using modal.com to process FineWeb-edu data☆19Updated 2 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆19Updated 9 months ago
- ☆64Updated 5 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆105Updated last week
- look how they massacred my boy☆53Updated 3 weeks ago
- Full finetuning of large language models without large memory requirements☆93Updated 10 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 5 months ago
- ☆38Updated this week
- Self-hosted LLM chatbot arena, with yourself as the only judge☆36Updated 9 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆59Updated this week
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆44Updated 11 months ago
- Experimental sampler to make LLMs more creative☆30Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆22Updated last month
- ☆40Updated last year
- GPT-2 small trained on phi-like data☆65Updated 8 months ago
- ☆52Updated 5 months ago
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- QLoRA with Enhanced Multi GPU Support☆36Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆52Updated last week
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆11Updated 2 months ago
- Generates grammer files from typescript for LLM generation☆34Updated 8 months ago
- utilities for loading and running text embeddings with onnx☆39Updated 3 months ago