apehex / tokun
Tokun to can tokens
☆15 Updated last week
Related projects
Alternatives and complementary repositories for tokun
- ☆41 Updated 2 weeks ago
- High-level library for batched embeddings generation, blazingly fast web-based RAG and quantized indexes processing ⚡ ☆62 Updated 3 weeks ago
- Repo hosting code and materials related to speeding up LLM inference using token merging. ☆29 Updated 6 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆20 Updated 9 months ago
- ☆94 Updated 2 months ago
- QLoRA with Enhanced Multi GPU Support ☆36 Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 Updated 8 months ago
- Cerule - A Tiny Mighty Vision Model ☆67 Updated 2 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆113 Updated 3 weeks ago
- An implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated 10 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 4 months ago
- ☆20 Updated last year
- ☆49 Updated 8 months ago
- Using modal.com to process FineWeb-edu data ☆19 Updated 2 months ago
- Aana SDK is a powerful framework for building AI-enabled multimodal applications. ☆34 Updated this week
- QLoRA for Masked Language Modeling ☆20 Updated last year
- An introduction to LLM Sampling ☆65 Updated 2 weeks ago
- Chat Markup Language conversation library ☆54 Updated 10 months ago
- Comprehensive analysis of the differences in performance of QLoRA, LoRA, and full finetunes. ☆81 Updated last year
- ☆22 Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ☆51 Updated this week
- ☆31 Updated 10 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆41 Updated 8 months ago
- Utilities for loading and running text embeddings with ONNX ☆39 Updated 3 months ago
- Google TPU optimizations for transformers models ☆75 Updated this week
- entropix-style sampling + GUI ☆25 Updated 3 weeks ago
- Set of scripts to finetune LLMs ☆36 Updated 7 months ago
- Collection of autoregressive model implementations ☆67 Updated this week
- ☆45 Updated 2 months ago
- ☆62 Updated 2 months ago