fairydreaming / farel-bench
Testing LLM reasoning abilities with family relationship quizzes.
☆62 Updated 3 months ago
Alternatives and similar repositories for farel-bench:
Users who are interested in farel-bench are comparing it to the repositories listed below.
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆139 Updated 2 months ago
- Low-Rank adapter extraction for fine-tuned transformers models ☆173 Updated last year
- ☆129 Updated 8 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs. ☆42 Updated 11 months ago
- ☆66 Updated 11 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs ☆65 Updated last week
- 1.58-bit LLaMa model ☆81 Updated last year
- ☆115 Updated 3 weeks ago
- ☆112 Updated 4 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2. ☆152 Updated 11 months ago
- Distributed Inference for mlx LLm ☆89 Updated 9 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆198 Updated 9 months ago
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati… ☆41 Updated 2 months ago
- Train your own SOTA deductive reasoning model ☆91 Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆98 Updated last month
- A pipeline parallel training script for LLMs. ☆139 Updated this week
- Video+code lecture on building nanoGPT from scratch ☆66 Updated 10 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining. ☆26 Updated last month
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 Updated 11 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆119 Updated last year
- Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget ☆149 Updated last year
- entropix style sampling + GUI ☆26 Updated 6 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs ☆71 Updated 7 months ago
- run ollama & gguf easily with a single command ☆50 Updated 11 months ago
- Simple examples using Argilla tools to build AI ☆52 Updated 5 months ago
- Easily view and modify JSON datasets for large language models ☆75 Updated 2 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆58 Updated this week
- A fast batching API to serve LLM models ☆182 Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction' ☆237 Updated 11 months ago
- ☆153 Updated 9 months ago