IlyaGusev / ping_pong_bench
☆73 Updated this week
Related projects
Alternatives and complementary repositories for ping_pong_bench
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆181 Updated 3 weeks ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆113 Updated 3 weeks ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F… ☆58 Updated 5 months ago
- Low-Rank adapter extraction for fine-tuned transformers model ☆162 Updated 6 months ago
- A pipeline parallel training script for LLMs. ☆83 Updated this week
- Function Calling Benchmark & Testing ☆75 Updated 4 months ago
- ☆94 Updated 2 months ago
- idea: https://github.com/nyxkrage/ebook-groupchat/ ☆82 Updated 3 months ago
- ☆27 Updated last year
- ☆106 Updated 2 months ago
- entropix style sampling + GUI ☆25 Updated 3 weeks ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆112 Updated last year
- ☆53 Updated 5 months ago
- The official evaluation suite and dynamic data release for MixEval. ☆224 Updated 2 weeks ago
- Simple examples using Argilla tools to build AI ☆42 Updated last week
- ☆104 Updated 8 months ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆232 Updated 5 months ago
- An unsupervised model merging algorithm for Transformers-based language models. ☆100 Updated 6 months ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models ☆196 Updated 7 months ago
- ☆64 Updated 5 months ago
- ☆33 Updated 6 months ago
- ☆38 Updated 8 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆77 Updated 7 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆174 Updated 4 months ago
- From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging ☆58 Updated last month
- ☆73 Updated 10 months ago
- ☆118 Updated 3 months ago
- ☆150 Updated 4 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning" ☆46 Updated last month