☆39Aug 1, 2025Updated 7 months ago
Alternatives and similar repositories for axolotl-cookbook
Users that are interested in axolotl-cookbook are comparing it to the libraries listed below
Sorting:
- Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…☆14May 4, 2024Updated last year
- ☆39Aug 4, 2025Updated 7 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 5 months ago
- [EMNLP 2024] Multi-modal reasoning problems via code generation.☆28Feb 5, 2025Updated last year
- ☆19Mar 16, 2025Updated last year
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Jan 9, 2025Updated last year
- simple prompt script to convert hf/ggml files to gguf, and to quantize☆20Sep 25, 2023Updated 2 years ago
- ☆11Sep 19, 2025Updated 6 months ago
- Dataset of personal narratives with Advice Seeking Questions☆15May 10, 2019Updated 6 years ago
- Batch processing using joblib including tqdm progress bars☆20Dec 29, 2021Updated 4 years ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Oct 9, 2024Updated last year
- ☆15Aug 5, 2023Updated 2 years ago
- Clue inspired puzzles for testing LLM deduction abilities☆46Updated this week
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆18Sep 9, 2022Updated 3 years ago
- Deep Autoencoding Predictive Components☆10Mar 4, 2021Updated 5 years ago
- ☆16Feb 22, 2025Updated last year
- [ECCV 2022] The official experimental code of "Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives"☆32Jul 22, 2022Updated 3 years ago
- Code for experiments on transformers using Markovian data.☆22Nov 22, 2024Updated last year
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆65Feb 19, 2026Updated last month
- All-in-one environment to use Dria, the collective knowledge for AI.☆14Mar 15, 2024Updated 2 years ago
- ToolAgents is a lightweight and flexible framework for creating function-calling agents with various language models and APIs.☆29Updated this week
- ☆15Aug 11, 2022Updated 3 years ago
- A framework for evaluating function calls made by LLMs☆40Jul 23, 2024Updated last year
- Python SDK for FirstBatch: Real-time personalization using vectorDBs☆17Nov 26, 2023Updated 2 years ago
- Notebooks on ML AI experiments☆29Jan 3, 2026Updated 2 months ago
- Deep-RL algorithm Implementations using Pytorch☆16Jun 2, 2023Updated 2 years ago
- An automation platform for graphically modeled workflows. Focus on network automation. Open Source under Apache License.☆11Nov 13, 2025Updated 4 months ago
- A Multilingual Dataset For Cross-lingual News Recommendation☆21Mar 27, 2024Updated last year
- ☆19Mar 25, 2025Updated 11 months ago
- Waffer-thin FlaskGPT on Vercel.☆12Jun 1, 2023Updated 2 years ago
- ☆35Sep 22, 2025Updated 5 months ago
- High performance GPT-OSS MLX implementation☆37Aug 6, 2025Updated 7 months ago
- The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models''☆112Aug 15, 2025Updated 7 months ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated 11 months ago
- Chain-of-thought 방식을 활용하여 llama2를 fine-tuning☆10Nov 18, 2023Updated 2 years ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆41Apr 4, 2025Updated 11 months ago
- ☆137Mar 20, 2025Updated last year
- Numbeo Unofficial API☆15Oct 16, 2022Updated 3 years ago
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 2 years ago