QuixiAI / dolphin-utils
☆15 Updated last month
Alternatives and similar repositories for dolphin-utils
Users who are interested in dolphin-utils are comparing it to the repositories listed below
- ☆107 Updated 3 months ago
- All the world is a play, we are but actors in it. ☆49 Updated 6 months ago
- 🤖 Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architectu… ☆27 Updated 6 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI … ☆56 Updated 11 months ago
- ☆17 Updated last year
- MLX-based QA pair generator and LLM finetuning tool in Streamlit ☆42 Updated 3 months ago
- entropix style sampling + GUI ☆27 Updated last year
- ☆119 Updated last year
- ☆30 Updated last year
- ☆159 Updated last month
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you? ☆23 Updated last year
- Luth is a state-of-the-art series of fine-tuned LLMs for French ☆41 Updated 3 months ago
- Marketplace ML experiment - training without backprop ☆27 Updated 4 months ago
- ☆62 Updated 6 months ago
- Entropy Based Sampling and Parallel CoT Decoding ☆17 Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆85 Updated 5 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆92 Updated last year
- look how they massacred my boy ☆63 Updated last year
- ☆68 Updated last year
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead) ☆21 Updated last year
- ☆68 Updated 8 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon. ☆225 Updated 3 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆61 Updated last year
- Deep research agents using MiniMax M2.1 interleaved thinking ☆194 Updated last month
- Project code for training LLMs to write better unit tests + code ☆21 Updated 8 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆59 Updated 3 months ago
- The next evolution of Agents ☆48 Updated last week
- ☆19 Updated last year
- ☆159 Updated 9 months ago
- ☆17 Updated last year