cognitivecomputations / dolphinflow-optimizer

☆20

Alternatives and similar repositories for dolphinflow-optimizer

Users who are interested in dolphinflow-optimizer are comparing it to the libraries listed below.

cognitivecomputations / grokadamw
☆132 Updated 10 months ago
brendanhogan / picoDeepResearch
☆63 Updated last month
cognitivecomputations / kraken
☆66 Updated last year
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆64 Updated 7 months ago
teknium1 / ShareGPT-Builder
☆114 Updated 6 months ago
stockeh / mlx-optimizers
A collection of optimizers for MLX
☆36 Updated 3 weeks ago
xjdr-alt / llmri
look how they massacred my boy
☆63 Updated 8 months ago
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆80 Updated last month
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆94 Updated 3 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆53 Updated 4 months ago
google-deepmind / latent-multi-hop-reasoning
[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?
☆68 Updated 3 months ago
fal-ai-community / llmdifftracker
Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)
☆34 Updated 3 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆28 Updated 3 weeks ago
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆32 Updated 3 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆101 Updated 3 months ago
Danau5tin / calculator_agent_rl
Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a 62% absolute increase in evaluation accuracy.
☆41 Updated last month
cognitivecomputations / agenticworker
☆23 Updated 7 months ago
allenai / infinigram-api
☆60 Updated 2 weeks ago
not-lain / pxia
minimalistic AI library that resembles HF's transformers
☆13 Updated 5 months ago
av / klmbr
klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs
☆76 Updated 9 months ago
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53 Updated 7 months ago
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆140 Updated 4 months ago
cognitivecomputations / dolphin-logger
☆96 Updated last week
samefarrar / entropix_mlx
Modify Entropy Based Sampling to work with Mac Silicon via MLX
☆50 Updated 7 months ago
PuchToTalk / DOOM-MistralAI
Mistral7B playing DOOM
☆28 Updated last year
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆85 Updated last month
lechmazur / nyt-connections
Benchmark that evaluates LLMs using 651 NYT Connections puzzles extended with extra trick words
☆101 Updated last week
Goekdeniz-Guelmez / mlx-lm-lora
Train Large Language Models on MLX.
☆94 Updated this week
julien-blanchon / arxflix
Arxflix turns your boring Arxiv research paper into a captivating video.
☆51 Updated 3 weeks ago
menloresearch / ReZero
☆149 Updated 2 months ago