VsonicV/es-at-scale

This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"

☆373

Alternatives and similar repositories for es-at-scale

Users that are interested in es-at-scale are comparing it to the libraries listed below.

dibbla / Quantized-Evolution-Strategies
Quantized Evolution Strategies: High-precision Fine-tuning of Quantized LLMs at Low-precision Cost
☆21 · May 14, 2026 · Updated 2 months ago
ESHyperscale / HyperscaleES
Jax Codebase for Evolutionary Strategies at the Hyperscale
☆348 · Feb 27, 2026 · Updated 4 months ago
ESHyperscale / nano-egg
Evolution Pretraining Fully in Int Formats
☆177 · Feb 25, 2026 · Updated 4 months ago
paraynaud / MTH8408-Hiv24
☆15 · Mar 25, 2024 · Updated 2 years ago
iamkanghyunchoi / falqon
Official repository of paper [FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic, NeurIPS 2025]
☆21 · Dec 2, 2025 · Updated 7 months ago
assafbk / OPRM
Overflow Prevention Enhances Long-Context Recurrent LLMs (COLM 2025)
☆18 · Jul 8, 2025 · Updated last year
nmandic78 / AI-VoiceAssistant
A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …
☆19 · Jun 28, 2026 · Updated 3 weeks ago
akhileshthite / zipify-tunes
Convert any playlist CSVs into MP3 files with metadata, bring back your MP3 player!
☆19 · Jul 4, 2026 · Updated 2 weeks ago
tensake / litehook
Lightweight social media monitoring tool built with Rust
☆17 · Jun 10, 2026 · Updated last month
JackXing875 / NeneBot
Ayachi Nene is the cutest in the world!
☆16 · Jul 14, 2026 · Updated last week
Stilwell-Git / Adaptation-with-Noisy-OracLE
PyTorch implementation for our paper "Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation"
☆14 · Apr 19, 2023 · Updated 3 years ago
zkingston / unknot
Unknot Motion Planning
☆15 · Mar 30, 2025 · Updated last year
SYuan03 / VisaAppointmentWatcher
☆15 · Jul 15, 2025 · Updated last year
shangshang-wang / Tora
Tora: Torchtune-LoRA for RL
☆87 · Dec 2, 2025 · Updated 7 months ago
Belyenochi / openclaw-edd
Evaluation-Driven Development for OpenClaw agents: mine golden cases from real sessions, catch regressions before they ship.
☆18 · Mar 17, 2026 · Updated 4 months ago
lucidrains / x-evolution
Implementation of various evolutionary algorithms, starting with evolutionary strategies
☆51 · May 10, 2026 · Updated 2 months ago
vishnu97770 / VELOTYPE
Adaptive AI-powered typing practice system that analyzes repeated user mistakes and generates personalized corrective tasks using FastAPI…
☆16 · Jun 6, 2026 · Updated last month
PrimeIntellect-ai / prime-rl
Agentic RL Training at Scale
☆1,702 · Updated this week
lkarlslund / tokenrouter
One OpenAI-compatible endpoint for all your AI providers
☆16 · Apr 23, 2026 · Updated 2 months ago
jaredrummler / consoul
A beautiful terminal-based AI chat interface built with Textual and LangChain
☆15 · Jan 7, 2026 · Updated 6 months ago
test-time-training / discover
☆608 · May 24, 2026 · Updated last month
ErikZ719 / CoTA
[ICLR 26] Context Tokens are Anchors: Understanding the Repeat Curse in dMLLMs from an Information Flow Perspective
☆16 · Mar 6, 2026 · Updated 4 months ago
Jivoronix / blockchain-data-validator
☆15 · Jan 31, 2025 · Updated last year
Yonoi / AGL-STAN
The code of paper "Spatial-Temporal Attention Network for Crime Prediction with Adaptive Graph Learning"
☆15 · Mar 18, 2023 · Updated 3 years ago
dannysun85 / ClawX
Fully refactored on top of ClawX, with brand-new features added!
☆16 · Feb 23, 2026 · Updated 4 months ago
timjuenemann / wikipedia-mcp
Wikipedia MCP Server written in TypeScript
☆15 · Apr 18, 2025 · Updated last year
mallorbc / llama_dataset_formats
☆20 · Jan 24, 2024 · Updated 2 years ago
SparkZu / RadioLLM
RadioLLM: Introducing Large Language Model into Cognitive Radio via Hybrid Prompt and Token Reprogrammings
☆19 · Jun 14, 2025 · Updated last year
mattkang0 / weatpy
A Python implementation of wego
☆15 · Oct 15, 2016 · Updated 9 years ago
arcee-ai / pybubble
☆81 · Feb 18, 2026 · Updated 5 months ago
omar-A-hassan / medsci-agent
Biomedical research agent with 28 MCP tools powered by MedGemma, TxGemma and OpenCode
☆18 · Mar 14, 2026 · Updated 4 months ago
green-anger / MemoryPool
Fast Efficient Fixed-Size Memory Pool
☆15 · Dec 12, 2018 · Updated 7 years ago
junminhong / awesome-agent-skills
A curated list of agent skills, resources, and tools for building customizable AI workflows (Claude Code, Codex, Kiro-CLI)
☆15 · Jun 19, 2026 · Updated last month
tbrumue / heapo
HEAPO – An Open Dataset for Heat Pump Optimization with Smart Electricity Meter Data and On-Site Inspection Protocols
☆16 · Mar 26, 2025 · Updated last year
MaximeRivest / ovllm
☆39 · Aug 4, 2025 · Updated 11 months ago
lsm1103 / session-dashboard
Browse and monitor the historical session records of AI programming tools (Claude Code, Codex CLI, Cursor, Aider)
☆16 · Mar 18, 2026 · Updated 4 months ago
ycdfwzy / PL-MSCKF
☆16 · Jan 6, 2023 · Updated 3 years ago
hmishfaq / LSAC
The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025
☆22 · May 28, 2025 · Updated last year
acgessler / rust-persistent-kv
Persistent fault-tolerant key-value store in Rust
☆16 · Mar 17, 2025 · Updated last year