omikad / probsLinks

PROBS algorithm implementation

☆8

Alternatives and similar repositories for probs

Users that are interested in probs are comparing it to the libraries listed below

Sorting:

kyegomez / OpenStrawberry
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
☆30Updated last week
AtakanTekparmak / agento
Very minimal (and stateless) agent framework
☆44Updated 6 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 5 months ago
lucidrains / grokfast-pytorch
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆101Updated 7 months ago
mmhamdy / open-language-models
A list of language models with permissive licenses such as MIT or Apache 2.0
☆24Updated 4 months ago
BorealisAI / neuzip
Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…
☆59Updated 8 months ago
Agora-Lab-AI / OmegaViT
OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…
☆14Updated this week
NVIDIA / NeMo-Inspector
A tool for an analysis of LLM generations.
☆40Updated last month
RWKV / ZeroCoT
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28Updated 2 months ago
evanatyourservice / llm-jax
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆17Updated 4 months ago
ritabratamaiti / AnyModal
AnyModal is a Flexible Multimodal Language Model Framework for PyTorch
☆100Updated 7 months ago
OpenPipe / rl-experiments
OpenPipe Reinforcement Learning Experiments
☆28Updated 4 months ago
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 7 months ago
gokborayilmaz / multi-keyword-trend-analyzer-agent-
This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…
☆11Updated 5 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 3 months ago
lechmazur / pgg_bench
Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies a…
☆37Updated 3 months ago
AlexBodner / How_Much_VRAM
☆101Updated 10 months ago
bradhilton / temporal-clue
Clue inspired puzzles for testing LLM deduction abilities
☆38Updated 4 months ago
menhguin / minp_paper
Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper
☆38Updated 4 months ago
huggingface / discord-bots
☆50Updated last year
codelion / pts
Pivotal Token Search
☆111Updated last week
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33Updated last year
lucidrains / mind-evolution
Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind
☆55Updated last month
mlabonne / chessllm
☆38Updated last year
lilakk / BLEUBERI
Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"
☆25Updated last month
attentionmech / tensorlens
aesthetic tensor visualiser
☆24Updated 3 months ago
govtech-responsibleai / KnowOrNot
☆19Updated 3 weeks ago
deepgrove-ai / Bonsai
☆22Updated 4 months ago
cpldcpu / llmbenchmark
Various LLM Benchmarks
☆24Updated last month
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆68Updated 3 months ago