Al-th / grpo_experimentLinks
Experiment on reimplementation of GRPO RL
☆12Updated 6 months ago
Alternatives and similar repositories for grpo_experiment
Users that are interested in grpo_experiment are comparing it to the libraries listed below
Sorting:
- A star for organising blocks and playing with transformers.☆23Updated last year
- Framework for specifying and proving properties—such as robustness, fairness, and interpretability—of machine learning models using Lean …☆66Updated last month
- Copies of prolog solvers for use from python☆18Updated last year
- Rewriting Principia Mathematica in Lean☆134Updated 3 weeks ago
- Advanced Python Function Debugging with MCP Integration.☆57Updated 2 months ago
- A probabilistic approximate DNF counter☆37Updated this week
- ☆66Updated 3 months ago
- Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner Oracles☆45Updated this week
- ☆27Updated 11 months ago
- Analyzing hacker news in real-time with Bytewax and Proton☆38Updated last year
- PILF: A IPWT-inspired bionic continual learning experiment focus on mitigate catastrophic forgetting with Surprise-gated Mixture of Exper…☆36Updated last month
- Sequential Logic☆111Updated 2 weeks ago
- A GPU Accelerated Binary Vector Store☆47Updated 6 months ago
- A Full Transcript of the Lighthill Debate on AI from 1973, with Introductory Remarks☆31Updated last year
- PostgreSQL Prolog language handler☆133Updated last year
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆53Updated last year
- LLM plugin for pulling content from Hacker News☆116Updated 3 months ago
- Your AI research assistant☆79Updated 5 months ago
- Ask questions, let GPT do the SQL.☆132Updated 2 years ago
- A query language for exploring knowledge graphs.☆143Updated 3 months ago
- Tools for LLM agents.☆63Updated 8 months ago
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆56Updated last year
- A multithreaded async event loop for python☆59Updated 11 months ago
- A tiny autograd engine with a Jax-like API☆74Updated last month
- A Bayesian based architecture to evaluate the optimal weights of different stocks in a portfolio according to Global Minimum Variance and…☆25Updated 3 years ago
- Prototyping a question and answer bot over PDFs☆39Updated last year
- yet another scalar autograd engine - featuring complex numbers and fixed DAG☆26Updated last year
- Visualize text embeddings☆40Updated 2 years ago
- Implement recursion using English as the programming language and an LLM as the runtime.☆239Updated 2 years ago
- Extremely memory-efficient vector database☆74Updated 11 months ago