Al-th / grpo_experiment
Experiment on reimplementation of GRPO RL
☆12Updated 2 months ago
Alternatives and similar repositories for grpo_experiment:
Users that are interested in grpo_experiment are comparing it to the libraries listed below
- A star for organising blocks and playing with transformers.☆23Updated 11 months ago
- Framework for specifying and proving properties—such as robustness, fairness, and interpretability—of machine learning models using Lean …☆58Updated last month
- This is a numpy implementation of the Skip-gram algorithm described in Mikolov et al's Word2Vec paper. It is intended for didactic purpos…☆35Updated last year
- Designing bridge trusses with Pytorch autograd☆61Updated last year
- yet another scalar autograd engine - featuring complex numbers and fixed DAG☆26Updated last year
- Prototyping a question and answer bot over PDFs☆39Updated last year
- Python library containing different modules to build circuits.☆46Updated 2 years ago
- ☆27Updated 7 months ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆53Updated 8 months ago
- [EMNLP 2024 Findings] Code for deciphering CoT using shift ciphers☆12Updated 5 months ago
- Analyzing hacker news in real-time with Bytewax and Proton☆39Updated last year
- Extremely memory-efficient vector database☆67Updated 7 months ago
- The Python client and Jupyter helper for CozoDB☆52Updated 5 months ago
- A probabilistic approximate DNF counter☆36Updated last year
- A GPU Accelerated Binary Vector Store☆47Updated 2 months ago
- Kerf (Kerf2) is a columnar tick database and time-series language for Linux/OSX/BSD/iOS/Android. It is written in C++ and natively speaks…☆27Updated 2 years ago
- ☆14Updated 7 months ago
- A Bayesian based architecture to evaluate the optimal weights of different stocks in a portfolio according to Global Minimum Variance and…☆25Updated 2 years ago
- A multithreaded async event loop for python☆58Updated 6 months ago
- Visualize text embeddings☆37Updated last year
- A playground to make it easy to try crazy things☆33Updated this week
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 6 months ago
- This is the repo for all cell sorting code and data☆16Updated 5 months ago
- convert a scikit-learn decision tree into a Keras model☆39Updated last year
- A fork of llama3.c used to do some R&D on inferencing☆21Updated 4 months ago
- Typed python equivalent for R pipes.☆12Updated 2 years ago
- I moved this folder. Keeping this repo up for archival purposes only.☆17Updated 10 months ago
- a graph definition and execution library for python☆16Updated 2 years ago
- Interpolate between embedding points with llm☆36Updated 9 months ago
- Code for "Learning to Play the Chaos Game: Dreaming of fractal foliage by differentiating iterated function systems"☆14Updated 4 years ago