omikad / probs
PROBS algorithm implementation
☆10Updated 2 weeks ago
Alternatives and similar repositories for probs:
Users who are interested in probs are comparing it to the libraries listed below:
- Alpha-Zero Connect Four NN trained via self-play☆13Updated 3 months ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆41Updated 4 months ago
- Play chess against large language models.☆41Updated 11 months ago
- A simple library for working with Hugging Face models.☆14Updated 3 weeks ago
- A pure and fast NumPy implementation of Mamba with cache support.☆17Updated 7 months ago
- Simplified implementation of a UMAP-like dimensionality reduction algorithm☆44Updated 2 months ago
- One Line To Build Zero-Data Classifiers in Minutes☆33Updated 3 months ago
- ☆25Updated 4 months ago
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆26Updated last week
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆46Updated 7 months ago
- Mobile Viewer for W&B, built on top of Flutter.☆32Updated 10 months ago
- Very minimal (and stateless) agent framework☆41Updated last week
- Exploration: using technology to aid people who lack both the ability to speak and fine motor control.☆12Updated 2 months ago
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers.☆14Updated this week
- ☆111Updated last month
- ☆30Updated 4 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools.☆14Updated 2 months ago
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆31Updated last week
- ☆62Updated 11 months ago
- Implementation of https://arxiv.org/pdf/2312.09299☆20Updated 6 months ago
- GPU benchmark☆50Updated 3 months ago
- Training code for Sparse Autoencoders on Embedding models☆35Updated last month
- ☆60Updated last week
- A high throughput, end-to-end RL library for infinite horizon tasks.☆18Updated 7 months ago
- Lightweight tools for quick and easy LLM demos☆26Updated 3 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated last month
- ☆12Updated 10 months ago
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆46Updated 2 months ago
- ☆46Updated this week