Online Preference Alignment for Language Models via Count-based Exploration
☆17Jan 14, 2025Updated last year
Alternatives and similar repositories for COPO
Users that are interested in COPO are comparing it to the libraries listed below
- Official implementation of paper: LiNo: Advancing Recursive Residual Decomposition of Linear and Nonlinear Patterns for Robust Time Serie…☆18Dec 19, 2025Updated 2 months ago
- ☆33Jul 15, 2025Updated 7 months ago
- A feature-rich concurrency kit, yet another DAG framework☆10Jan 18, 2026Updated last month
- Zhihu crawler: Zhihu questions and answers with more than 1,000 upvotes, and brilliant Zhihu replies☆23May 10, 2016Updated 9 years ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- Paster core module using KiteX☆10Aug 30, 2023Updated 2 years ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Feb 10, 2025Updated last year
- ☆14Aug 12, 2024Updated last year
- ADAPTIVE RESONANCE THEORY. Gail A. Carpenter and Stephen Grossberg☆10Feb 10, 2015Updated 11 years ago
- ☆22Dec 11, 2025Updated 2 months ago
- ☆11Oct 31, 2024Updated last year
- A Caffe/C++ implementation of Deep Deterministic Policy Gradient☆10Feb 1, 2019Updated 7 years ago
- A Multi-Session and Multi-Therapy Benchmark for High-Realism AI Psychological Counselor☆30Jan 13, 2026Updated last month
- NuART-Py: Python Library of Adaptive Resonance Theory Neural Network☆10Jan 26, 2020Updated 6 years ago
- Part of a research scholarship. I built a basic 2d driving sim with simulated lidar data to train Deep Q Neural Network. So far after abo…☆11Feb 15, 2017Updated 9 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Dec 1, 2022Updated 3 years ago
- ☆18Feb 16, 2025Updated last year
- A job management system for python☆10Jan 16, 2026Updated last month
- Official repo of paper LM2☆47Feb 13, 2025Updated last year
- Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA core for the decoding stage of LLM inference.☆46Jun 11, 2025Updated 8 months ago
- Multi-view Broad Learning System☆10Mar 20, 2022Updated 3 years ago
- ☆21Jun 16, 2025Updated 8 months ago
- Task models for human robot collaboration☆12Jul 17, 2018Updated 7 years ago
- 🎹 Instruct.KR 2025 Summer Meetup: Open-source LLMs, all the way to production with vLLM 🎹☆23Aug 2, 2025Updated 7 months ago
- concurrent map implementation using bucket list like a skip list.☆10May 29, 2022Updated 3 years ago
- ardrone simulation in Gazebo (for Kinetic and Gazebo 7). Now it can run.☆10Oct 27, 2017Updated 8 years ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆47Aug 13, 2025Updated 6 months ago
- Accurate counters with Kafka & RocksDB.☆15Jan 22, 2021Updated 5 years ago
- Docker base images for C++ development using vcpkg☆10Jan 27, 2026Updated last month
- Google AI Research☆10Mar 11, 2020Updated 5 years ago
- PyTorch implementation of the "Learning an Adaptive Learning Rate Schedule" paper found here: https://arxiv.org/abs/1909.09712.☆12Jan 15, 2020Updated 6 years ago
- DI, IoC container☆14Nov 9, 2023Updated 2 years ago
- Low-rank Tensor Based Proximity Learning for Multi-view Clustering, TKDE2022☆11Dec 31, 2021Updated 4 years ago
- Multi-layer perceptron, Autoencoder, and Restricted Boltzmann Machine☆10Sep 15, 2018Updated 7 years ago
- Long Context Research☆29Jan 26, 2026Updated last month
- An alternative to elasticsearch engine written in Go for small set of documents that uses inverted index to build the index and utilizes …☆15Jun 14, 2020Updated 5 years ago
- Simple, non-authoritative benchmarks for embedded databases running in GitHub Actions☆11Jul 11, 2024Updated last year
- Benchmarking DeepSeek R1 API response speeds across different providers for performance comparison.☆10Feb 15, 2025Updated last year
- Make sure your remote stays up to date with changes to local code☆18Jun 26, 2020Updated 5 years ago