☆15May 27, 2019Updated 6 years ago
Alternatives and similar repositories for OnlineClusteringOfBandits
Users that are interested in OnlineClusteringOfBandits are comparing it to the libraries listed below
- ☆11Aug 10, 2020Updated 5 years ago
- Notes from Simons Institute program "Foundations of Machine Learning"☆13May 5, 2017Updated 8 years ago
- [NeurIPS 2025 Spotlight] Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning☆18Nov 14, 2025Updated 4 months ago
- ☆11Dec 21, 2023Updated 2 years ago
- Bayesian Inverse Reinforcement Learning with simple environments☆19May 17, 2022Updated 3 years ago
- Active Learning using Multi Label Image Dataset☆14Feb 20, 2019Updated 7 years ago
- ☆13Jul 3, 2022Updated 3 years ago
- Library for Multi-Armed Bandit Algorithms☆57Apr 2, 2017Updated 8 years ago
- Codes for graphlet counting☆11Dec 11, 2017Updated 8 years ago
- Simulations for Dueling Bandit Algorithms, including our Double Thompson Sampling (D-TS) algorithms☆25Sep 27, 2016Updated 9 years ago
- Towards Adaptive ML Benchmarks: Web-Agent-Driven Construction, Domain Expansion, and Metric Optimization☆20Sep 12, 2025Updated 6 months ago
- User Simulation for Conversational Recommendation☆19Jan 30, 2026Updated last month
- ☆27Oct 6, 2025Updated 5 months ago
- Code for the work "Adaptive Sample Scheduling for Direct Preference Optimization", which was accepted to the 2025 Conference on Neural In…☆40Sep 30, 2025Updated 5 months ago
- Implementation of Inverse Propensity Matrix Factorization with Pytorch-Lightning☆12Sep 23, 2020Updated 5 years ago
- ☆10Oct 28, 2020Updated 5 years ago
- ☆11Oct 5, 2024Updated last year
- ☆23Sep 30, 2024Updated last year
- A Python 3 Bandit Visualization Package☆11Oct 16, 2017Updated 8 years ago
- face recognition; LDA; PCA; eigenface☆11Nov 7, 2018Updated 7 years ago
- [ACL'25 Main] SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence! | Help your LLM make better use of context documents: a simple attention-based approach☆26Feb 17, 2025Updated last year
- Unsupervised Chinese Typography Transfer☆17Mar 25, 2023Updated 2 years ago
- Beyond log-likelihood: exploring alternative objectives for supervised fine-tuning of language model post-training☆55Oct 4, 2025Updated 5 months ago
- Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire☆33Feb 2, 2023Updated 3 years ago
- an active learning framework in python☆45Apr 30, 2016Updated 9 years ago
- active learning in python☆35Mar 23, 2017Updated 8 years ago
- Author implementation of "Learning to Search in Long Documents Using Document Structure" (Mor Geva and Jonathan Berant, 2018)☆22Jul 12, 2018Updated 7 years ago
- Compositional question answering☆17Jun 18, 2021Updated 4 years ago
- The implementation of Multiple Choice Questions based Multi-Interest Policy Learning for Conversational Recommendation☆29May 8, 2022Updated 3 years ago
- Papers being part of the state of the art on reinforcement learning☆21Jan 16, 2020Updated 6 years ago
- A curated list of resources for "Flow Matching Meets Biology and Life Science: A Survey". Nature Portfolio Journal Artificial Intelligenc…☆70Mar 7, 2026Updated last week
- This repo is reproduction resources for linear alignment paper, still working☆18May 19, 2024Updated last year
- A TensorFlow implementation of SOFA, the Simulator for OFfline LeArning and evaluation.☆21Nov 29, 2020Updated 5 years ago
- Deeper insights into graph convolutional networks for semi-supervised learning☆19Dec 19, 2018Updated 7 years ago
- Active Learning in R☆47May 21, 2017Updated 8 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- Sentence encoder and training code for Mean-Max AAE☆16Nov 8, 2018Updated 7 years ago
- Code and Models for paper "AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning. Han Guo, Ramakanth Pasunuru, and Mohit Ba…☆24Apr 15, 2019Updated 6 years ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 4 years ago