This repository contains code for the paper "Better Estimation of the KL Divergence Between Language Models"
☆18May 30, 2025Updated 9 months ago
Alternatives and similar repositories for kl-rb
Users that are interested in kl-rb are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation☆16Oct 14, 2022Updated 3 years ago
- ☆17Aug 30, 2025Updated 6 months ago
- Framework for type-safe pure functional and non-cubical tensor processing, written in Idris 2☆29Mar 10, 2026Updated 2 weeks ago
- ☆14Apr 29, 2025Updated 10 months ago
- ☆11Apr 28, 2024Updated last year
- Direct preference optimization with f-divergences.☆16Nov 3, 2024Updated last year
- Robust and safe deep reinforcement learning algorithms☆16Mar 27, 2024Updated last year
- Watering and draining the Earth (and other celestial objects)☆18Nov 24, 2021Updated 4 years ago
- Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs""☆32Oct 12, 2025Updated 5 months ago
- Path-SGD: Path-Normalized Optimization in Deep Neural Networks☆19Nov 26, 2018Updated 7 years ago
- Learning deep learning☆13Jun 15, 2018Updated 7 years ago
- Python 3 implementation of the affiliation metrics and tests for reproducing the experiments described in "Local Evaluation of Time Serie…☆26May 30, 2022Updated 3 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Jun 3, 2024Updated last year
- Natural Perturbation for Robust Question Answering☆12Apr 7, 2020Updated 5 years ago
- Interactive documentation and programming with Scala, iPython notebook style.☆19Mar 9, 2016Updated 10 years ago
- Aho–Corasick algorithm automation implement in Golang☆10Apr 22, 2016Updated 9 years ago
- ALBERT Persian Playground☆13Jun 12, 2023Updated 2 years ago
- An unofficial Python 3 version of jemdoc.☆11Feb 8, 2026Updated last month
- Code for running forward and backward versions of GPT2☆10Nov 20, 2021Updated 4 years ago
- Because it's there.☆16Sep 22, 2024Updated last year
- Determinantal Point Processes in Python (NumPy)☆24Jul 5, 2017Updated 8 years ago
- A simple demonstration of using ctypes to call a C++ class from Python☆30May 6, 2020Updated 5 years ago
- Composable inference algorithms with LLMs and programmable logic☆70Dec 4, 2024Updated last year
- Reversible programming in Agda☆13Jun 22, 2023Updated 2 years ago
- Normalized and modified version of Bijankhan corpus☆13Feb 21, 2023Updated 3 years ago
- ☆15May 17, 2022Updated 3 years ago
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 6 months ago
- Modern Methods of Applied Statistics (Spring 2023) STAT 34800☆10May 20, 2023Updated 2 years ago
- bash script to find and execute java classes with main methods☆19Oct 24, 2025Updated 4 months ago
- ☆11Mar 4, 2020Updated 6 years ago
- Exercises from the Fall 2023 Algolab course at ETH Zürich☆23Jan 8, 2025Updated last year
- ☆15Nov 19, 2018Updated 7 years ago
- Simple-to-use scoring function for arbitrarily tokenized texts.☆48Feb 19, 2025Updated last year
- Official Implementation of implicit reference attack☆11Oct 16, 2024Updated last year
- Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind".☆15Apr 27, 2023Updated 2 years ago
- code pour les billets "Refactorer Future[Option[T]]" sur☆12Jun 14, 2017Updated 8 years ago
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin…☆10Jun 17, 2024Updated last year
- Web-grounded natural language instructions☆18Nov 25, 2024Updated last year
- Understanding the win conditions for a game of League of Legends.☆11Aug 14, 2020Updated 5 years ago