This repository contains code for the paper "Better Estimation of the KL Divergence Between Language Models"
☆19May 30, 2025Updated 11 months ago
Alternatives and similar repositories for kl-rb
Users who are interested in kl-rb are comparing it to the libraries listed below.
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation☆16Oct 14, 2022Updated 3 years ago
- ☆22Aug 30, 2025Updated 8 months ago
- Framework for type-safe pure functional and non-cubical tensor processing, written in Idris 2☆38Mar 27, 2026Updated last month
- ☆14Apr 29, 2025Updated last year
- ☆11Apr 28, 2024Updated 2 years ago
- Direct preference optimization with f-divergences.☆16Nov 3, 2024Updated last year
- Robust and safe deep reinforcement learning algorithms☆17Mar 27, 2024Updated 2 years ago
- Watering and draining the Earth (and other celestial objects)☆18Nov 24, 2021Updated 4 years ago
- Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs"☆33Oct 12, 2025Updated 6 months ago
- Path-SGD: Path-Normalized Optimization in Deep Neural Networks☆19Nov 26, 2018Updated 7 years ago
- Learning deep learning☆13Jun 15, 2018Updated 7 years ago
- Python 3 implementation of the affiliation metrics and tests for reproducing the experiments described in "Local Evaluation of Time Serie…☆27May 30, 2022Updated 3 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Jun 3, 2024Updated last year
- Natural Perturbation for Robust Question Answering☆12Apr 7, 2020Updated 6 years ago
- Interactive documentation and programming with Scala, iPython notebook style.☆19Mar 9, 2016Updated 10 years ago
- Aho–Corasick automaton implementation in Golang☆10Apr 22, 2016Updated 10 years ago
- ALBERT Persian Playground☆13Jun 12, 2023Updated 2 years ago
- An unofficial Python 3 version of jemdoc.☆11Feb 8, 2026Updated 2 months ago
- Code for running forward and backward versions of GPT2☆10Nov 20, 2021Updated 4 years ago
- Because it's there.☆16Sep 22, 2024Updated last year
- A simple demonstration of using ctypes to call a C++ class from Python☆30May 6, 2020Updated 5 years ago
- Determinantal Point Processes in Python (NumPy)☆24Jul 5, 2017Updated 8 years ago
- Reversible programming in Agda☆13Jun 22, 2023Updated 2 years ago
- Composable inference algorithms with LLMs and programmable logic☆70Dec 4, 2024Updated last year
- Normalized and modified version of Bijankhan corpus☆13Feb 21, 2023Updated 3 years ago
- ☆15May 17, 2022Updated 3 years ago
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 8 months ago
- Modern Methods of Applied Statistics (Spring 2023) STAT 34800☆10May 20, 2023Updated 2 years ago
- bash script to find and execute java classes with main methods☆19Oct 24, 2025Updated 6 months ago
- ☆10Mar 5, 2023Updated 3 years ago
- ☆11Mar 4, 2020Updated 6 years ago
- ☆15Nov 19, 2018Updated 7 years ago
- Simple-to-use scoring function for arbitrarily tokenized texts.☆48Feb 19, 2025Updated last year
- Exercises from the Fall 2023 Algolab course at ETH Zürich☆23Jan 8, 2025Updated last year
- Official Implementation of implicit reference attack☆11Oct 16, 2024Updated last year
- Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind".☆15Apr 27, 2023Updated 3 years ago
- Roadmap and Materials for learning practical Data Science☆27Updated this week
- Code for the blog posts "Refactorer Future[Option[T]]" on☆12Jun 14, 2017Updated 8 years ago
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin…☆10Jun 17, 2024Updated last year