Official implementation of "Steering LLM Reasoning Through Bias-Only Adaptation" and "Small Vectors, Big Effects: A Mechanistic Study of RL-Induced Reasoning via Steering Vectors"
☆53Oct 7, 2025Updated 8 months ago
Alternatives and similar repositories for steering-reasoning
Users that are interested in steering-reasoning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆169Jan 16, 2025Updated last year
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆43Aug 22, 2023Updated 2 years ago
- Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"☆35Sep 18, 2024Updated last year
- ☆13Aug 7, 2021Updated 4 years ago
- ☆11Jan 21, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight☆37Apr 8, 2023Updated 3 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Repository for "Revisiting Non-Acyclic GFlowNets in Discrete Environments" (ICML 2025)☆14Oct 8, 2025Updated 8 months ago
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder☆23Oct 19, 2025Updated 8 months ago
- 👀 VITRina: VIsual Token Representations☆11Jun 15, 2023Updated 3 years ago
- Введение в машинное обучение на БИ☆13Jul 1, 2022Updated 4 years ago
- ☆13Jun 4, 2024Updated 2 years ago
- zero-vocab or low-vocab embeddings☆18Jul 17, 2022Updated 3 years ago
- ☆33Oct 28, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A collection of my data science articles published in Towards Data Science and Towards AI.☆16Sep 19, 2025Updated 9 months ago
- ICASSP2026 HumDial Challenge☆48May 28, 2026Updated last month
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Nov 22, 2022Updated 3 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆29Jan 14, 2025Updated last year
- Streamlit app presented to the Streamlit LLMs Hackathon September 23☆15May 13, 2024Updated 2 years ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Feb 27, 2023Updated 3 years ago
- The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".☆17Jun 20, 2024Updated 2 years ago
- Code for ACL 2022 long paper: Can Prompt Probe Pretrained Language Models? Understanding the Invisible Risks from a Causal View☆10May 17, 2022Updated 4 years ago
- [CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…☆23Jun 6, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Репозиторий курса "Практические аспекты обучения больших языковых моделей", ВМК МГУ, осень, 2024☆20Dec 24, 2024Updated last year
- ☆29Oct 26, 2024Updated last year
- (CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…☆34Apr 7, 2026Updated 2 months ago
- Practice vocab for gre/toefl/ielts etc. Or just download the vocabulary JS files from here (https://github.com/surajk95/wordsta/tree/mast…☆17Sep 25, 2024Updated last year
- ☆10Jul 14, 2024Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Sep 10, 2024Updated last year
- ☆13Aug 27, 2021Updated 4 years ago
- This is the official implementation of "ImageReFL: Balancing Quality and Diversity in Human-Aligned Diffusion Models"☆53Jun 10, 2025Updated last year
- Environments and Algorithms for Generative Flow Networks in JAX☆90May 10, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official Pytorch implementation of "Neural Optimal Transport with General Cost Functionals" (ICLR 2024)☆24Aug 29, 2024Updated last year
- Identity verification from speech☆19Jul 19, 2022Updated 3 years ago
- [ACM MM24] Official implementation of paper "From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning"☆34May 7, 2025Updated last year
- Примеры пропозалов для подачи заявки в Open.TLab☆27Dec 15, 2022Updated 3 years ago
- My solutions of Yandex.Blitz (Yandex Cup) Machine Learning Track 2018☆20Aug 29, 2022Updated 3 years ago
- Deep Learning model for lexical stress detection in spoken English☆28Mar 17, 2020Updated 6 years ago
- ACL 2023 short: Balancing Lexical and Semantic Quality in Abstractive Summarization☆16Dec 18, 2023Updated 2 years ago