Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.
☆21Mar 31, 2025Updated last year
Alternatives and similar repositories for number_cookbook
Users that are interested in number_cookbook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures☆33Jan 29, 2026Updated 3 months ago
- exploring whether LLMs perform case-based or rule-based reasoning☆31Mar 2, 2024Updated 2 years ago
- ☆19Apr 26, 2026Updated 3 weeks ago
- ☆18Oct 18, 2024Updated last year
- ☆15Apr 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".☆12Jan 9, 2025Updated last year
- ☆13Jan 22, 2025Updated last year
- Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"☆20Nov 24, 2023Updated 2 years ago
- My personal site, using Wowchemy☆13May 13, 2026Updated last week
- [TVCG & VR'25] LAPIG: Language Guided Projector Image Generation with Surface Adaptation and Stylization☆11Apr 16, 2026Updated last month
- Code and data release of the paper Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows☆15Oct 4, 2024Updated last year
- [IEEE T-BIOM] FaceXBench: Evaluating Multimodal LLMs on Face Understanding☆20Jan 15, 2026Updated 4 months ago
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆50Feb 2, 2026Updated 3 months ago
- Control LLM☆23Apr 6, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆36Nov 18, 2025Updated 6 months ago
- https://footprints.baulab.info☆18Oct 4, 2024Updated last year
- [ACL2024] Exploring the Potential of Large Language Models in Computational Argumentation☆18Aug 21, 2024Updated last year
- ☆20Aug 19, 2024Updated last year
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated last year
- ☆13Apr 9, 2026Updated last month
- MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering☆14May 3, 2024Updated 2 years ago
- Memory experiments with LLMs☆10Mar 31, 2023Updated 3 years ago
- ☆34Oct 13, 2025Updated 7 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization☆26Jan 27, 2026Updated 3 months ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Feb 11, 2025Updated last year
- A thesis template compliant with King's College London and UCL rules☆19Dec 14, 2025Updated 5 months ago
- INFRA-COMPASS is a tool that leverages Large Language Models (LLMs) to create and maintain an inventory of state and local codes and ordi…☆17Updated this week
- ☆22Jul 1, 2024Updated last year
- ☆10Nov 6, 2024Updated last year
- Official implementation of "Weakly-supervised positional contrastive learning: application to cirrhosis classification", MICCAI 2023☆11Dec 16, 2025Updated 5 months ago
- Utility functions/scripts for working with GPUs.☆10Jul 5, 2021Updated 4 years ago
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆25Nov 17, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Official Pytorch Implementation of Paper "DarwinLM: Evolutionary Structured Pruning of Large Language Models"☆20Feb 21, 2025Updated last year
- Source code related to the research paper entitled RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Ri…☆12Mar 10, 2024Updated 2 years ago
- ACL 2026 & NAACL 2025: Bridging Retrieval and Inference through Evidence Fusion☆13Apr 9, 2026Updated last month
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 3 years ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆31Oct 27, 2025Updated 6 months ago
- ☆19Jan 3, 2025Updated last year