robertvacareanu/llm4regression

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/robertvacareanu/llm4regression)

robertvacareanu / llm4regression

Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their context, without any parameter update

☆162

Alternatives and similar repositories for llm4regression

Users that are interested in llm4regression are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AlexIoannides / llm-regression
View on GitHub
Exploring the classical regression capabilities of LLMs.
☆18May 20, 2024Updated 2 years ago
whyNLP / LCKV
View on GitHub
Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance…
☆157Apr 7, 2025Updated last year
laramohan / wikillm
View on GitHub
LLMs as Collaboratively Edited Knowledge Bases
☆52Feb 8, 2026Updated 5 months ago
MattYoon / reasoning-models-confidence
View on GitHub
[NeurIPS 2025] Reasoning Models Better Express Their Confidence"
☆23Nov 19, 2025Updated 8 months ago
ruizheliUOA / ARC_JSD
View on GitHub
A Jensen-Shannon Divergence Driven Mechanistic Study of Context Attribution in Retrieval-Augmented Generation
☆15Aug 28, 2025Updated 10 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
montrealrobotics / iv_rl
View on GitHub
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆40Jul 18, 2025Updated last year
john-hewitt / implicit-ins
View on GitHub
Codebase for Instruction Following without Instruction Tuning
☆36Sep 24, 2024Updated last year
epfml / schedules-and-scaling
View on GitHub
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆93Oct 30, 2024Updated last year
ArmelRandy / tree-of-problems
View on GitHub
[EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality
☆20Mar 4, 2025Updated last year
liziniu / HyperDQN
View on GitHub
Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)
☆12Nov 28, 2023Updated 2 years ago
Guitaricet / relora
View on GitHub
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
☆474Apr 21, 2024Updated 2 years ago
amirzandieh / HyperAttention
View on GitHub
Triton Implementation of HyperAttention Algorithm
☆48Dec 11, 2023Updated 2 years ago
flowersteam / LLM-Culture
View on GitHub
Code for the "Cultural evolution in populations of Large Language Models" paper
☆35Jul 7, 2026Updated 2 weeks ago
ArashRabbani / DeepAngle
View on GitHub
Fast calculation of contact angles in tomography images using deep learning
☆16Apr 2, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
TRI-ML / linear_open_lm
View on GitHub
A repository for research on medium sized language models.
☆78May 23, 2024Updated 2 years ago
dvruette / concept-guidance
View on GitHub
Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…
☆21Feb 23, 2024Updated 2 years ago
araujoalexandre / Lipschitz-SLL-Networks
View on GitHub
☆10Oct 27, 2023Updated 2 years ago
siyan-zhao / ICL_decision_boundary
View on GitHub
official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…
☆20Jul 27, 2025Updated 11 months ago
GindaChen / FlexFlashAttention3
View on GitHub
FlexAttention w/ FlashAttention3 Support
☆27Oct 5, 2024Updated last year
Yuanhy1997 / HyPe
View on GitHub
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
☆14Jul 11, 2023Updated 3 years ago
recursal / GoldFinch-paper
View on GitHub
GoldFinch and other hybrid transformer components
☆46Jul 20, 2024Updated 2 years ago
axgoujon / weakly_convex_ridge_regularizer
View on GitHub
☆12Aug 28, 2023Updated 2 years ago
berndprach / 1LipschitzLayersCompared
View on GitHub
☆13Jul 2, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
UBCDingXin / CCDM
View on GitHub
The official implementation of CCDM and iCCDM.
☆23Feb 7, 2026Updated 5 months ago
OpenNLPLab / LASP
View on GitHub
Linear Attention Sequence Parallelism (LASP)
☆87Jun 4, 2024Updated 2 years ago
zaydzuhri / pythia-mlkv
View on GitHub
Multi-Layer Key-Value sharing experiments on Pythia models
☆34Jun 14, 2024Updated 2 years ago
pprp / ACBench
View on GitHub
[ICML25] Agentic Compression Benchmark (ACBench)
☆17Jul 2, 2025Updated last year
akhilkedia / TranformersGetStable
View on GitHub
[ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"
☆11Jul 19, 2024Updated 2 years ago
abdelfattah-lab / TokenButler
View on GitHub
☆27May 12, 2026Updated 2 months ago
Ryu1845 / hyena-jax
View on GitHub
Implementation of Hyena Hierarchy in JAX
☆10Apr 30, 2023Updated 3 years ago
stanfordmlgroup / ManyICL
View on GitHub
☆147May 23, 2024Updated 2 years ago
tum-ai / number-token-loss
View on GitHub
A regression-alike loss to improve numerical reasoning in language models - ICML 2025
☆30Aug 18, 2025Updated 11 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
acfr / LBDN
View on GitHub
Direct parameterization of Lipschitz-bounded deep networks
☆15May 28, 2023Updated 3 years ago
allenai / CommonGen-Eval
View on GitHub
Evaluating LLMs with CommonGen-Lite
☆95Mar 21, 2024Updated 2 years ago
hbin0701 / Self-Explore
View on GitHub
[𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…
☆52May 4, 2024Updated 2 years ago
swarnaHub / System-1.x
View on GitHub
PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models
☆25Jul 22, 2024Updated 2 years ago
tomaarsen / attention_sinks
View on GitHub
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
☆735Apr 10, 2024Updated 2 years ago
tml-epfl / icl-alignment
View on GitHub
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆33Jan 23, 2025Updated last year
uservan / ThinkPO
View on GitHub
☆17Aug 1, 2025Updated 11 months ago