KaiNylund/lm-weights-encode-time

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KaiNylund/lm-weights-encode-time)

KaiNylund / lm-weights-encode-time

☆68

Alternatives and similar repositories for lm-weights-encode-time

Users that are interested in lm-weights-encode-time are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sanketvmehta / lifelong-learning-pretraining-and-sam
View on GitHub
Code for the paper "Mehta, S. V., Patil, D., Chandar, S., & Strubell, E. (2023). An Empirical Investigation of the Role of Pre-training i…
☆18Mar 18, 2024Updated 2 years ago
alexrs / herd
View on GitHub
Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.
☆11Feb 11, 2024Updated 2 years ago
acmi-lab / pretraining-with-nonsense
View on GitHub
Pretraining summarization models using a corpus of nonsense
☆13Sep 28, 2021Updated 4 years ago
matfrei / CLIPMasterPrints
View on GitHub
Code for Fooling Contrastive Language-Image Pre-trainined Models with CLIPMasterPrints
☆15Jan 25, 2026Updated 6 months ago
xiamengzhou / training_trajectory_analysis
View on GitHub
[ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf
☆25Nov 14, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yizhongw / llm-temporal-alignment
View on GitHub
Methods and evaluation for aligning language models temporally
☆31Mar 2, 2024Updated 2 years ago
feradauto / MoralCoT
View on GitHub
Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment
☆40Jun 5, 2023Updated 3 years ago
wellecks / llemma_formal2formal
View on GitHub
Llemma formal2formal (tactic prediction) theorem proving experiments
☆20Oct 17, 2023Updated 2 years ago
alexrame / diwa
View on GitHub
DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization
☆31Jan 31, 2023Updated 3 years ago
r-three / mats
View on GitHub
☆33Jul 8, 2024Updated 2 years ago
Rose-STL-Lab / AutoSTPP
View on GitHub
Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efﬁcient, non-parametric inf…
☆25Oct 14, 2024Updated last year
allenai / bff
View on GitHub
☆39Apr 17, 2024Updated 2 years ago
ok1zjf / LBAE
View on GitHub
PyTorch implementation of the ICML 2020 paper "Latent Bernoulli Autoencoder"
☆25Apr 8, 2021Updated 5 years ago
vsahil / MIMETIC-2
View on GitHub
Official Code for MIMETIC^2
☆13Nov 19, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
lfsszd / CS-Drafting
View on GitHub
Cascade Speculative Drafting
☆33Apr 2, 2024Updated 2 years ago
rustnl / rustnl2023
View on GitHub
RustNL 2023 conference
☆15Jan 24, 2024Updated 2 years ago
jiangycTarheel / SQ-Transformer
View on GitHub
☆10Feb 12, 2024Updated 2 years ago
nlpie-research / Lightweight-Clinical-Transformers
View on GitHub
This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…
☆18Mar 26, 2024Updated 2 years ago
jihyechoi77 / malade
View on GitHub
Repository for the paper "MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance"
☆29Feb 19, 2025Updated last year
HanGuo97 / lq-lora
View on GitHub
☆129Jan 22, 2024Updated 2 years ago
alon-albalak / FLAD
View on GitHub
Few-shot Learning with Auxiliary Data
☆31Dec 8, 2023Updated 2 years ago
jungokasai / beam_with_patience
View on GitHub
☆46Apr 13, 2022Updated 4 years ago
sjunhongshen / ORCA
View on GitHub
Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine"
☆75Mar 6, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
JoshEngels / MultiDimensionalFeatures
View on GitHub
Code for reproducing our paper "Not All Language Model Features Are Linear"
☆90Nov 27, 2024Updated last year
wesg52 / world-models
View on GitHub
Extracting spatial and temporal world models from LLMs
☆262Oct 17, 2023Updated 2 years ago
Jiacheng-Zhu-AIML / AsymmetryLoRA
View on GitHub
Preprint: Asymmetry in Low-Rank Adapters of Foundation Models
☆40Feb 27, 2024Updated 2 years ago
tobna / TaylorShift
View on GitHub
This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back)…
☆15Feb 25, 2026Updated 5 months ago
locuslab / T-MARS
View on GitHub
Code for T-MARS data filtering
☆35Aug 23, 2023Updated 2 years ago
sabithsn / APPDIA-Discourse-Style-Transfer
View on GitHub
Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…
☆13Sep 8, 2022Updated 3 years ago
nlp-waseda / mtl-eadrg
View on GitHub
Emotion-Aware Dialogue Response Generation by Multi-Task Learning
☆13Jan 22, 2022Updated 4 years ago
google-deepmind / affordances_option_models
View on GitHub
☆22Nov 8, 2021Updated 4 years ago
ethancaballero / broken_neural_scaling_laws
View on GitHub
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59Oct 29, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
facebookresearch / SIMAT
View on GitHub
codebase for the SIMAT dataset and evaluation
☆39Feb 16, 2022Updated 4 years ago
locuslab / scaling_laws_data_filtering
View on GitHub
☆64Apr 9, 2024Updated 2 years ago
Devansh3712 / PySQL
View on GitHub
Python wrapper for making MySQL queries easier
☆10Mar 13, 2023Updated 3 years ago
locuslab / llava-token-compression
View on GitHub
☆47Nov 8, 2024Updated last year
Thartvigsen / GRACE
View on GitHub
[NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
☆86Dec 21, 2024Updated last year
pratyushasharma / laser
View on GitHub
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
☆397Jul 9, 2024Updated 2 years ago
delyan-boychev / imaginet
View on GitHub
☆11Apr 25, 2026Updated 3 months ago