lilianweng / lilianweng.github.io
View external linksLinks

My personal page

☆710

Alternatives and similar repositories for lilianweng.github.io

Users that are interested in lilianweng.github.io are comparing it to the libraries listed below

Sorting:

mjalali / renyi-kernel-entropy
View on GitHub
[NeurIPS 2023] Code base for the Renyi Kernel Entropy (RKE) metric for generative models.
☆13Jun 18, 2025Updated 7 months ago
microsoft / Intrepid
View on GitHub
INTeractive learning via REPresentatIon Discovery
☆36Jun 2, 2024Updated last year
radarFudan / mamba-minimal-jax
View on GitHub
☆35Nov 22, 2024Updated last year
Dao-AILab / flash-attention
View on GitHub
Fast and memory-efficient exact attention
☆22,231Updated this week
Wenlin-Chen / DiGS
View on GitHub
Official PyTorch implementation of Diffusive Gibbs Sampler (DiGS), proposed in the paper Diffusive Gibbs Sampling (published at ICML 2024…
☆10Aug 15, 2024Updated last year
aai-institute / nnbench
View on GitHub
A small framework for benchmarking machine learning models.
☆21Jun 6, 2025Updated 8 months ago
lavinal712 / control-lora-v3
View on GitHub
☆11Dec 15, 2025Updated last month
agentsea / toolfuse
View on GitHub
A common protocol for AI agent tools
☆10Oct 21, 2024Updated last year
ElvishElvis / LCA-on-the-line
View on GitHub
LCA-on-the-line (ICML 2024 Oral)
☆13Feb 13, 2025Updated last year
hikariming / SynapseHub
View on GitHub
LLM智能路由网关、 Enterprise Intelligent AI-API Distribution Gateway
☆13Jan 24, 2025Updated last year
robenkleene / thwomp
View on GitHub
Thwomp is a four oscillator drum synthesizer for Max for Live.
☆12Feb 4, 2026Updated last week
NPLawrence / stochastic_dynamics
View on GitHub
Almost Surely Stable Deep Dynamics [NeurIPS 2020]
☆13Dec 8, 2022Updated 3 years ago
google-deepmind / zipfian_environments
View on GitHub
☆28Jul 28, 2022Updated 3 years ago
xia0nan / David-Silver-Reinforcement-Learning-UCL
View on GitHub
Study repo for David Silver's Reinforcement Learning Course
☆12Apr 26, 2019Updated 6 years ago
klimzaporojets / consistent-EL
View on GitHub
Implementation of our paper "Towards Consistent Document-Level Entity Linking: Joint Models for Entity Linking and Coreference Resolution…
☆12Nov 13, 2022Updated 3 years ago
vllm-project / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆70,205Updated this week
automl / unlocking_state_tracking
View on GitHub
Expanding linear RNN state-transition matrix eigenvalues to include negatives improves state-tracking tasks and language modeling without…
☆19Mar 15, 2025Updated 10 months ago
ronentk / dbca-splitter
View on GitHub
Independent implementation of DBCA method from http://arxiv.org/abs/1912.09713
☆11Nov 25, 2020Updated 5 years ago
princetonvisualai / directional-bias-amp
View on GitHub
https://arxiv.org/abs/2102.12594
☆14Oct 3, 2023Updated 2 years ago
FrancescoSaverioZuppichini / detector
View on GitHub
☆13Apr 28, 2023Updated 2 years ago
CosineAI / experiments
View on GitHub
Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.
☆15Sep 4, 2024Updated last year
mshamash / OnePetri
View on GitHub
Accelerate common Petri dish assays with AI.
☆15Oct 28, 2025Updated 3 months ago
deepspeedai / DeepSpeed
View on GitHub
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆41,578Feb 7, 2026Updated last week
huggingface / trl
View on GitHub
Train transformer language models with reinforcement learning.
☆17,360Updated this week
hkproj / multi-latent-attention
View on GitHub
☆46May 24, 2025Updated 8 months ago
google / flax
View on GitHub
Flax is a neural network library for JAX that is designed for flexibility.
☆7,066Feb 7, 2026Updated last week
Amanda-Zheng / LEBED
View on GitHub
Pytorch implementation for ICLR24:"Online GNN Evaluation Under Test-Time Graph Distribution Shifts"
☆16Mar 23, 2024Updated last year
tianchiguaixia / qwen1.5-ner
View on GitHub
使用Qwen1.5-0.5B-Chat模型进行通用信息抽取任务的微调，旨在：验证生成式方法相较于抽取式NER的效果；为新手提供简易的模型微调流程，尽量减少代码量；大模型训练的数据格式处理。
☆15Sep 6, 2024Updated last year
belindal / state-tracking
View on GitHub
Code and data for paper "(How) do Language Models Track State?"
☆21Mar 31, 2025Updated 10 months ago
shibing624 / AIDailyNews
View on GitHub
auto push daily news with ai
☆13Updated this week
huggingface / peft
View on GitHub
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆20,619Updated this week
Unakar / Efficient_AI
View on GitHub
此项目是我个人对MIT 6.5940 课程作业的答案，学习笔记和心得。
☆15Mar 1, 2024Updated last year
mpolinowski / python-scikitlearn-cheatsheet
View on GitHub
SciKit Learn Machine Learning Cheat Sheet
☆21Jun 17, 2023Updated 2 years ago
dalgu90 / splitnet-imagenet22k
View on GitHub
SplitNet implemented based on ResNet-50 trained on ImageNet-22K
☆16Jun 18, 2018Updated 7 years ago
karpathy / nanoGPT
View on GitHub
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆52,955Nov 12, 2025Updated 3 months ago
adityatelange / hugo-PaperMod
View on GitHub
A fast, clean, responsive Hugo theme.
☆13,073Jan 25, 2026Updated 2 weeks ago
meta-llama / llama
View on GitHub
Inference code for Llama models
☆59,141Jan 26, 2025Updated last year
glaive-ai / function-calling-server
View on GitHub
☆36Feb 8, 2024Updated 2 years ago
huggingface / transformers
View on GitHub
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…
☆156,173Feb 7, 2026Updated last week

lilianweng / lilianweng.github.ioView external linksLinks

Alternatives and similar repositories for lilianweng.github.io

lilianweng / lilianweng.github.io
View external linksLinks