lilianweng / lilianweng.github.io
My personal page
☆710 · Jun 10, 2025 · Updated 8 months ago
Alternatives and similar repositories for lilianweng.github.io
Users who are interested in lilianweng.github.io are comparing it to the libraries listed below.
- [NeurIPS 2023] Code base for the Renyi Kernel Entropy (RKE) metric for generative models. ☆13 · Jun 18, 2025 · Updated 7 months ago
- INTeractive learning via REPresentatIon Discovery ☆36 · Jun 2, 2024 · Updated last year
- ☆35 · Nov 22, 2024 · Updated last year
- Fast and memory-efficient exact attention ☆22,231 · Updated this week
- Official PyTorch implementation of Diffusive Gibbs Sampler (DiGS), proposed in the paper Diffusive Gibbs Sampling (published at ICML 2024… ☆10 · Aug 15, 2024 · Updated last year
- A small framework for benchmarking machine learning models. ☆21 · Jun 6, 2025 · Updated 8 months ago
- ☆11 · Dec 15, 2025 · Updated last month
- A common protocol for AI agent tools ☆10 · Oct 21, 2024 · Updated last year
- LCA-on-the-line (ICML 2024 Oral) ☆13 · Feb 13, 2025 · Updated last year
- LLM intelligent routing gateway; Enterprise Intelligent AI-API Distribution Gateway ☆13 · Jan 24, 2025 · Updated last year
- Thwomp is a four oscillator drum synthesizer for Max for Live. ☆12 · Feb 4, 2026 · Updated last week
- Almost Surely Stable Deep Dynamics [NeurIPS 2020] ☆13 · Dec 8, 2022 · Updated 3 years ago
- ☆28 · Jul 28, 2022 · Updated 3 years ago
- Study repo for David Silver's Reinforcement Learning Course ☆12 · Apr 26, 2019 · Updated 6 years ago
- Implementation of our paper "Towards Consistent Document-Level Entity Linking: Joint Models for Entity Linking and Coreference Resolution… ☆12 · Nov 13, 2022 · Updated 3 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆70,205 · Updated this week
- Expanding linear RNN state-transition matrix eigenvalues to include negatives improves state-tracking tasks and language modeling without… ☆19 · Mar 15, 2025 · Updated 10 months ago
- Independent implementation of DBCA method from http://arxiv.org/abs/1912.09713 ☆11 · Nov 25, 2020 · Updated 5 years ago
- https://arxiv.org/abs/2102.12594 ☆14 · Oct 3, 2023 · Updated 2 years ago
- ☆13 · Apr 28, 2023 · Updated 2 years ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆15 · Sep 4, 2024 · Updated last year
- Accelerate common Petri dish assays with AI. ☆15 · Oct 28, 2025 · Updated 3 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆41,578 · Feb 7, 2026 · Updated last week
- Train transformer language models with reinforcement learning. ☆17,360 · Updated this week
- ☆46 · May 24, 2025 · Updated 8 months ago
- Flax is a neural network library for JAX that is designed for flexibility. ☆7,066 · Feb 7, 2026 · Updated last week
- PyTorch implementation for ICLR24: "Online GNN Evaluation Under Test-Time Graph Distribution Shifts" ☆16 · Mar 23, 2024 · Updated last year
- Fine-tuning the Qwen1.5-0.5B-Chat model on general information extraction tasks, aiming to: verify the effectiveness of the generative approach compared with extractive NER; give beginners a simple model fine-tuning workflow with as little code as possible; and handle data formatting for large-model training. ☆15 · Sep 6, 2024 · Updated last year
- Code and data for paper "(How) do Language Models Track State?" ☆21 · Mar 31, 2025 · Updated 10 months ago
- auto push daily news with ai ☆13 · Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. ☆20,619 · Updated this week
- This project contains my personal solutions to the MIT 6.5940 course assignments, along with study notes and reflections. ☆15 · Mar 1, 2024 · Updated last year
- SciKit Learn Machine Learning Cheat Sheet ☆21 · Jun 17, 2023 · Updated 2 years ago
- SplitNet implemented based on ResNet-50 trained on ImageNet-22K ☆16 · Jun 18, 2018 · Updated 7 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆52,955 · Nov 12, 2025 · Updated 3 months ago
- A fast, clean, responsive Hugo theme. ☆13,073 · Jan 25, 2026 · Updated 2 weeks ago
- Inference code for Llama models ☆59,141 · Jan 26, 2025 · Updated last year
- ☆36 · Feb 8, 2024 · Updated 2 years ago
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model… ☆156,173 · Feb 7, 2026 · Updated last week