suquark / llm4phd
Examples and instructions on using LLMs (especially ChatGPT) for PhD work
☆109 Updated 2 years ago
Alternatives and similar repositories for llm4phd:
Users who are interested in llm4phd compare it to the repositories listed below.
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference ☆19 Updated this week
- ☆75 Updated 2 years ago
- My Curriculum Vitae ☆62 Updated 3 years ago
- ChatGPT - Review & Rebuttal: A browser extension for generating reviews and rebuttals, powered by ChatGPT. ☆250 Updated 2 years ago
- https://csstipendrankings.org ☆206 Updated this week
- My paper/code reading notes in Chinese ☆46 Updated 10 months ago
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization" ☆77 Updated last year
- A comprehensive overview of Data Distillation and Condensation (DDC). DDC is a data-centric task where a representative (i.e., small but … ☆13 Updated 2 years ago
- ☆69 Updated last month
- The implementation for MLSys 2023 paper: "Cuttlefish: Low-rank Model Training without All The Tuning" ☆44 Updated last year
- ☆100 Updated 3 years ago
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. ☆18 Updated last year
- ☆26 Updated 3 years ago
- Code associated with the paper **Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees**. ☆28 Updated last year
- Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding" ☆116 Updated last year
- Efficient research work environment setup for computer science and general workflow for Deep Learning experiments ☆123 Updated 3 years ago
- ☆35 Updated 5 years ago
- A simple PyTorch implementation of Flash MultiHead Attention ☆21 Updated last year
- LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification ☆45 Updated last month
- ICLR2023 statistics ☆60 Updated last year
- ICLR2024 statistics ☆47 Updated last year
- Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking". ☆47 Updated 9 months ago
- Deep learning images developed from nvidia/cuda-cudnn-devel-ubuntu. ☆23 Updated 2 years ago
- Efficient 2:4 sparse training algorithms and implementations ☆54 Updated 4 months ago
- A list of awesome neural symbolic papers. ☆47 Updated 2 years ago
- An auxiliary project analysis of the characteristics of KV in DiT Attention. ☆29 Updated 4 months ago
- The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonatha… ☆66 Updated 3 years ago
- ☆51 Updated last year
- Syphus: Automatic Instruction-Response Generation Pipeline ☆14 Updated last year
- "Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" Zhenyu Zhang, Runjin Chen, Shiw… ☆29 Updated 11 months ago