This repo offers advanced tutorials for LLMs, BERT-based models, and multimodal models, covering fine-tuning, quantization, vocabulary expansion, and tasks like text classification, similarity calculation, and image-text matching.
☆24 · May 5, 2025 · Updated last year
Alternatives and similar repositories for LLM101
Users who are interested in LLM101 are comparing it to the libraries listed below.
- ⛏️ Storage for my slides, reports, and papers. ☆12 · Oct 27, 2024 · Updated last year
- Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1. ☆26 · Apr 24, 2025 · Updated last year
- Official Code for Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning ☆18 · Jul 24, 2025 · Updated 9 months ago
- ☆13 · Feb 21, 2025 · Updated last year
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang ☆16 · May 4, 2023 · Updated 3 years ago
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm" ☆11 · Feb 20, 2025 · Updated last year
- G^3: Geolocation via Guidebook Grounding, Findings of EMNLP 2022 ☆17 · Sep 10, 2024 · Updated last year
- ☆10 · Dec 8, 2022 · Updated 3 years ago
- 🚀 Lightweight video 🎥 large model 🤖 ☆22 · Apr 27, 2025 · Updated last year
- ☆12 · Sep 29, 2024 · Updated last year
- ☆12 · Jan 17, 2023 · Updated 3 years ago
- ACL24 ☆11 · Jun 7, 2024 · Updated last year
- Campus recruitment review journey: machine learning, deep learning, Leetcode, NLP, and more (mind-map-style notes) for algorithm engineer interviews ☆18 · Jun 20, 2021 · Updated 4 years ago
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration ☆16 · Mar 18, 2026 · Updated last month
- ☆18 · Aug 13, 2024 · Updated last year
- Tianchi Big Data Competition, Qianlima contest, risk identification and prediction task, Top 5 ☆14 · May 16, 2019 · Updated 6 years ago
- Submission Guide + Discussion Board for AI Singapore Global Challenge for Safe and Secure LLMs (Track 1A). ☆16 · Jul 4, 2024 · Updated last year
- Image recognition on Raspberry Pi based on TensorFlow Lite ☆20 · Nov 3, 2022 · Updated 3 years ago
- ☆18 · May 6, 2025 · Updated 11 months ago
- [IJCAI 2023] CLE-ViT: Contrastive Learning Encoded Transformer for Ultra-Fine-Grained Visual Categorization. ☆10 · Nov 3, 2023 · Updated 2 years ago
- ☆32 · Jun 9, 2025 · Updated 10 months ago
- ☆14 · May 5, 2019 · Updated 7 years ago
- Official PyTorch implementation for "Where You Edit is What You Get: Text-Guided Image Editing with Region-Based Attention" (Pattern Reco… ☆10 · Oct 1, 2024 · Updated last year
- Monitor Chrome Browsing to detect levels of Depression ☆17 · May 21, 2019 · Updated 6 years ago
- A query predictor pipeline and service to predict resource usages of Presto queries ☆14 · May 2, 2023 · Updated 3 years ago
- A repository for fake news detection. ☆19 · Jun 29, 2023 · Updated 2 years ago
- [ICLR2025 Spotlight] Advantage-Guided Distillation for Preference Alignment in Small Language Models ☆26 · Feb 10, 2025 · Updated last year
- A collection of optimal and heuristic scheduling tools ☆16 · Apr 24, 2026 · Updated last week
- OODRobustBench: a Benchmark and Large-Scale Analysis of Adversarial Robustness under Distribution Shift. ICML 2024 and ICLRW-DMLR 2024 ☆23 · Jul 25, 2024 · Updated last year
- The code of LogCL ☆24 · Mar 5, 2025 · Updated last year
- AttriMIL for Whole-Slide Pathological Image Analysis ☆24 · Nov 17, 2025 · Updated 5 months ago
- NAACL 2021: Temporal Knowledge Graph Completion using a Linear Temporal Regularizer and Multivector Embeddings ☆20 · Jan 27, 2022 · Updated 4 years ago
- ☆89 · Apr 11, 2026 · Updated 3 weeks ago
- ☆27 · Dec 17, 2025 · Updated 4 months ago
- This repository contains the entire pipeline (including data preprocessing, training, testing, evaluation and visualization) for the Shear… ☆10 · Dec 3, 2019 · Updated 6 years ago
- A refactoring of the open-source SSD that simplifies the code structure ☆11 · Jan 23, 2019 · Updated 7 years ago
- ☆21 · Mar 16, 2025 · Updated last year
- The source code of paper "An Effective System for Multi-format Information Extraction". ☆18 · Aug 14, 2021 · Updated 4 years ago
- plate ☆12 · Feb 19, 2019 · Updated 7 years ago