Awesome-SLM: a curated list of Small Language Models
★31 · Jun 24, 2024 · Updated last year
Alternatives and similar repositories for Awesome-SLM
Users that are interested in Awesome-SLM are comparing it to the libraries listed below.
- 🇰🇷 Korean LLM Datasets | Pre-training, SFT, DPO, RLHF, CoT | Korean LLM dataset curation ★41 · Jan 20, 2026 · Updated 4 months ago
- Survey of small language models ★18 · Jul 23, 2024 · Updated last year
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs" ★12 · Apr 20, 2024 · Updated 2 years ago
- Small language models training made easy ★15 · Dec 15, 2024 · Updated last year
- Simple implementation of two layers of Disentangled GCN ★13 · Mar 24, 2021 · Updated 5 years ago
- These are papers that I read and reviewed related to NLP, CV, and Deep Learning. You can check paper links and my reviews. ★13 · Jan 3, 2024 · Updated 2 years ago
- ★10 · Nov 30, 2024 · Updated last year
- Learning energy decompositions for partial inference in GFlowNets ★16 · Jun 4, 2024 · Updated 2 years ago
- AI-driven image & avatar creator for WordPress, powered by DALL·E. Generate unique, royalty-free images, variations, and avatars with sea… ★22 · Jun 29, 2024 · Updated last year
- Unofficial PyTorch implementation of DisenGCN ★14 · Jun 19, 2023 · Updated 2 years ago
- DBLP BibTeX - bibtex wrapper for automatic DBLP & IACR ePrint downloads ★20 · Mar 29, 2023 · Updated 3 years ago
- Code for paper "Compositional Sculpting of Iterative Generative Processes" ★25 · Oct 2, 2023 · Updated 2 years ago
- Repository for the agent-based RAG lecture series by Kane of 모두의 AI. ★19 · Dec 16, 2025 · Updated 6 months ago
- Simple PyTorch profiler that combines DeepSpeed Flops Profiler and TorchInfo ★11 · Feb 12, 2023 · Updated 3 years ago
- ★30 · Sep 3, 2025 · Updated 9 months ago
- The app for visualizing allocated GPUs by SLURM ★13 · Jan 21, 2024 · Updated 2 years ago
- Implementation of: Kristiadi, Agustinus, and Asja Fischer. "Predictive Uncertainty Quantification with Compound Density Networks." (2019)… ★16 · May 26, 2022 · Updated 4 years ago
- Create reliability diagrams to quantify ML calibration. ★10 · Feb 1, 2022 · Updated 4 years ago
- Official code for the SDM2022 paper -- SSSNET: Semi-Supervised Signed Network Clustering. ★25 · Oct 13, 2024 · Updated last year
- This project is intended to build and deploy an SNPE model on Qualcomm Devices, which are having unsupported layers which are not part of… ★10 · Oct 4, 2021 · Updated 4 years ago
- Sublime Merge tutorial ★10 · Nov 8, 2019 · Updated 6 years ago
- ★17 · Jul 28, 2024 · Updated last year
- SVD-AE: Simple Autoencoders for Collaborative Filtering ★18 · Aug 22, 2024 · Updated last year
- Link Prediction with Signed Latent Factors in Signed Social Networks (SIGKDD 2019) ★15 · Oct 24, 2021 · Updated 4 years ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23) ★14 · Nov 1, 2023 · Updated 2 years ago
- ★11 · Jan 21, 2021 · Updated 5 years ago
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset. ★24 · Feb 13, 2025 · Updated last year
- [ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks ★15 · May 18, 2022 · Updated 4 years ago
- Introducing Bundle Recommendation in Conversational Recommendation Scenarios on RecSys 2022 ★22 · Dec 9, 2022 · Updated 3 years ago
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M… ★28 · Mar 14, 2024 · Updated 2 years ago
- A Google Colab for DFDNet: Blind Face Restoration ★12 · Aug 9, 2021 · Updated 4 years ago
- CharFormer (Tay et al., 2022; Gradient-based Subword Tokenizer + T5) model implementation for Huggingface Transformers ★19 · Oct 14, 2024 · Updated last year
- ★12 · May 20, 2025 · Updated last year
- ★15 · May 10, 2021 · Updated 5 years ago
- "Graph Convolutions Enrich the Self-Attention in Transformers!" NeurIPS 2024 ★27 · Mar 19, 2025 · Updated last year
- Learning linear algebra through algorithm implementation, with Python ★24 · Sep 21, 2023 · Updated 2 years ago
- Electricity Theft Detection ★13 · May 8, 2019 · Updated 7 years ago
- A forked version of thesisdown for writing UNSW theses with bookdown and RMarkdown ★11 · Jan 12, 2018 · Updated 8 years ago
- Plugin to support creating and developing Mbed OS projects in CLion ★10 · May 28, 2021 · Updated 5 years ago