☆107Oct 2, 2024Updated last year
Alternatives and similar repositories for SLM_Survey
Users that are interested in SLM_Survey are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…☆30Mar 5, 2024Updated 2 years ago
- ☆102Jan 17, 2024Updated 2 years ago
- LLMs in a tiny box, under 3 Watt☆18Dec 30, 2024Updated last year
- Federated Few-shot Learning for Mobile NLP. Conditionally accepted by MobiCom'23.☆16Aug 18, 2023Updated 2 years ago
- Our unique contributions are in tools/train/benchmark.☆22Apr 14, 2025Updated 11 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation☆10May 19, 2025Updated 10 months ago
- ☆19Feb 28, 2022Updated 4 years ago
- Cascade Speculative Drafting☆33Apr 2, 2024Updated 2 years ago
- Code for the paper "Pretrained Models for Multilingual Federated Learning" at NAACL 2022☆11Aug 9, 2022Updated 3 years ago
- ☆103Jan 24, 2026Updated 2 months ago
- Low-latency ASR using SpeechBrain StreamingASR and torchaudio StreamReader.☆18Apr 19, 2025Updated 11 months ago
- ☆21Mar 26, 2025Updated last year
- A demo of end-to-end federated learning system.☆69Jun 1, 2022Updated 3 years ago
- ☆56Aug 19, 2025Updated 7 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Pytorch implementation of paper: Small Pre-trained Language Models Can be Fine-tuned as Large Models via Over-Parameterization.☆12May 18, 2023Updated 2 years ago
- Prototyp MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism☆28Apr 4, 2025Updated last year
- ☆212Jan 17, 2024Updated 2 years ago
- Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean☆22Apr 23, 2025Updated 11 months ago
- Lottery Ticket Adaptation☆40Nov 20, 2024Updated last year
- Parameter Efficient Transfer Learning with Diff Pruning☆74Feb 3, 2021Updated 5 years ago
- Survey of Small Language Models from Penn State, ...☆250Nov 6, 2025Updated 5 months ago
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation☆51Aug 24, 2025Updated 7 months ago
- [KO-Platy🥮] Korean-Open-platypus를 활용하여 llama-2-ko를 fine-tuning한 KO-platypus model☆73Aug 24, 2025Updated 7 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- ☆10Jan 9, 2024Updated 2 years ago
- Persistent, read-only, FUSE-based caching file system☆21Jan 14, 2020Updated 6 years ago
- ☆40Sep 22, 2021Updated 4 years ago
- DMax: Aggressive Parallel Decoding for dLLMs☆85Updated this week
- ☆26Nov 10, 2025Updated 5 months ago
- ☆31Aug 27, 2024Updated last year
- Evaluate gpt-4o on CLIcK (Korean NLP Dataset)☆20May 18, 2024Updated last year
- a curated list of the role of small models in the LLM era☆112Sep 23, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆191Mar 7, 2025Updated last year
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆60Oct 31, 2024Updated last year
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆208May 24, 2023Updated 2 years ago
- 데일리 매크로 리포트 텔레그램 봇 (히트맵, 지수, 환율, 원자재, 채권 등)☆83Apr 1, 2026Updated last week
- Awesome Chinese Corpus Datasets and Models.☆18Oct 28, 2019Updated 6 years ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆284Jul 11, 2024Updated last year
- sigma-MoE layer☆21Jan 5, 2024Updated 2 years ago