☆108Oct 2, 2024Updated last year
Alternatives and similar repositories for SLM_Survey
Users that are interested in SLM_Survey are comparing it to the libraries listed below.
- ☆102Jan 17, 2024Updated 2 years ago
- ☆67Nov 16, 2024Updated last year
- LLMs in a tiny box, under 3 Watt☆18Dec 30, 2024Updated last year
- Our unique contributions are in tools/train/benchmark.☆22Apr 14, 2025Updated last year
- Survey Paper List - Efficient LLM and Foundation Models☆266Sep 22, 2024Updated last year
- [ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation☆10May 19, 2025Updated 11 months ago
- Cascade Speculative Drafting☆33Apr 2, 2024Updated 2 years ago
- Code for the paper "Pretrained Models for Multilingual Federated Learning" at NAACL 2022☆11Aug 9, 2022Updated 3 years ago
- MobiSys#114☆23Aug 17, 2023Updated 2 years ago
- ☆36May 28, 2024Updated last year
- ☆104Jan 24, 2026Updated 3 months ago
- ☆24Mar 26, 2025Updated last year
- A demo of end-to-end federated learning system.☆69Jun 1, 2022Updated 3 years ago
- ☆57Aug 19, 2025Updated 8 months ago
- ☆30Nov 18, 2022Updated 3 years ago
- Prototype of MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism☆30Apr 4, 2025Updated last year
- Lottery Ticket Adaptation☆40Nov 20, 2024Updated last year
- Survey of Small Language Models from Penn State, ...☆251Nov 6, 2025Updated 5 months ago
- FedGS: Federated Graph-based Sampling with Arbitrary Client Availability (arxiv.org/abs/2211.13975), accepted at AAAI 2023.☆17Jan 4, 2023Updated 3 years ago
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation☆51Aug 24, 2025Updated 8 months ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.☆33Aug 18, 2023Updated 2 years ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Oct 5, 2023Updated 2 years ago
- ☆10Jan 9, 2024Updated 2 years ago
- Persistent, read-only, FUSE-based caching file system☆21Jan 14, 2020Updated 6 years ago
- ☆40Sep 22, 2021Updated 4 years ago
- ☆26Nov 10, 2025Updated 5 months ago
- An Inference RAG-based LLM Pipeline with Best Practice LLMOps☆13Aug 20, 2024Updated last year
- Evaluate gpt-4o on CLIcK (Korean NLP Dataset)☆20May 18, 2024Updated last year
- a curated list of the role of small models in the LLM era☆112Sep 23, 2024Updated last year
- DMax: Aggressive Parallel Decoding for dLLMs☆110Apr 20, 2026Updated last week
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆39Sep 12, 2024Updated last year
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆199Mar 7, 2025Updated last year
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆60Oct 31, 2024Updated last year
- ☆28Dec 15, 2025Updated 4 months ago
- ☆10Mar 8, 2025Updated last year
- Learning to Generate STRUCTURED Output with Schema Reinforcement Learning☆23Mar 2, 2025Updated last year
- GoLU, a novel, self-gated and element-wise activation function that performs well over a diverse set of tasks☆25Oct 4, 2025Updated 6 months ago
- sigma-MoE layer☆21Jan 5, 2024Updated 2 years ago