smart-lty / nano-PEARLLinks
Draft-Target Disaggregation LLM Serving System via Parallel Speculative Decoding.
☆117Updated last week
Alternatives and similar repositories for nano-PEARL
Users that are interested in nano-PEARL are comparing it to the libraries listed below
Sorting:
- A high-performance inference engine for LLMs, optimized for diverse AI accelerators.☆707Updated this week
- [EMNLP 2025] RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions☆136Updated 7 months ago
- AI Infra主要是指AI的基础建设,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术。☆251Updated last year
- Omni Model Benchmark with high quality and diversity, which reveals the Compositional Law. We’re now focused on Chinese scenarios — and a…☆72Updated 2 weeks ago
- [NeurIPS 2025 spotlight] QFFT, Question-Free Fine-Tuning for Adaptive Reasoning☆90Updated 2 weeks ago
- ☆71Updated 4 months ago
- Deep Research☆303Updated 2 months ago
- OK Computer in a Box: Your Self-Hosted Agent Workflow Layer☆100Updated last month
- ☆22Updated last week
- ☆117Updated 2 weeks ago
- [ICCV 2023] RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers☆136Updated last year
- Python port of Moses tokenizer, truecaser and normalizer☆112Updated 2 years ago
- ☆30Updated last month
- ☆39Updated 3 weeks ago
- Local nonlinear causal attention latent diffusion models for visual story synthesizing☆32Updated 7 months ago
- This package is designed to bypass puppeteer's bot-detecting captchas such as Cloudflare. It acts like a real browser and can be managed …☆31Updated last year
- The directed brute force cracking tool, after collecting information, uses it to generate a special dictionary containing the feature inf…☆39Updated 2 years ago
- 整理了各大厂的 GitHub 地址及热门开源项目,帮助大家更高效地了解国产开源生态☆109Updated 4 months ago
- BuildArena, where LLM agents design, build, and test rockets, cars, and bridges in a physics simulator given a goal-directed sentence.☆77Updated 2 weeks ago
- ⚡Fast-start scaffold for Gin Framework APIs. Includes MySQL, Redis-powered JWT auth, and a well-structured architecture to launch your Go…☆42Updated last month
- Repository UCB-Coursework contains course study materials, assignments, labs, and projects for CS61C, CS162, CS168, CS188 and CS288 for …☆23Updated 9 months ago
- 已转移到grapi(已弃)☆18Updated last year
- ☆31Updated last month
- ☆31Updated 4 months ago
- A corporate law RAG system with innovative retrieval and contextual strategies☆63Updated last month
- 🛠️ A node-based tooling for FixIt site initialization.☆29Updated last month
- A third-party React-based web admin panel for XXL-JOB — delivering the best experience you’ve ever had.☆194Updated 3 months ago
- MCPMark is a comprehensive, stress-testing MCP benchmark designed to evaluate model and agent capabilities in real-world MCP use.☆317Updated last week
- toon of java☆34Updated last week
- ☆30Updated last month