MobileLLM / ChainStream
A Stream-based LLM Agent Framework for Continuous Context Sensing and Sharing
☆29Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for ChainStream
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation☆46Updated 3 months ago
- Paper list for Personal LLM Agents☆331Updated 6 months ago
- Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"☆263Updated 7 months ago
- ☆128Updated last week
- A Comprehensive Benchmark for Software Development.☆84Updated 5 months ago
- Survey Paper List - Efficient LLM and Foundation Models☆217Updated last month
- Modular and structured prompt caching for low-latency LLM inference☆65Updated this week
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable☆111Updated last month
- ☆24Updated 7 months ago
- Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.☆96Updated this week
- PyTorch implementation of paper "Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline".☆74Updated last year
- awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.☆153Updated this week
- A large-scale simulation framework for LLM inference☆271Updated last month
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆177Updated last month
- Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**☆135Updated 5 months ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆116Updated last month
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆112Updated last month
- ShortcutsBench: A Large-Scale Real-World Benchmark for API-Based Agents☆74Updated last month
- ☆94Updated 9 months ago
- Gentopia Agent Zoo and Agent Benchmark☆28Updated last year
- 📰 Must-read papers on KV Cache Compression (constantly updating 🤗).☆123Updated this week
- Langchain Agent finetuning using 7B - LLAMA 2 , on hotpotQA (Retroformer framework)☆14Updated last year
- [COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding☆228Updated 2 months ago
- Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)☆183Updated 2 weeks ago
- ☆63Updated last month
- 📰 Must-read papers and blogs on Speculative Decoding ⚡️☆451Updated this week
- ☆190Updated 2 months ago
- ☆103Updated 3 weeks ago
- [NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.☆388Updated 3 months ago
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)☆76Updated last month