ronantakizawa / sleeptimecompute
A Demo of Running Sleep-time Compute to Reduce LLM Latency
☆16 Updated 7 months ago
Alternatives and similar repositories for sleeptimecompute
Users who are interested in sleeptimecompute are comparing it to the repositories listed below
- Testing the different LLM and RAG Tests while I learn along the way ☆207 Updated 6 months ago
- Enhancing LLMs with LoRA ☆197 Updated 2 months ago
- world's stupidest moe llm in 103M parameters ☆19 Updated 5 months ago
- A Demo of Cache-Augmented Generation (CAG) in an LLM ☆118 Updated 6 months ago
- Allows two LLMs to communicate and run code in the terminal ☆27 Updated last year
- ☆28 Updated 6 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆276 Updated 5 months ago
- Reasoning Systems with tool use are strong zero-shot object detectors ☆60 Updated 2 months ago
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP… ☆46 Updated last month
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆29 Updated 11 months ago
- Use smol agents to do research and then update csv columns with its findings. ☆41 Updated 10 months ago
- A persistent local memory for AI, LLMs, or Copilot in VS Code. ☆182 Updated 2 months ago
- Open collaboration infrastructure that enables communication, coordination, trust and payments for The Internet of Agents. ☆204 Updated this week
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨ ☆100 Updated 6 months ago
- Generates breakthrough ideas from a single prompt through an 8 stage walkthrough, with optional research proposal paper. ☆58 Updated 2 months ago
- ☆176 Updated 4 months ago
- Multi-agent autonomous research system using LangGraph and LangChain. Generates citation-backed reports with credibility scoring and web … ☆90 Updated 2 weeks ago
- Command-line personal assistant using your favorite proprietary or local models with access to over 30+ tools ☆112 Updated 6 months ago
- Personal voice assistant, with voice interruption and Twilio support ☆18 Updated 10 months ago
- ☆201 Updated 3 months ago
- ☆134 Updated 2 weeks ago
- ☆21 Updated 4 months ago
- Plug-and-play memory for LLMs in 3 lines of code. Add persistent, intelligent, human-like memory and recall to any model in minutes. ☆242 Updated last month
- LLM Fine Tuning Toolbox images for Ryzen AI 395+ Strix Halo ☆39 Updated 3 months ago
- AI management tool ☆121 Updated last year
- A versatile AI chatbot leveraging function-calling language models via Ollama. Features include advanced function calling, self-reflectio… ☆17 Updated 10 months ago
- ☆62 Updated 6 months ago
- A truly open version of gpt-oss which shows the entire pre-training from scratch ☆80 Updated 3 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no… ☆127 Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API ☆72 Updated last year