richardzhuang0412 / EmbedLLMView external linksLinks
Repo for EmbedLLM: Learning Compact Representations of Large Language Models
☆27Sep 25, 2025Updated 4 months ago
Alternatives and similar repositories for EmbedLLM
Users that are interested in EmbedLLM are comparing it to the libraries listed below
Sorting:
- ☆14Jul 24, 2024Updated last year
- Code repo for efficient quantized MoE inference with mixture of low-rank compensators☆31Apr 14, 2025Updated 10 months ago
- Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark"☆29Jun 30, 2025Updated 7 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆153Jun 13, 2024Updated last year
- ☆17Dec 30, 2025Updated last month
- A powerful MCP testing tool with multi-provider LLM support (Ollama, OpenAI, Claude, Gemini). Test, debug, and develop MCP servers with a…☆18Jan 7, 2026Updated last month
- Text to audio with Tik-Tok Voices☆13Apr 6, 2023Updated 2 years ago
- Official implementation of the ICLR paper "Streamlining Redundant Layers to Compress Large Language Models"☆39May 1, 2025Updated 9 months ago
- Home server set up☆13Oct 5, 2025Updated 4 months ago
- Standalone desktop application for Text-to-Speech (TTS) utilizing the Kokoro-82M AI model for pdf files☆28Updated this week
- Simple and powerful extension for searching web and viewing website content.☆11Apr 11, 2025Updated 10 months ago
- US Neighborhood data in GeoJSON format from OpenSource Zillow Neighborhood Boundaries Shapefiles☆11Oct 27, 2016Updated 9 years ago
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks☆11Jun 19, 2025Updated 7 months ago
- ☆12Jun 19, 2024Updated last year
- ☆10Sep 29, 2024Updated last year
- Technical docs to help you make you Halo Strix WORK!☆23Jan 10, 2026Updated last month
- Copilot with deepseek and more...☆13Mar 7, 2025Updated 11 months ago
- This repository includes code for the paper "Towards Zero Touch Networks: Cross-Layer Automated Security Solutions for 6G Wireless Networ…☆14Mar 5, 2025Updated 11 months ago
- Continuous Pipelined Speculative Decoding☆16Jan 4, 2026Updated last month
- This repository is the official implementation of the source code of the paper "B2Opt: Learning to Optimize Black-box Optimization with L…☆11Aug 16, 2024Updated last year
- The Open Source Voice Agent Platform. Orchestrate ultra-low latency AI pipelines for real-time conversations over WebRTC.☆39Dec 21, 2025Updated last month
- ☆18Dec 9, 2025Updated 2 months ago
- Custom Engineered Agents and Tools for Vibe Coders | Agents for TRAE.AI, Smart MCPs, GLM Models integration and more...☆22Dec 24, 2025Updated last month
- GitOps automation for plain old docker compose stack deploy☆10Dec 25, 2024Updated last year
- ☆17Mar 20, 2025Updated 10 months ago
- This repo contains the codes and the notebooks used for the paper "DarkVec: Automatic Analysis of Darknet Traffic with Word Embeddings".☆13Feb 3, 2024Updated 2 years ago
- Local LLM set-up☆18Jul 1, 2024Updated last year
- ☆15Apr 9, 2025Updated 10 months ago
- [ICCV 2025] Official repo of "EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow"☆26Oct 16, 2025Updated 3 months ago
- 📝🤖 WriteAI - Simplify your writing process with AI. Generate emails 📧, articles 📝, essays 📚, & more with ease. Writing is made easy …☆12Feb 21, 2023Updated 2 years ago
- Setup an MCP server in 60 seconds.☆13Dec 12, 2024Updated last year
- This approach of Intrusion Detection uses two GPT models, which are trained on normal network traffic, to predict sequences of communicat…☆11Oct 3, 2023Updated 2 years ago
- Clipboard Regex Replace is a lightweight GoLang application that allows you to automatically apply regex-based replacements to your clipb…☆10Jan 20, 2026Updated 3 weeks ago
- The official implementation of ManiAgent☆22Jan 4, 2026Updated last month
- PaliGemma Inference and Fine Tuning☆13May 15, 2024Updated last year
- FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a …☆10Jan 29, 2026Updated 2 weeks ago
- GUI for WireSock VPN client on Windows☆14Jul 8, 2024Updated last year
- A Prompt Enhancer for flux.1 in ComfyUI☆12Jan 11, 2026Updated last month
- Proxy for OpenAI☆15Sep 2, 2025Updated 5 months ago