This is the official implementation of our ACL 2025 Main paper "Balancing Diversity and Risk in LLM Sampling".
☆17 · Oct 16, 2025 · Updated 8 months ago
Alternatives and similar repositories for Benchmarking-and-Guiding-Adaptive-Sampling-Decoding-for-LLMs
Users that are interested in Benchmarking-and-Guiding-Adaptive-Sampling-Decoding-for-LLMs are comparing it to the libraries listed below.
- The official code repo and data hub of top_nsigma sampling strategy for LLMs. · ☆26 · Feb 11, 2025 · Updated last year
- A Qt GUI for large language models · ☆40 · Dec 27, 2023 · Updated 2 years ago
- A swarm of LLM agents that will help you test, document, and productionize your code! · ☆19 · Jun 22, 2026 · Updated last week
- ☆30 · Dec 2, 2024 · Updated last year
- Particle system written on the GPU using compute shaders in Unity. · ☆11 · May 14, 2020 · Updated 6 years ago
- An advanced Self-Centered Intelligence (SCI) prototype that represents a new paradigm in AI-human interaction. · ☆27 · May 21, 2026 · Updated last month
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations. · ☆12 · Feb 11, 2024 · Updated 2 years ago
- Huggingface Backup - Jupyter, Colab and Python Script · ☆10 · Jun 22, 2026 · Updated last week
- Project of Singing Voice Conversion. · ☆16 · Oct 27, 2023 · Updated 2 years ago
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users… · ☆14 · Jan 2, 2026 · Updated 6 months ago
- Rivet plugin to access E2B goodies · ☆10 · Feb 6, 2025 · Updated last year
- Get a flat look for your models with one click, without touching your mesh! · ☆19 · Jul 10, 2017 · Updated 8 years ago
- ☆11 · Aug 26, 2024 · Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… · ☆18 · Oct 13, 2025 · Updated 8 months ago
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined. · ☆12 · Jun 19, 2026 · Updated 2 weeks ago
- Unity compute shader implementation of Andy Lomas' growth algorithm · ☆13 · Aug 5, 2019 · Updated 6 years ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon · ☆16 · May 8, 2025 · Updated last year
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more! · ☆18 · Dec 22, 2025 · Updated 6 months ago
- ☆13 · Jun 15, 2026 · Updated 2 weeks ago
- A tiny server to run local inference on MLX models in the style of OpenAI · ☆13 · Jan 31, 2024 · Updated 2 years ago
- A procedural galaxy generator in C# for .NET and Mono. · ☆15 · Feb 6, 2020 · Updated 6 years ago
- A Next.js chatbot app demonstrating seamless integration with window.ai. · ☆15 · Jun 25, 2023 · Updated 3 years ago
- A simple GitHub Actions script that builds a llamafile and uploads it to Hugging Face · ☆17 · Jan 11, 2024 · Updated 2 years ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5. · ☆11 · May 26, 2023 · Updated 3 years ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi… · ☆16 · Jun 22, 2026 · Updated last week
- 🧬 [WIP] Lobe Flow - an open-source AI-powered node flow editor · ☆22 · Dec 18, 2023 · Updated 2 years ago
- In-browser semantic search demo using EmbeddingGemma and Transformers.js. No server required. · ☆38 · Jun 13, 2026 · Updated 3 weeks ago
- My Gen AI research · ☆11 · Jun 3, 2024 · Updated 2 years ago
- Cut2Next: Generating Next Shot via In-Context Tuning · ☆33 · Aug 21, 2025 · Updated 10 months ago
- NDIToolbox is an open source extensible signal and image processing application under development by TRI/Austin designed to assist with t… · ☆10 · Aug 19, 2018 · Updated 7 years ago
- Automate the batch upload and parsing of documents into Dify's knowledge base, reducing manual intervention and wait time. · ☆16 · Aug 29, 2024 · Updated last year
- smolLM with Entropix sampler on PyTorch · ☆149 · Oct 31, 2024 · Updated last year
- ☆15 · Sep 8, 2023 · Updated 2 years ago
- Implementation of the BatchTopK activation function for training sparse autoencoders (SAEs) · ☆66 · Jul 24, 2025 · Updated 11 months ago
- This project aims to utilize Generative AI for the next marketing strategy in the case of e-commerce customer segmentation. · ☆13 · Mar 19, 2024 · Updated 2 years ago
- Python client for txtai · ☆15 · Updated this week
- Template for creating a BioCypher-driven knowledge graph · ☆13 · Jan 15, 2026 · Updated 5 months ago
- ☆14 · Apr 26, 2021 · Updated 5 years ago
- This is a FastAPI-based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously using multiprocessing. · ☆17 · Apr 8, 2026 · Updated 2 months ago