wassname / prob_jsonformer
Generate Structured JSON with probs from Language Models
☆16Updated 9 months ago
Alternatives and similar repositories for prob_jsonformer:
Users that are interested in prob_jsonformer are comparing it to the libraries listed below
- A Python library to orchestrate LLMs in a neural network-inspired structure☆46Updated 5 months ago
- entropix style sampling + GUI☆25Updated 4 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 4 months ago
- ☆38Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆37Updated last year
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆26Updated last year
- ☆20Updated last year
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- High level tool use for LLMs☆34Updated 7 months ago
- Verbosity control for AI agents☆60Updated 9 months ago
- ☆14Updated 2 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Updated 6 months ago
- Modified Beam Search with periodical restart☆12Updated 6 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated last month
- Deploy a FastHTML app in just a few lines of simple python code on Modal's serverless infra.☆26Updated 7 months ago
- LMQL implementation of tree of thoughts☆34Updated last year
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated 5 months ago
- A framework that uses multi-agents to enable users to perform a systematic data science pipeline with just two inputs.☆39Updated 7 months ago
- ☆39Updated last year
- ☆17Updated 3 months ago
- Host LLM via text-generation-inference☆15Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Updated last year
- Complex RAG backend☆28Updated 11 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 6 months ago
- One Line To Build Zero-Data Classifiers in Minutes☆36Updated 5 months ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆30Updated 9 months ago
- Text generation in Python, as easy as possible☆55Updated last week
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆74Updated this week
- Branch Out Your Conversations☆32Updated 2 months ago