Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform
☆90 · Sep 9, 2025 · Updated 9 months ago
Alternatives and similar repositories for enterprise-h2ogpte
Users that are interested in enterprise-h2ogpte are comparing it to the libraries listed below.
- ☆64 · Feb 4, 2024 · Updated 2 years ago
- ☆12 · Nov 2, 2025 · Updated 7 months ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon ☆16 · May 8, 2025 · Updated last year
- A zero-shot relation extractor, easily downloadable from the HuggingFace repo. ☆12 · Aug 13, 2021 · Updated 4 years ago
- WebAISum is a Python script that allows you to summarize web pages using AI models. It supports both local models like Ollama and remote … ☆15 · Apr 28, 2024 · Updated 2 years ago
- ☆74 · Sep 5, 2023 · Updated 2 years ago
- ☆22 · Oct 14, 2024 · Updated last year
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc… ☆23 · Mar 4, 2024 · Updated 2 years ago
- one stack for all needs ☆18 · May 11, 2026 · Updated last month
- ☆20 · Apr 8, 2025 · Updated last year
- AI_Powered_Dev_Search_Engine ☆12 · Mar 10, 2024 · Updated 2 years ago
- this is a dungeon ai run locally that use your llm in the terminal with multiple players from 2 to 5 ☆17 · Jan 25, 2026 · Updated 4 months ago
- Complex RAG backend ☆29 · Mar 28, 2024 · Updated 2 years ago
- A client-only OpenAI LLM Playground for prototyping agents without writing any code. ☆22 · Aug 31, 2023 · Updated 2 years ago
- Example for Logging LLM Evaluator Prompt Responses ☆18 · Aug 14, 2023 · Updated 2 years ago
- ☆30 · Mar 10, 2024 · Updated 2 years ago
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges ☆53 · Jul 3, 2025 · Updated 11 months ago
- Automated LLM novelist ☆46 · Apr 11, 2024 · Updated 2 years ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io ☆39 · May 8, 2026 · Updated last month
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper. ☆61 · Apr 20, 2026 · Updated last month
- A Taipy Chatbot that supports images thanks to OpenAI's GPT-4o ☆22 · Apr 13, 2026 · Updated 2 months ago
- ☆37 · Jan 29, 2026 · Updated 4 months ago
- A collection of experimental Retrieval Augmented Generation (RAG) Techniques to elevate your pipelines, all with code and intuitive expla… ☆37 · Jul 21, 2025 · Updated 10 months ago
- A way to analyze tool call accuracy, structural correctness and tool recall for LLM's. Uses Native tool calling. ☆23 · Aug 23, 2025 · Updated 9 months ago
- A frontend for creative writing with LLMs ☆164 · Jul 15, 2024 · Updated last year
- speech to text gui for different (e.g. Whisper, Voxtral) models and backends, including whisper.cpp, crispasar, mlx-whisper, faster-whisp… ☆22 · May 30, 2026 · Updated 2 weeks ago
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices. ☆11 · Dec 3, 2024 · Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆45 · Feb 15, 2024 · Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support ☆38 · Aug 8, 2023 · Updated 2 years ago
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing ☆14 · Feb 10, 2023 · Updated 3 years ago
- An OS entirely coded by ChatGPT ☆18 · Sep 5, 2023 · Updated 2 years ago
- My website & blog with articles about coding, tech, functional programming, … ☆10 · Updated this week
- WebRAgent is a retrieval-augmented generation (RAG) web application featuring agent-based query decomposition, vector search with Qdrant,… ☆56 · Mar 22, 2025 · Updated last year
- A framework for steering MoE models by detecting and controlling behavior-linked experts. ☆35 · Sep 12, 2025 · Updated 9 months ago
- aigc evals ☆10 · Dec 2, 2023 · Updated 2 years ago
- alternative remote for Lego Boost with Pythonista and iOS ☆10 · Aug 27, 2017 · Updated 8 years ago
- Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS. ☆82 · Feb 5, 2024 · Updated 2 years ago