Inference examples
☆67Sep 19, 2025Updated 6 months ago
Alternatives and similar repositories for inference-examples
Users that are interested in inference-examples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple streamlit app that performs Retrieval-Augmented Generation over a corpus of presidential speeches☆15Apr 24, 2024Updated last year
- ☆12Nov 8, 2023Updated 2 years ago
- ☆27Mar 5, 2024Updated 2 years ago
- ☆13Aug 30, 2024Updated last year
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated 11 months ago
- Code OSS DEV☆30Nov 26, 2022Updated 3 years ago
- ☆11May 17, 2024Updated last year
- groq-gradio☆18Nov 19, 2025Updated 4 months ago
- ☆14May 9, 2024Updated last year
- Okra, your all in one personal AI assistant☆14Jun 14, 2024Updated last year
- BH hackathon☆14Apr 4, 2024Updated last year
- Example project showing how you can use Edge Config and Feature Flags with the Vercel AI SDK☆39Dec 12, 2025Updated 3 months ago
- nvidia/parakeet-rnnt-1.1b running in Replicate Cog container ⚙️☆16Jan 5, 2024Updated 2 years ago
- The Onchain AI Oracle Intents Engine (IE): A Basic Text-to-tx Simulator Contract based on OAO.☆16Feb 17, 2024Updated 2 years ago
- Go WorkerPool aims to control heavy Go Routines☆12Aug 20, 2022Updated 3 years ago
- Sample chatbot app using groq inference, ai sdk, shadcn ui components and tailwind☆25Dec 15, 2025Updated 3 months ago
- ODSC 2023 workshop materials on causal graphs using implementations of DoWhy (PyWhy, EconML)☆13Nov 1, 2023Updated 2 years ago
- template for a golang project☆12Feb 25, 2026Updated 3 weeks ago
- automating cross-region failures in Google Cloud SQL☆11Apr 3, 2020Updated 5 years ago
- ☆15Feb 12, 2025Updated last year
- transparent HTTP cache proxy with Redis — deduplicate API calls, save costs☆12Feb 21, 2026Updated last month
- 🦀 Rust server running in a Docker container deployed to AWS ECS via Terraform 🚀☆12Dec 31, 2024Updated last year
- A duckdb extension that executes js (provided by you or generated via OpenAI) in an embedded v8 interpreter and returns a table☆19Jun 9, 2025Updated 9 months ago
- ☆13Jun 29, 2024Updated last year
- Slides and sample code from presentations at our meetup.☆11Aug 13, 2024Updated last year
- ✨ A simple extension that allows you to search and add .cursorrules listed in cursor.directory.☆14Mar 26, 2025Updated 11 months ago
- frameworks_base for Geeksphone Peak and Keon☆12Jan 13, 2015Updated 11 years ago
- ☆10Sep 25, 2020Updated 5 years ago
- Testing framework for SVM programs & integrations☆12Dec 5, 2024Updated last year
- splits videos into scenes with gpt-4o-mini and saves them separately☆12Dec 19, 2024Updated last year
- Command line tool for Deep Infra cloud ML inference service☆34Jun 10, 2024Updated last year
- Fast, 100% local web page summarization with Microsoft Phi-3☆33Apr 29, 2024Updated last year
- real-time voice-to-text in electron☆13Dec 3, 2025Updated 3 months ago
- TouchGFX simulator development in Visual Studio Code with CMake☆14Mar 25, 2025Updated 11 months ago
- Everything you need to launch a full stack web and native app on Cloudflare.☆10Jan 25, 2025Updated last year
- Repository for Google Cloud Run Deep Dive☆11Jul 8, 2020Updated 5 years ago
- This Repo shows how to integrate LangChain, Open AI and store embeddings in the MongoDB Atlas and run a similarity search using MongoDB A…☆12Sep 4, 2023Updated 2 years ago
- 👨🔧Jekyll integration with Google Workbox to create Service Worker automatically.☆14Feb 1, 2019Updated 7 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago