avilum / yallaLinks
A tiny LLM Agent with minimal dependencies, focused on local inference.
☆53Updated 8 months ago
Alternatives and similar repositories for yalla
Users that are interested in yalla are comparing it to the libraries listed below
Sorting:
- Work with OpenAI's streaming API at ease with Python generators☆121Updated last year
- A pytest plugin for running and analyzing LLM evaluation tests.☆127Updated 4 months ago
- Transform your pythonic research to an artifact that engineers can deploy easily.☆154Updated last week
- A Lossless Compression Library for AI pipelines☆250Updated 2 months ago
- A WhatsApp bot that can participate in group conversations, powered by AI. The bot monitors group messages and responds when mentioned.☆78Updated last week
- Function Calling Benchmark & Testing☆86Updated 11 months ago
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.☆92Updated 8 months ago
- InferX is a Inference Function as a Service Platform☆111Updated last week
- ☆100Updated 7 months ago
- Security framework for LLM-generated SQL queries 🛡️☆30Updated 7 months ago
- Tutorial for building LLM router☆211Updated 11 months ago
- A fast batching API to serve LLM models☆183Updated last year
- ☆121Updated 10 months ago
- A Whatsapp-bot that allows access to GPT3 while also serving as your own memory backup☆135Updated 2 years ago
- SDK for using LLM☆80Updated last year
- HyPSTER - HyperParameter optimization on STERoids☆48Updated 7 months ago
- Adapted version of llama3.np (NumPy) to a CuPy implementation for the Llama 3 model.☆35Updated last year
- MCP server for Israel Government Data☆60Updated this week
- ☆222Updated this week
- Transform any OpenAPI/Swagger definition into a fully-featured Model Context Protocol (MCP) server☆152Updated 2 weeks ago
- ☆66Updated last year
- OpenTelemetry Instrumentation for AI Observability☆480Updated this week
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 9 months ago
- A Pythonic integration for LLMs.☆88Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆151Updated 8 months ago
- A framework for evaluating function calls made by LLMs☆37Updated 11 months ago
- run ollama & gguf easily with a single command☆51Updated last year
- Fast parallel LLM inference for MLX☆193Updated 11 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆239Updated last year
- 🤖 Headless IDE for AI agents☆191Updated 2 months ago