avilum / yalla
A tiny LLM Agent with minimal dependencies, focused on local inference.
☆50 · Updated last month
Related projects
Alternatives and complementary repositories for yalla
- Tutorial for building LLM router ☆157 · Updated 3 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆222 · Updated 6 months ago
- Function Calling Benchmark & Testing ☆74 · Updated 4 months ago
- ☆106 · Updated this week
- ☆64 · Updated 5 months ago
- 🤖 Headless IDE for AI agents ☆128 · Updated this week
- Generate large synthetic data using a local LLM ☆88 · Updated this week
- Synthetic Data for LLM Fine-Tuning ☆93 · Updated 11 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆105 · Updated last week
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆97 · Updated 7 months ago
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu… ☆52 · Updated 2 months ago
- ☆103 · Updated 7 months ago
- Simple examples using Argilla tools to build AI ☆38 · Updated this week
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆201 · Updated 5 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API ☆68 · Updated last month
- Low-Rank adapter extraction for fine-tuned transformers model ☆162 · Updated 6 months ago
- ☆110 · Updated 2 weeks ago
- One click templates for inferencing Language Models ☆115 · Updated 3 weeks ago
- The long-term memory for your Superagents 🥷 and LLMs 🤖. Built with GraphRAG, Knowledge graphs and autonomous ai agents ☆44 · Updated last month
- The easiest, and fastest way to run AI-generated Python code safely ☆207 · Updated 2 weeks ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆45 · Updated last month
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆81 · Updated last year
- Let's build better datasets, together! ☆202 · Updated 3 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation … ☆160 · Updated 3 months ago
- run ollama & gguf easily with a single command ☆47 · Updated 5 months ago
- A toolkit for building multimodal AI agents ☆107 · Updated 2 weeks ago
- ☆80 · Updated this week
- A Lossless Compression Library for AI pipelines ☆171 · Updated this week
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆100 · Updated 9 months ago
- Client-side toolkit for using large language models, including where self-hosted ☆102 · Updated this week