Simple and fast server for GPTQ-quantized LLaMA inference
☆24 · May 18, 2023 · Updated 2 years ago
Alternatives and similar repositories for TALIS
Users that are interested in TALIS are comparing it to the libraries listed below.
- Streamlines the creation of dataset to train a Large Language Model with triplets: instruction-input-output. The default configuration … ☆13 · Apr 17, 2023 · Updated 3 years ago
- ☆17 · Apr 11, 2024 · Updated 2 years ago
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API. ☆50 · May 8, 2023 · Updated 2 years ago
- ☆21 · May 27, 2023 · Updated 2 years ago
- Minimalistic batching application for LLMs using ASP.NET Core and LLamaSharp ☆12 · Oct 23, 2024 · Updated last year
- Like system requirements lab but for LLMs ☆31 · Jun 10, 2023 · Updated 2 years ago
- An open-source non-official community implementation of the model from the paper: Surgical Robot Transformer (SRT): Imitation Learning fo… ☆12 · Apr 20, 2026 · Updated 2 weeks ago
- Easily create LLM automation/agent workflows ☆60 · Feb 13, 2024 · Updated 2 years ago
- CopperAI offers a hands-free, voice-to-voice interaction system with a Large Language Model (LLM) ☆29 · Nov 20, 2023 · Updated 2 years ago
- A repository of projects and datasets under active development by Alignment Lab AI ☆22 · Dec 22, 2023 · Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆125 · Jun 16, 2023 · Updated 2 years ago
- BabyAGI to run with locally hosted models using the API from https://github.com/oobabooga/text-generation-webui ☆86 · May 6, 2023 · Updated 2 years ago
- Harnessing the Memory Power of the Camelids ☆147 · Oct 19, 2023 · Updated 2 years ago
- Train Large Language Models (LLM) using LoRA ☆26 · May 22, 2023 · Updated 2 years ago
- A guidance language for controlling large language models. ☆43 · Jun 9, 2023 · Updated 2 years ago
- Local LLM ReAct Agent with Guidance ☆158 · May 23, 2023 · Updated 2 years ago
- simple prompt script to convert hf/ggml files to gguf, and to quantize ☆29 · Oct 7, 2023 · Updated 2 years ago
- Llama cute voice assistant ☆28 · Sep 10, 2023 · Updated 2 years ago
- BFloat16 Fused Adam Operator for PyTorch ☆19 · Nov 16, 2024 · Updated last year
- ☆11 · Aug 15, 2024 · Updated last year
- Pytorch code for paper QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models ☆25 · Sep 27, 2023 · Updated 2 years ago
- ☆13 · Mar 30, 2026 · Updated last month
- ☆30 · Apr 23, 2025 · Updated last year
- Extra tools to work with the Luigi library ☆11 · Apr 6, 2026 · Updated 3 weeks ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️ ☆10 · Dec 3, 2023 · Updated 2 years ago
- Research that compiles. ☆85 · Apr 19, 2026 · Updated 2 weeks ago
- ☆12 · Apr 4, 2024 · Updated 2 years ago
- Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separat… ☆14 · Apr 23, 2026 · Updated last week
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,… ☆31 · Dec 29, 2025 · Updated 4 months ago
- Codebase topic modeling using GNNs (Node aggregation and clustering) ☆62 · Jun 30, 2023 · Updated 2 years ago
- 4 bits quantization of LLaMa using GPTQ ☆12 · Jun 2, 2023 · Updated 2 years ago
- A powerful image and caption browser/editor, caption generator, and bulk image resizer built with Python (PyQT5) using GPT-4. ☆18 · Apr 21, 2023 · Updated 3 years ago
- Python labs demonstrating various techniques for reducing hallucinations in apps using large language models ☆31 · Nov 28, 2025 · Updated 5 months ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 · Mar 20, 2024 · Updated 2 years ago
- ☆30 · Dec 12, 2025 · Updated 4 months ago
- ☆16 · Jun 5, 2023 · Updated 2 years ago
- Example of DDD using Hexagonal, CQRS and EventSourcing Architecture ☆11 · Oct 7, 2016 · Updated 9 years ago
- SDXL GPU cluster scripts ☆16 · Oct 28, 2023 · Updated 2 years ago
- ☆11 · Sep 5, 2025 · Updated 7 months ago