Simple and fast server for GPTQ-quantized LLaMA inference
☆24May 18, 2023Updated 3 years ago
Alternatives and similar repositories for TALIS
Users that are interested in TALIS are comparing it to the libraries listed below.
- Streamlines the creation of datasets to train a Large Language Model with triplets: instruction-input-output. The default configuration …☆13Apr 17, 2023Updated 3 years ago
- ☆16Apr 11, 2024Updated 2 years ago
- ☆21May 27, 2023Updated 3 years ago
- An open-source non-official community implementation of the model from the paper: Surgical Robot Transformer (SRT): Imitation Learning fo…☆13Jun 22, 2026Updated last week
- Easily create LLM automation/agent workflows☆59Feb 13, 2024Updated 2 years ago
- CopperAI offers a hands-free, voice-to-voice interaction system with a Large Language Model (LLM)☆29Nov 20, 2023Updated 2 years ago
- ☆15Oct 6, 2022Updated 3 years ago
- ☆14May 25, 2023Updated 3 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Jun 16, 2023Updated 3 years ago
- chatsnack is the easiest Python library for rapid development with OpenAI's ChatGPT API. It's an intuitive interface for creating and man…☆29Updated this week
- Harnessing the Memory Power of the Camelids☆147Oct 19, 2023Updated 2 years ago
- Train Large Language Models (LLM) using LoRA☆26May 22, 2023Updated 3 years ago
- Buzz AI, aka gt-chat, is a fast and intuitive question-answering chatbot for Georgia Tech. Powered by Next.js, FastAPI, and OpenAI, it so…☆30Apr 13, 2023Updated 3 years ago
- Local LLM ReAct Agent with Guidance☆158May 23, 2023Updated 3 years ago
- simple prompt script to convert hf/ggml files to gguf, and to quantize☆29Oct 7, 2023Updated 2 years ago
- Llama cute voice assistant☆28Sep 10, 2023Updated 2 years ago
- BFloat16 Fused Adam Operator for PyTorch☆20Nov 16, 2024Updated last year
- ☆11Aug 15, 2024Updated last year
- PyTorch code for the paper QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models☆25Sep 27, 2023Updated 2 years ago
- ☆30Apr 23, 2025Updated last year
- Extra tools to work with the Luigi library☆11Jun 1, 2026Updated last month
- Framework for fine-tuning the ToolFormer-based LM in a few-shot manner☆25Nov 11, 2023Updated 2 years ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆10Dec 3, 2023Updated 2 years ago
- Research that compiles.☆85Apr 19, 2026Updated 2 months ago
- ☆12Apr 4, 2024Updated 2 years ago
- Share your GPU without MIG or MPS☆50Jan 27, 2026Updated 5 months ago
- pi + rainbowhat + touchscreen + usb sound card (mic or aux in) + open ai = audio logic analyzer☆12Apr 23, 2023Updated 3 years ago
- Django sample app for DigitalOcean App Platform☆12Nov 24, 2025Updated 7 months ago
- A simple example of how to implement vector-based DDPG using PyTorch and an ML-Agents environment.☆18Dec 23, 2018Updated 7 years ago
- Like Duolingo, but better☆39May 5, 2023Updated 3 years ago
- Self-contained, zero-dependency Python lib that gives you unified device properties for GPU, CPU, and NPU. No more calling separat…☆15Jun 19, 2026Updated 2 weeks ago
- This is a plugin for Premiere Pro, which provides an automated way to update timecodes / start times of media (clips) in your projects.☆11Jun 12, 2026Updated 3 weeks ago
- Codebase topic modeling using GNNs(Node aggregation and clustering)☆62Jun 30, 2023Updated 3 years ago
- 4-bit quantization of LLaMA using GPTQ☆12Jun 2, 2023Updated 3 years ago
- Spout plugin for Unreal Engine 5 using DirectX12☆39May 23, 2026Updated last month
- ☆46Jan 7, 2026Updated 5 months ago
- Host LLM via text-generation-inference☆16Dec 5, 2023Updated 2 years ago
- OpenAI compatible API for open source LLMs☆17Oct 30, 2023Updated 2 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Mar 20, 2024Updated 2 years ago