Command-line script for inferencing from models such as falcon-7b-instruct
☆75Jun 1, 2023Updated 3 years ago
Alternatives and similar repositories for falcon-play
Users that are interested in falcon-play are comparing it to the libraries listed below.
- Command-line script for inferencing from models such as MPT-7B-Chat☆99Jul 6, 2023Updated 2 years ago
- ☆13Aug 23, 2024Updated last year
- ☆19Jan 24, 2025Updated last year
- Track the progress of LLM context utilisation☆56Apr 14, 2025Updated last year
- The bunkoer library, for securing your data across all your LLM tasks☆10Jan 2, 2024Updated 2 years ago
- Trigger an LLM in your CI/CD to auto-complete your work☆11Apr 5, 2023Updated 3 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- llama-4bit-colab☆63Mar 18, 2023Updated 3 years ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆32Jun 1, 2023Updated 3 years ago
- Simplified version of a common crawl fetcher☆16Dec 24, 2025Updated 5 months ago
- Continuous Meme Delivery☆12Dec 7, 2022Updated 3 years ago
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- ☆23Jul 10, 2023Updated 2 years ago
- This repository tracks the changes to the "Unix Timesharing System" paper written by Dennis Ritchie and Ken Thompson.☆11Oct 6, 2018Updated 7 years ago
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated last year
- ☆22Jul 24, 2023Updated 2 years ago
- ☆16Jul 23, 2024Updated last year
- LLM plugin for models hosted by Anyscale Endpoints☆35Apr 22, 2024Updated 2 years ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Jan 7, 2024Updated 2 years ago
- Oobabooga "Hello World" API example for node.js with Express☆13Jul 2, 2023Updated 2 years ago
- Extend the original llama.cpp repo to support redpajama model.☆117Sep 3, 2024Updated last year
- ☆10Sep 22, 2020Updated 5 years ago
- Implementation of Stable Diffusion from scratch [WORK IN PROGRESS]☆22Feb 18, 2023Updated 3 years ago
- How to build a security camera with a Raspberry Pi☆10Jun 1, 2026Updated 2 weeks ago
- ☆139Nov 5, 2023Updated 2 years ago
- Create a QnA bot on a pdf☆16May 27, 2023Updated 3 years ago
- ☆65Feb 22, 2023Updated 3 years ago
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search☆40Jun 6, 2023Updated 3 years ago
- 2020 Summer Olympics medals per million people☆12Aug 8, 2021Updated 4 years ago
- IETF L4S Deployment Design Recommendations☆20May 19, 2026Updated last month
- 🎸 Integrating AI plugins to LLMs☆228Sep 28, 2023Updated 2 years ago
- ☆14Aug 30, 2022Updated 3 years ago
- Retrieves data from FantasyPros.com☆11Oct 26, 2025Updated 7 months ago
- Run evaluation on LLMs using human-eval benchmark☆429Sep 12, 2023Updated 2 years ago
- A proof-of-concept illustration to show how LLMs could manage and organize files.☆241Jun 1, 2023Updated 3 years ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆24Jan 7, 2024Updated 2 years ago
- Small repository for my video on LoRA☆16May 14, 2023Updated 3 years ago
- A reinforcement learning framework based on MLX.☆257Dec 1, 2025Updated 6 months ago
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆20Nov 15, 2025Updated 7 months ago