Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆125Jun 16, 2023Updated 2 years ago
Alternatives and similar repositories for landmark-attention-qlora
Users that are interested in landmark-attention-qlora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Landmark Attention: Random-Access Infinite Context Length for Transformers☆426Dec 20, 2023Updated 2 years ago
- A plugin for Oobabooga TextUI that allows you to search multiple search engines. Initially we're using Google API or DuckDuckGo.☆16Jun 4, 2023Updated 2 years ago
- ☆27Aug 30, 2023Updated 2 years ago
- Customizable implementation of the self-instruct paper.☆1,052Mar 7, 2024Updated 2 years ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆45Jun 13, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Efficient 3bit/4bit quantization of LLaMA models☆18May 18, 2023Updated 2 years ago
- Simple and fast server for GPTQ-quantized LLaMA inference☆24May 18, 2023Updated 2 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,912Sep 30, 2023Updated 2 years ago
- ☆167Jun 1, 2023Updated 2 years ago
- BabyAGI to run with locally hosted models using the API from https://github.com/oobabooga/text-generation-webui☆86May 6, 2023Updated 2 years ago
- BabyAGI to run with GPT4All☆247May 14, 2023Updated 2 years ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆109Jul 29, 2023Updated 2 years ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆145Oct 17, 2023Updated 2 years ago
- Buzz AI, aka gt-chat, is a fast and intuitive question-answering chatbot for Georgia Tech. Powered by Next.js, FastAPI, and OpenAI, it so…☆30Apr 13, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Tune any FALCON in 4-bit☆463Sep 1, 2023Updated 2 years ago
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆32May 25, 2023Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- A simple batch file to make the oobabooga one click installer compatible with llama 4bit models and able to run on cuda☆21Mar 27, 2023Updated 3 years ago
- Patch for MPT-7B which allows using and training a LoRA☆58May 20, 2023Updated 2 years ago
- ☆20Jan 24, 2024Updated 2 years ago
- ☆535Dec 1, 2023Updated 2 years ago
- Like system requirements lab but for LLMs☆31Jun 10, 2023Updated 2 years ago
- Tokun to can tokens☆18Jun 19, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆70Nov 17, 2025Updated 4 months ago
- ☆415Nov 2, 2023Updated 2 years ago
- An Autonomous LLM Agent that runs on Wizcoder-15B☆335Oct 21, 2024Updated last year
- Implementation of "Generative Agents: Interactive Simulacra of Human Behavior" paper with Guidance and Langchain. Full features and work …☆287Sep 4, 2023Updated 2 years ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- SwiftLet is a lightweight Python framework for running open-source Large Language Models (LLMs) locally using safetensors☆28Aug 6, 2025Updated 7 months ago
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,158Mar 21, 2026Updated last week
- An extension for text-generation-webui by oobabooga. Adds options to keep tabs on page and to move extensions into a sidebar.☆23Sep 24, 2023Updated 2 years ago
- Self-evaluating interview for AI coders☆601Jun 21, 2025Updated 9 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Discord chatbot interface to train an LLM on user message history☆27Jun 9, 2023Updated 2 years ago
- Experimental sampler to make LLMs more creative☆31Aug 2, 2023Updated 2 years ago
- LLM that combines the principles of wizardLM and vicunaLM☆716May 31, 2023Updated 2 years ago
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,476Jun 7, 2025Updated 9 months ago
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 6 months ago
- Discord bot for [oobabooga webui](https://github.com/oobabooga/text-generation-webui)☆12Mar 21, 2025Updated last year