Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆124Jun 16, 2023Updated 2 years ago
Alternatives and similar repositories for landmark-attention-qlora
Users that are interested in landmark-attention-qlora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Landmark Attention: Random-Access Infinite Context Length for Transformers☆425Dec 20, 2023Updated 2 years ago
- A plugin for Oobabooga TextUI that allows you to search multiple search engines. Initially we're using Google API or DuckDuckGo.☆17Jun 4, 2023Updated 2 years ago
- ☆28Aug 30, 2023Updated 2 years ago
- Customizable implementation of the self-instruct paper.☆1,052Mar 7, 2024Updated 2 years ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆45Jun 13, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Efficient 3bit/4bit quantization of LLaMA models☆18May 18, 2023Updated 3 years ago
- Simple and fast server for GPTQ-quantized LLaMA inference☆24May 18, 2023Updated 3 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,922Sep 30, 2023Updated 2 years ago
- ☆166Jun 1, 2023Updated 2 years ago
- BabyAGI to run with locally hosted models using the API from https://github.com/oobabooga/text-generation-webui☆86May 6, 2023Updated 3 years ago
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization☆720Aug 13, 2024Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆145Oct 17, 2023Updated 2 years ago
- Buzz AI, aka gt-chat, is a fast and intuitive question-answering chatbot for Georgia Tech. Powered by Next.js, FastAPI, and OpenAI, it so…☆30Apr 13, 2023Updated 3 years ago
- Tune any FALCON in 4-bit☆463Sep 1, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆32May 25, 2023Updated 3 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- A simple batch file to make the oobabooga one click installer compatible with llama 4bit models and able to run on cuda☆21Mar 27, 2023Updated 3 years ago
- Makes llama.cpp easy to use.☆12May 14, 2025Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58May 20, 2023Updated 3 years ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆38May 14, 2024Updated 2 years ago
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA☆80Dec 15, 2023Updated 2 years ago
- ☆20Jan 24, 2024Updated 2 years ago
- ☆536Dec 1, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- High performance implementation of the WARP (SIGIR'25) retrieval engine.☆34May 21, 2026Updated last week
- Like system requirements lab but for LLMs☆31Jun 10, 2023Updated 2 years ago
- Tokun to can tokens☆18Jun 19, 2025Updated 11 months ago
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆70Nov 17, 2025Updated 6 months ago
- ☆415Nov 2, 2023Updated 2 years ago
- An Autonomous LLM Agent that runs on Wizcoder-15B☆335Oct 21, 2024Updated last year
- Fast fuzzy text search☆12May 16, 2023Updated 3 years ago
- Implementation of "Generative Agents: Interactive Simulacra of Human Behavior" paper with Guidance and Langchain. Full features and work …☆289Sep 4, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- SwiftLet is a lightweight Python framework for running open-source Large Language Models (LLMs) locally using safetensors☆29Aug 6, 2025Updated 9 months ago
- QLoRA for Masked Language Modeling☆24Sep 11, 2023Updated 2 years ago
- An extension for text-generation-webui by oobabooga. Adds options to keep tabs on page and to move extensions into a sidebar.☆23Sep 24, 2023Updated 2 years ago
- Self-evaluating interview for AI coders☆600Jun 21, 2025Updated 11 months ago
- Discord chatbot interface to train an LLM on user message history☆27Jun 9, 2023Updated 2 years ago
- Experimental sampler to make LLMs more creative☆31Aug 2, 2023Updated 2 years ago