☆69May 26, 2024Updated 2 years ago
Alternatives and similar repositories for kraken
Users that are interested in kraken are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is our own implementation of 'Layer Selective Rank Reduction'☆240May 26, 2024Updated 2 years ago
- Extract a single expert from a Mixture Of Experts model using slerp interpolation.☆19May 26, 2024Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated 2 years ago
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated 2 years ago
- ☆78Dec 26, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Simple Model Similarities Analysis☆21Feb 3, 2024Updated 2 years ago
- ☆167Aug 8, 2025Updated 10 months ago
- ☆138Aug 19, 2024Updated last year
- ☆13Jun 29, 2024Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆70Nov 17, 2025Updated 6 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆92Feb 27, 2024Updated 2 years ago
- Tools for formatting large language model prompts.☆13Dec 19, 2023Updated 2 years ago
- ☆21Jun 8, 2025Updated last year
- A Next.js chatbot app demonstrating seamless integration with window.ai.☆15Jun 25, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- various experiments for scaling inference time compute with small reasoning models☆17Jan 16, 2025Updated last year
- OpenCyc Ontology or Knowledge Base Data Files☆18Jan 14, 2022Updated 4 years ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆48Sep 26, 2024Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆181May 2, 2024Updated 2 years ago
- My Gen AI research☆11Jun 3, 2024Updated 2 years ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆59Aug 25, 2024Updated last year
- REST API for Large Language Models using FastAPI, Redis and LiteLLM☆14Nov 30, 2023Updated 2 years ago
- All the world is a play, we are but actors in it.☆50Jul 21, 2025Updated 10 months ago
- Attend - to what matters.☆17Feb 22, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A powerful MCP memory using a knowledge graph powered by elastic search☆19Apr 1, 2026Updated 2 months ago
- A miniature version of Modal☆24Jun 11, 2024Updated 2 years ago
- a set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for eas…☆20Mar 14, 2025Updated last year
- Experimenting text-embeddings-inference server on both CPU and GPU☆18Oct 25, 2023Updated 2 years ago
- Simple Graph Memory for AI applications☆104Feb 23, 2026Updated 3 months ago
- Tools for merging pretrained large language models.☆7,126May 6, 2026Updated last month
- ☆29Apr 29, 2024Updated 2 years ago
- An embeddable widget for interacting with openAI api compatable LLM's☆15Sep 18, 2024Updated last year
- Chrome Extension for exploring Hugging Face datasets 🔎☆48Sep 18, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Luber : A ridesharing App☆14Dec 13, 2017Updated 8 years ago
- Vercel AI Provider for running Large Language Models locally using LLamaCpp☆30May 6, 2024Updated 2 years ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆257Oct 30, 2024Updated last year
- Extension for SillyTavern, adds extra settings and context control for NovelAI's Clio and Kayra models.☆19Oct 27, 2025Updated 7 months ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16Updated this week
- Using fourier interpolation to merge large language models☆11Jan 6, 2026Updated 5 months ago
- An Open Source Toolkit For LLM Distillation☆959May 12, 2026Updated last month