QuixiAI/kraken

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/QuixiAI/kraken)

QuixiAI / kraken

☆69

Alternatives and similar repositories for kraken

Users that are interested in kraken are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

QuixiAI / laserRMT
View on GitHub
This is our own implementation of 'Layer Selective Rank Reduction'
☆240May 26, 2024Updated 2 years ago
serp-ai / Parameter-Efficient-MoE
View on GitHub
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31May 22, 2024Updated 2 years ago
enjalot / latent-data-modal
View on GitHub
Using modal.com to process FineWeb-edu data
☆20Apr 11, 2026Updated 3 months ago
serp-ai / unsloth
View on GitHub
5X faster 60% less memory QLoRA finetuning
☆21May 28, 2024Updated 2 years ago
QuixiAI / spectrum
View on GitHub
☆145Aug 20, 2025Updated 11 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
swj0419 / detect-pretrain-code-contamination
View on GitHub
☆78Dec 26, 2023Updated 2 years ago
fblgit / model-similarity
View on GitHub
Simple Model Similarities Analysis
☆21Feb 3, 2024Updated 2 years ago
QuixiAI / generate
View on GitHub
☆27Mar 13, 2024Updated 2 years ago
QuixiAI / OpenChatML
View on GitHub
☆166Aug 8, 2025Updated 11 months ago
QuixiAI / grokadamw
View on GitHub
☆137Aug 19, 2024Updated last year
r-three / phatgoose
View on GitHub
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆93Feb 27, 2024Updated 2 years ago
louisbrulenaudet / ragoon
View on GitHub
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆70Nov 17, 2025Updated 8 months ago
misbahsy / XQuotes
View on GitHub
☆13Jun 29, 2024Updated 2 years ago
YanniKouloumbis / next-js-window-ai
View on GitHub
A Next.js chatbot app demonstrating seamless integration with window.ai.
☆15Jun 25, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
offskiies / KB_builder
View on GitHub
Build your own custom knowledge base from various sources such as youtube videos transcripts, tweets, articles, videos and audios. Uses G…
☆13Dec 15, 2023Updated 2 years ago
ariG23498 / timm-wrapper-examples
View on GitHub
Notebooks to demonstrate TimmWrapper
☆17Jan 16, 2025Updated last year
zhuzilin / faster-nougat
View on GitHub
Implementation of nougat that focuses on processing pdf locally.
☆85Jan 15, 2025Updated last year
UCDvision / NOLA
View on GitHub
Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"
☆59Aug 25, 2024Updated last year
thomasgauthier / LoRD
View on GitHub
Low-Rank adapter extraction for fine-tuned transformers models
☆181May 2, 2024Updated 2 years ago
mustafaaljadery / mlxcli
View on GitHub
Run large models from the terminal using Apple MLX.
☆32Mar 18, 2024Updated 2 years ago
michaelfeil / embed
View on GitHub
A stable, fast and easy-to-use inference library with a focus on a sync-to-async API
☆48Sep 26, 2024Updated last year
IIMunchII / restllm
View on GitHub
REST API for Large Language Models using FastAPI, Redis and LiteLLM
☆14Nov 30, 2023Updated 2 years ago
bdambrosio / AllTheWorldAPlay
View on GitHub
All the world is a play, we are but actors in it.
☆51Jul 21, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
sebastianschramm / fastapi_hf_endpoints
View on GitHub
Custom fastapi server packaged as docker image for Huggingface inference endpoints deployment
☆13Apr 17, 2024Updated 2 years ago
QuixiAI / dolphinflow-optimizer
View on GitHub
☆21Jun 8, 2025Updated last year
TroyDoesAI / AI_Research
View on GitHub
My Gen AI research
☆11Jun 3, 2024Updated 2 years ago
jmtomczak / vae_kan_example
View on GitHub
A simple example of VAEs with KANs
☆12May 17, 2024Updated 2 years ago
davidBelanger / torch-util
View on GitHub
utility code for doing deep nlp in torch
☆17May 16, 2017Updated 9 years ago
RBigData / launchr
View on GitHub
Launch a distributed R server on a cluster from a remote R session
☆12Apr 23, 2022Updated 4 years ago
h4shk4t / rlm
View on GitHub
General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.
☆17Apr 26, 2026Updated 3 months ago
wenqiglantz / text-embedding-inference-server-edd
View on GitHub
Experimenting text-embeddings-inference server on both CPU and GPU
☆18Oct 25, 2023Updated 2 years ago
arcee-ai / mergekit
View on GitHub
Tools for merging pretrained large language models.
☆7,261Jun 17, 2026Updated last month
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Technoculture / personal-graph
View on GitHub
Simple Graph Memory for AI applications
☆105Feb 23, 2026Updated 5 months ago
Alignment-Lab-AI / Dataset-Conversion-Toolkit
View on GitHub
a set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for eas…
☆20Mar 14, 2025Updated last year
therohk / opencyc-kb
View on GitHub
OpenCyc Ontology or Knowledge Base Data Files
☆21Jan 14, 2022Updated 4 years ago
AI-ANK / Airbnb-Listing-Explorer
View on GitHub
☆29Apr 29, 2024Updated 2 years ago
nickaggarwal / nvidia-triton-llm-streaming
View on GitHub
Integrating SSE with NVIDIA Triton Inference Server using a Python backend and Zephyr model. There is very less documentation how to use …
☆10May 29, 2024Updated 2 years ago
nnance / llamacpp-ai-provider
View on GitHub
Vercel AI Provider for running Large Language Models locally using LLamaCpp
☆30May 6, 2024Updated 2 years ago
jwjohns / LFM2Sloth
View on GitHub
Modular task agnostic training pipeline using LFM2 from Liquid AI with unsloth.
☆16Sep 13, 2025Updated 10 months ago