clean up your LLM datasets
☆113May 30, 2023Updated 3 years ago
Alternatives and similar repositories for ambrosia
Users that are interested in ambrosia are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Sep 10, 2023Updated 2 years ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆32May 29, 2023Updated 3 years ago
- ☆45Oct 13, 2023Updated 2 years ago
- Collection of various text datasets to assist ML researchers in training or fine-tuning their models☆21Apr 1, 2023Updated 3 years ago
- ☆21Aug 27, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A Simple Discord Bot for the Alpaca LLM☆102Jun 22, 2023Updated 3 years ago
- A library for squeakily cleaning and filtering language datasets.☆50Jul 10, 2023Updated 2 years ago
- Full finetuning of large language models without large memory requirements☆93Sep 22, 2025Updated 9 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 3 years ago
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,665Sep 15, 2023Updated 2 years ago
- For Python: analyses the files in a directory, finds the filenames and function calls, and uses GPT to create an explanation of what they…☆15Apr 17, 2023Updated 3 years ago
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- Customizable implementation of the self-instruct paper.☆1,052Mar 7, 2024Updated 2 years ago
- Go bindings for Langchain AI☆13Apr 11, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A discord bot that roleplays!☆157Sep 25, 2023Updated 2 years ago
- Image Diffusion block merging technique applied to transformers based Language Models.☆56May 8, 2023Updated 3 years ago
- data cleaning and curation for unstructured text☆330Aug 6, 2024Updated last year
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆42Sep 14, 2023Updated 2 years ago
- ☆416Nov 2, 2023Updated 2 years ago
- ☆50Mar 14, 2024Updated 2 years ago
- Just a bunch of benchmark logs for different LLMs☆130Jul 28, 2024Updated last year
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 3 years ago
- ☆20Jul 12, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Designing a Dashboard for Transparency and Control of Conversational AI, https://arxiv.org/abs/2406.07882☆39Oct 7, 2025Updated 8 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Aug 27, 2023Updated 2 years ago
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Feb 12, 2024Updated 2 years ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆280Jan 10, 2026Updated 5 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Mar 12, 2024Updated 2 years ago
- A public release of TimelineBuilder for building personal digital data timelines.☆371Sep 3, 2024Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆80May 30, 2026Updated last month
- assign color hues to a collection of text fragments based on embeddings☆20Jun 15, 2024Updated 2 years ago
- private repo for nonfiction drafting☆17Oct 24, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…☆13Jun 24, 2024Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- ☆63Apr 12, 2026Updated 2 months ago
- Rust source code for all 650 leetcode hard algorithmic problems available with no subscription☆18Jul 6, 2025Updated 11 months ago
- Automatically research and outbound companies with Exa API and google sheets app scripts.☆18Jun 24, 2024Updated 2 years ago
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆39Feb 21, 2026Updated 4 months ago
- Merge Transformers language models by use of gradient parameters.☆214Aug 8, 2024Updated last year