a set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for ease of use with any trainer
☆19Mar 14, 2025Updated last year
Alternatives and similar repositories for Dataset-Conversion-Toolkit
Users that are interested in Dataset-Conversion-Toolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Project code for training LLMs to write better unit tests + code☆21May 19, 2025Updated 11 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- An OpenAI API compatible moderations server for checking whether text is potentially harmful.☆10Mar 23, 2024Updated 2 years ago
- A Python client for nREPL, the Clojure network REPL☆11Apr 1, 2018Updated 8 years ago
- Python interface to Logitech Mediaserver / Squeezeserver☆11Oct 27, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A music spectrum analyser and visualisation program for squeezelite☆14Jan 30, 2023Updated 3 years ago
- Analytics scripts☆11Mar 22, 2019Updated 7 years ago
- ☆10Oct 24, 2024Updated last year
- Gives stats of your play on pokernow.club given the logs.☆15Jul 22, 2022Updated 3 years ago
- ☆13Jun 29, 2024Updated last year
- No Language Left Unlocked: scalable backtranslation of NLLB models☆14Aug 4, 2025Updated 8 months ago
- An LLM-enchanced Infocom Experience☆23Apr 19, 2025Updated last year
- Very minimal (and stateless) agent framework☆44Jan 12, 2025Updated last year
- A python interface for controlling Logitech Squeezeboxes via the SqueezeboxServer☆15Jun 13, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆28Aug 30, 2023Updated 2 years ago
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models☆79Dec 17, 2024Updated last year
- Flexible genetic algorithms in Clojure☆23May 27, 2020Updated 5 years ago
- ☆17Feb 1, 2024Updated 2 years ago
- Interactive Brokers' Trader Workstation (TWS) running in Docker☆20Dec 21, 2024Updated last year
- LoRa chip into a coherent linear-FM chirp generator☆89Updated this week
- Understanding the correlation between different LLM benchmarks☆29Jan 11, 2024Updated 2 years ago
- Clojure-friendly wrapper for InteractiveBrokers java API☆15Feb 18, 2018Updated 8 years ago
- ☆23Dec 15, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- My Gen AI research☆11Jun 3, 2024Updated last year
- This is the repo for CROssBARv2 Knowledge Graph data. CROssBARv2 is a heterogeneous general-purpose biomedical KG-based system.☆11Feb 4, 2026Updated 2 months ago
- A daily benchmark to regression-test cloud LLMs☆19Aug 7, 2025Updated 8 months ago
- A simple mcp server that lets your AI agent send emails and attach files through SMTP.☆27Feb 17, 2026Updated 2 months ago
- Analysis code for knowledge discovery project☆12Sep 25, 2018Updated 7 years ago
- 🚀 Real-time 3D visualization of NASA's Artemis II mission — Three.js + real JPL Horizons trajectory data☆54Apr 6, 2026Updated 3 weeks ago
- ☆15Apr 2, 2025Updated last year
- simple ansible playbook to take clean ubuntu 18.04 to CUDA 10, PyTorch 1.0, fastai, miniconda heaven☆12Dec 16, 2018Updated 7 years ago
- X Developer Challenge☆12Apr 25, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Use Hermes-2-Pro-Mistral-7B function calling with your OpenAI API compatible code.☆18May 7, 2024Updated last year
- Using modal.com to process FineWeb-edu data☆20Apr 11, 2026Updated 2 weeks ago
- A harness for small llms☆75Updated this week
- A Clojure library of optimisation and control theory tools and convenience functions based on Neanderthal.☆26Sep 25, 2020Updated 5 years ago
- AI Agents Workshop with Red Hat AI☆13Feb 26, 2025Updated last year
- Visualization of the full depth of the order book along time☆21Dec 17, 2019Updated 6 years ago
- A search engine implementation using OpenAI's clip model☆10Jun 20, 2021Updated 4 years ago