☆67Feb 4, 2026Updated 4 months ago
Alternatives and similar repositories for generative_data_prep
Users that are interested in generative_data_prep are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo contains the data preparation, tokenization, training and inference code for BLOOMChat. BLOOMChat is a 176 billion parameter mu…☆584Oct 10, 2023Updated 2 years ago
- ☆13Apr 30, 2024Updated 2 years ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆178Feb 28, 2024Updated 2 years ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆116Mar 22, 2023Updated 3 years ago
- ☆14Mar 28, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking☆13Feb 5, 2023Updated 3 years ago
- ☆12Jan 5, 2024Updated 2 years ago
- An autonomous orchestrator that unites and manages open-source devs for complex problems by faciliting synergy between multiple Discord s…☆33Sep 16, 2024Updated last year
- A Model Context Protocol server for Flux image generation, providing tools for image generation, manipulation, and control☆25Mar 25, 2026Updated 2 months ago
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation☆16Apr 23, 2025Updated last year
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆16Nov 11, 2024Updated last year
- ☆12Mar 4, 2025Updated last year
- A Chrome extension that generates binaural beats.☆29Aug 23, 2023Updated 2 years ago
- Simple Autogpt with tree of thoughts☆14May 25, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Nov 22, 2023Updated 2 years ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Jan 17, 2024Updated 2 years ago
- Pytorch implementation of the Gato paper from Deepmind☆12Feb 8, 2023Updated 3 years ago
- Video production for developers☆44May 1, 2026Updated last month
- Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…☆14Aug 18, 2023Updated 2 years ago
- Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding edge performance.☆21Nov 11, 2024Updated last year
- This is a holding area for apps that need to be updated for serialosc compatibility before movement into their own repository.☆10Mar 24, 2016Updated 10 years ago
- Implementaiton of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL2024 Findings)".☆28Feb 10, 2025Updated last year
- Low code framework to build and launch a crew of AI agents with shared state. See docs https://axcrew.dev☆43Mar 30, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Sep 22, 2024Updated last year
- Direct X game controller server/client written in Python☆10Jul 10, 2018Updated 7 years ago
- This repo explores how AMR to address tasks difficult for LLMs☆13Jan 15, 2024Updated 2 years ago
- ☆20Apr 8, 2025Updated last year
- ☆12Jul 7, 2022Updated 3 years ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- Generate textbook-quality synthetic LLM pretraining data☆508Oct 19, 2023Updated 2 years ago
- jQuery, React and Streamlit applications written by LLMs☆16Dec 24, 2023Updated 2 years ago
- Generate High Quality textual or multi-modal datasets with Agents☆18Jun 7, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems☆15Nov 11, 2023Updated 2 years ago
- Tascell: Backtrcking-based load balancing framework☆13Jan 1, 2026Updated 5 months ago
- Finetune any model on HF in less than 30 seconds☆57Apr 27, 2026Updated last month
- ☆13Oct 11, 2024Updated last year
- bumble bee transformer☆14Apr 19, 2021Updated 5 years ago
- ☆12Oct 28, 2023Updated 2 years ago
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆133Oct 16, 2024Updated last year