Chat data cleaning, filtering and deduplication pipeline.
☆22Jul 25, 2023Updated 2 years ago
Alternatives and similar repositories for chat-data-pipeline
Users that are interested in chat-data-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Mar 16, 2022Updated 4 years ago
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset☆31Apr 8, 2025Updated last year
- Using Spectral Noise Gating (SNG) techniques to reduce background noise in streaming microphone input for enhanced vocal recognition☆25Dec 10, 2018Updated 7 years ago
- ☆16Dec 31, 2021Updated 4 years ago
- A bunch of LLaMa model investigations, including recreating generative agents (from the paper Generative Agents: Interactive Simulacra of…☆23May 31, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for Massive-scale Decoding for Text Generation using Lattices☆44Jul 29, 2022Updated 3 years ago
- ☆36Dec 6, 2022Updated 3 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Utility for React components to easily subscribe to Mutant streams☆13Dec 9, 2017Updated 8 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Apr 6, 2026Updated last month
- A Rust implementation of the Handshake and Lightning Network secure messaging protocol - based on Noise.☆12Dec 9, 2019Updated 6 years ago
- A tiny server to run local inference on MLX model in the style of OpenAI☆13Jan 31, 2024Updated 2 years ago
- Get CPU usage percentage of own process☆18Jan 18, 2020Updated 6 years ago
- Reboot with METAVERSE SEED DEV KIT - Build your own Metaverse! Coming soon, early in April 2024☆15May 17, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- USB Hid handler for nodejs☆11Sep 30, 2022Updated 3 years ago
- DataOps framework for Machine Learning projects.☆61May 4, 2023Updated 3 years ago
- Send @dtinth an encrypted message☆13Dec 3, 2025Updated 5 months ago
- ☆29Feb 24, 2025Updated last year
- 2D HTML Canvas bindings to ECSY, entity component system for the web☆15Jul 11, 2023Updated 2 years ago
- Awesome links that have been placed in Angular Developer Thailand Facebook group☆13Mar 2, 2017Updated 9 years ago
- A simple game inspired by Battle City☆14Apr 8, 2026Updated last month
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 2 years ago
- Handlebars helper, alternative to built-in partials. Similar to handlebars-helper-partial, but this helper will allow wildcard (glob) pat…☆16Nov 10, 2014Updated 11 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Lightweight knowledge distillation pipeline☆28Nov 29, 2021Updated 4 years ago
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- Render diagrams from your kubernetes manifests☆14Nov 24, 2025Updated 5 months ago
- ☆15Jul 6, 2018Updated 7 years ago
- L1TTLE PAWS - Arcade physics platformer with procedural art and levels for JS13K!☆23Jan 18, 2026Updated 3 months ago
- Stateless CLI tool to easily pin CAR files to IPFS pinning services. Client for the IPFS Pinning Service API that speaks HTTP and Bitswap…☆16Dec 15, 2023Updated 2 years ago
- Complete Hashistack including TFE, Terraform, Consul, Vault, Nomad, Packer all in a single packer manifest. Builds in parallel on Qemu, …☆14Jul 18, 2022Updated 3 years ago
- A simple, interactive, block-based, color-coded, music-synchronized, transposale, optionally auto-scrolling chordbook web application.☆17Aug 8, 2021Updated 4 years ago
- A simple tool to help save game information in a cleaner format, instead of the game object, that can be accessed anywhere in the game. A…☆10Feb 19, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- NOTE: this project is quite old. I won't be maintaining it anymore, but it should still work :-)☆10Apr 17, 2015Updated 11 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- Compare strings line by line.☆11Feb 14, 2025Updated last year
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Mar 29, 2023Updated 3 years ago
- ☆41Sep 21, 2025Updated 7 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36May 1, 2024Updated 2 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago