Chat data cleaning, filtering and deduplication pipeline.
☆22Jul 25, 2023Updated 2 years ago
Alternatives and similar repositories for chat-data-pipeline
Users who are interested in chat-data-pipeline are comparing it to the libraries listed below.
- ☆12Mar 16, 2022Updated 3 years ago
- ☆16Dec 31, 2021Updated 4 years ago
- Copy objects from real life and directly paste them on a background image using only your phone's camera☆23Feb 10, 2026Updated 3 weeks ago
- Code base for internal reward models and PPO training☆24Oct 1, 2023Updated 2 years ago
- ☆29Feb 24, 2025Updated last year
- ☆37Sep 21, 2025Updated 5 months ago
- A bunch of LLaMa model investigations, including recreating generative agents (from the paper Generative Agents: Interactive Simulacra of…☆23May 31, 2023Updated 2 years ago
- Using Spectral Noise Gating (SNG) techniques to reduce background noise in streaming microphone input for enhanced vocal recognition☆25Dec 10, 2018Updated 7 years ago
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- Lightweight knowledge distillation pipeline☆28Nov 29, 2021Updated 4 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36May 1, 2024Updated last year
- Lottery Ticket Adaptation☆40Nov 20, 2024Updated last year
- Creates a CMM script that can be directly executed on Kaggle from an easy merge script☆14Jan 12, 2026Updated last month
- ☆13Nov 9, 2025Updated 4 months ago
- Nr. 1 ranked "Pitch Detector" on the web. Implemented with WebAssembly.☆11Mar 24, 2021Updated 4 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- Using large language models to maintain AI_CHANGELOG.md☆14Jul 15, 2024Updated last year
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner…☆18Feb 25, 2026Updated last week
- This repository provides the code for replicating the experiments in the paper "Building One-Shot Semi-supervised (BOSS) Learning up to F…☆36Aug 9, 2020Updated 5 years ago
- USB Hid handler for nodejs☆11Sep 30, 2022Updated 3 years ago
- Progressive Web App template for Scripture App Builder☆13Feb 28, 2026Updated last week
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Phase Vocoder and Wavelet Transform Implementation for Pitch Shifting a sound signal☆11Jul 27, 2020Updated 5 years ago
- Handlebars helper, alternative to built-in partials. Similar to handlebars-helper-partial, but this helper will allow wildcard (glob) pat…☆16Nov 10, 2014Updated 11 years ago
- Code for Massive-scale Decoding for Text Generation using Lattices☆44Jul 29, 2022Updated 3 years ago
- DataOps framework for Machine Learning projects.☆62May 4, 2023Updated 2 years ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Mar 24, 2023Updated 2 years ago
- Compare strings line by line.☆11Feb 14, 2025Updated last year
- Code for ACL 2024 findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming"☆10Apr 20, 2025Updated 10 months ago
- ☆16Feb 18, 2024Updated 2 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year
- A RESTful API server to control ChatdollKit-based AITuber 💬☆13Jan 14, 2025Updated last year
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 11 months ago
- Devcon Systems☆14Sep 8, 2018Updated 7 years ago
- OpenCV Sample Projects in Rust☆12Nov 27, 2021Updated 4 years ago
- ☆13Dec 28, 2022Updated 3 years ago
- Spin up (almost) any LLM locally!☆14Dec 4, 2023Updated 2 years ago
- msglm makes it a little easier to create messages for language models like Claude and OpenAI GPTs.☆14Jan 29, 2026Updated last month