Chat data cleaning, filtering and deduplication pipeline.
☆22 Jul 25, 2023 Updated 2 years ago
Alternatives and similar repositories for chat-data-pipeline
Users that are interested in chat-data-pipeline are comparing it to the libraries listed below.
- Code base for internal reward models and PPO training ☆24 Oct 1, 2023 Updated 2 years ago
- ☆12 Mar 16, 2022 Updated 4 years ago
- Copy objects from real life and directly paste them on a background image using only your phone's camera ☆23 Feb 10, 2026 Updated 4 months ago
- Fullstack machine learning inference template ☆31 Nov 24, 2023 Updated 2 years ago
- ☆16 Dec 31, 2021 Updated 4 years ago
- A bunch of LLaMa model investigations, including recreating generative agents (from the paper Generative Agents: Interactive Simulacra of… ☆23 May 31, 2023 Updated 3 years ago
- Code for Massive-scale Decoding for Text Generation using Lattices ☆44 Jul 29, 2022 Updated 3 years ago
- [ECCV22] BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering (Jittor) ☆11 Sep 16, 2022 Updated 3 years ago
- Tools for content datamining and NLP at scale ☆45 Jun 20, 2024 Updated last year
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best … ☆10 Nov 3, 2023 Updated 2 years ago
- Utility for React components to easily subscribe to Mutant streams ☆13 Dec 9, 2017 Updated 8 years ago
- A Rust implementation of the Handshake and Lightning Network secure messaging protocol - based on Noise. ☆14 Dec 9, 2019 Updated 6 years ago
- 👜 Callbag listener sink that receives data from any listenable source ☆14 Feb 6, 2018 Updated 8 years ago
- A tiny server to run local inference on MLX model in the style of OpenAI ☆13 Jan 31, 2024 Updated 2 years ago
- 👜 Utility function for plugging callbags together in chain ☆16 Feb 9, 2019 Updated 7 years ago
- DataOps framework for Machine Learning projects. ☆61 May 4, 2023 Updated 3 years ago
- ☆29 Feb 24, 2025 Updated last year
- Training a reward model for RLHF using RWKV. ☆15 Jun 5, 2023 Updated 3 years ago
- Lottery Ticket Adaptation ☆40 Nov 20, 2024 Updated last year
- Handlebars helper, alternative to built-in partials. Similar to handlebars-helper-partial, but this helper will allow wildcard (glob) pat… ☆16 Nov 10, 2014 Updated 11 years ago
- Lightweight knowledge distillation pipeline ☆28 Nov 29, 2021 Updated 4 years ago
- Train Llama Loras Easily ☆31 Aug 3, 2023 Updated 2 years ago
- A game for experimenting with sensorimotor AI. ☆16 May 9, 2014 Updated 12 years ago
- ☆15 Jul 6, 2018 Updated 7 years ago
- Complete Hashistack including TFE, Terraform, Consul, Vault, Nomad, Packer all in a single packer manifest. Builds in parallel on Qemu, … ☆14 Jul 18, 2022 Updated 3 years ago
- A Vote-and-Verify Strategy for Fast Spatial Verification in Image Retrieval ☆19 Jun 7, 2017 Updated 9 years ago
- NOTE: this project is quite old. I won't be maintaining it anymore, but it should still work :-) ☆10 Apr 17, 2015 Updated 11 years ago
- 📑 Collection of smart contracts (mostly Ethereum) for reference and learning. ☆12 Feb 17, 2022 Updated 4 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆63 Jun 21, 2023 Updated 2 years ago
- Extension providing a theme editor where colors, font families and font sizes of the elements of the user interface can be varied ☆23 Jul 9, 2024 Updated last year
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations. ☆12 Feb 11, 2024 Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference ☆36 May 1, 2024 Updated 2 years ago
- ☆19 Jan 3, 2023 Updated 3 years ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental! ☆43 Mar 24, 2023 Updated 3 years ago
- A Docker container to renew my certificates and store them in Vault ☆15 Sep 20, 2019 Updated 6 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron` ☆12 Jun 22, 2022 Updated 3 years ago
- Fast whitespace correction with Transformers ☆17 Aug 22, 2025 Updated 9 months ago
- Code for ACL 2024 findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming" ☆12 Apr 21, 2026 Updated last month
- ChatGPT-like Web UI for RWKVstic ☆19 Apr 23, 2023 Updated 3 years ago