A powerful, high-performance CLI tool for removing duplicate lines from text files with advanced comparison options and parallel processing capabilities.
☆10Apr 16, 2025Updated 11 months ago
Alternatives and similar repositories for DupeRemover
Users that are interested in DupeRemover are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆25Dec 12, 2025Updated 4 months ago
- gRelay is an open source project written in Go that provides the circuit break pattern with a relay idea behind.☆31Sep 1, 2022Updated 3 years ago
- ☆17Aug 2, 2023Updated 2 years ago
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆41Dec 13, 2024Updated last year
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Dec 27, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆54Oct 24, 2024Updated last year
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆53Oct 19, 2024Updated last year
- ☆187Jul 2, 2025Updated 9 months ago
- LLM Unlearning☆184Oct 20, 2023Updated 2 years ago
- A project to improve skills of large language models☆918Updated this week
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆643Mar 4, 2024Updated 2 years ago
- LongBench v2 and LongBench (ACL 25'&24')☆1,148Jan 15, 2025Updated last year
- A reading list on LLM based Synthetic Data Generation 🔥☆1,532Jun 5, 2025Updated 10 months ago
- Goji is a minimalistic web framework for Golang that's high in antioxidants.☆3,644Oct 27, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,983Updated this week
- Package macaron is a high productive and modular web framework in Go.☆3,556Feb 16, 2026Updated last month
- ☆4,109Jun 4, 2024Updated last year
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,755Jul 18, 2025Updated 8 months ago
- Efficient cache for gigabytes of data written in Go.☆8,119Feb 6, 2026Updated 2 months ago
- ☆6,287Dec 12, 2025Updated 4 months ago
- A curated list of data engineering tools for software developers☆8,502Apr 5, 2026Updated last week
- A framework for few-shot evaluation of language models.☆12,138Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆12,605Updated this week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A curated list to learn about distributed systems☆11,758Jan 10, 2025Updated last year
- A curated list of awesome big data frameworks, ressources and other awesomeness.☆14,321Feb 5, 2026Updated 2 months ago
- Highly available Prometheus setup with long term storage capabilities. A CNCF Incubating project.☆14,017Apr 8, 2026Updated last week
- 🕵️♂ ️ All-in-one OSINT tool for analysing any website☆32,780Apr 8, 2026Updated last week
- Markdown for the component era☆19,392Updated this week
- A curated list of awesome Competitive Programming, Algorithm and Data Structure resources☆13,840Dec 8, 2024Updated last year
- Vitess is a database clustering system for horizontal scaling of MySQL.☆20,898Updated this week
- The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lak…☆21,058Updated this week
- lightweight, idiomatic and composable router for building Go HTTP services☆21,952Feb 19, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A fast, local first, reactive Database for JavaScript Applications https://rxdb.info/☆23,137Updated this week
- Roadmap to becoming a Go developer in 2020☆18,431Feb 13, 2023Updated 3 years ago
- A realtime distributed messaging platform☆25,882Jul 13, 2025Updated 9 months ago
- A Go microservices framework☆22,727Updated this week
- Curated List of React Components & Libraries.☆47,253Jan 26, 2026Updated 2 months ago
- Your ultimate Go microservices framework for the cloud-native era.☆25,590Apr 4, 2026Updated last week
- Scalable datastore for metrics, events, and real-time analytics☆31,397Updated this week