Convert all of libgen to high quality markdown
☆255Dec 13, 2023Updated 2 years ago
Alternatives and similar repositories for libgen_to_txt
Users that are interested in libgen_to_txt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Generate textbook-quality synthetic LLM pretraining data☆509Oct 19, 2023Updated 2 years ago
- Score LLM pretraining data with classifiers☆55Nov 2, 2023Updated 2 years ago
- A Comprehensive survey on business use cases of AI that help them thrive in the digital economy☆13Oct 7, 2020Updated 5 years ago
- QLoRA for Masked Language Modeling☆23Sep 11, 2023Updated 2 years ago
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.☆10Dec 3, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Hello world demonstration for Weblate☆14Jan 20, 2026Updated 2 months ago
- ☆45Oct 13, 2023Updated 2 years ago
- Multipack distributed sampler for fast padding-free training of LLMs☆206Aug 10, 2024Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- This is a AUTOSAR documents specific retriever based on LLM and RAG.☆16Nov 12, 2024Updated last year
- Command-line script for inferencing from models such as WizardCoder☆25Sep 6, 2023Updated 2 years ago
- Useful collection of webrat Textmate snippets meant for use with the RSpec Story and/or Cucumber bundles☆79Aug 7, 2009Updated 16 years ago
- Consider is a parser for the ThinkGear protocol used by NeuroSky devices (MindSet, BrainBand and others).☆16Apr 3, 2012Updated 13 years ago
- batched loras☆351Sep 6, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A proselint linter for use with Phabricator's arc command line tool.☆17Jun 17, 2016Updated 9 years ago
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated 2 months ago
- A bot to add citation data from OpenCitations to Wikidata☆12May 23, 2023Updated 2 years ago
- CROMER (CROss-document Main Events and entities Recognition), is a tool for cross-document coreference☆12Jan 14, 2015Updated 11 years ago
- Domain-specific language for mobile (web) applications☆16May 12, 2010Updated 15 years ago
- A simple node.js wrapper for Stanford CoreNLP.☆10Aug 7, 2014Updated 11 years ago
- ☆19Dec 2, 2023Updated 2 years ago
- Android TextMate Bundle☆17Mar 20, 2009Updated 17 years ago
- ☆13May 10, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- ☆198Feb 9, 2024Updated 2 years ago
- The OpenCitations metadata model: documents and other material.☆19Nov 20, 2025Updated 4 months ago
- Reimplementation of the task generation part from the Alpaca paper☆119Apr 4, 2023Updated 2 years ago
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆524Apr 22, 2024Updated last year
- ☆11Dec 10, 2022Updated 3 years ago
- 一个纯实验项目☆11Sep 13, 2011Updated 14 years ago
- ARCHIVED R Client for the Lagotto Altmetrics Platform☆15May 10, 2022Updated 3 years ago
- A Sublime Text 3 client for Vale Server.☆13Dec 7, 2020Updated 5 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels☆346Dec 16, 2024Updated last year
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆23Mar 16, 2025Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197May 6, 2024Updated last year
- ☆17Aug 28, 2025Updated 7 months ago
- ☆415Nov 2, 2023Updated 2 years ago
- Bayesian Visual Working Memory in Python.☆13Mar 28, 2020Updated 6 years ago
- Just a bunch of benchmark logs for different LLMs☆120Jul 28, 2024Updated last year