A lightweight script for processing HTML page to markdown format with support for code blocks
☆82Apr 14, 2024Updated 2 years ago
Alternatives and similar repositories for code-html-to-markdown
Users that are interested in code-html-to-markdown are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Scrape the webpage convert it into Markdown, and enhance AI search applications.☆257May 11, 2024Updated last year
- ☆233Mar 7, 2024Updated 2 years ago
- Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion☆13Jul 26, 2023Updated 2 years ago
- ☆34Mar 21, 2026Updated last month
- Learning Semantic Parsers from Denotations with Latent Structured Alignments and Abstract Programs(EMNLP2019)☆19Dec 3, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Trending projects & awesome papers about data-centric llm studies.☆40May 20, 2025Updated 11 months ago
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆24Oct 10, 2023Updated 2 years ago
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"☆89May 11, 2023Updated 2 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- Code for the curation of The Stack v2 and StarCoder2 training data☆132Apr 11, 2024Updated 2 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆23May 1, 2022Updated 4 years ago
- Simple and scalable tools for data-driven pretraining data selection.☆29Jun 9, 2025Updated 10 months ago
- code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D…☆12Sep 16, 2022Updated 3 years ago
- ☆12Oct 10, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆15Mar 12, 2024Updated 2 years ago
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Feb 1, 2023Updated 3 years ago
- [ICML'24] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks☆33Sep 20, 2024Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 4 months ago
- ☆12Jul 10, 2023Updated 2 years ago
- ☆20Apr 16, 2025Updated last year
- CHASE is a large-scale and pragmatic Chinese dataset for cross-database context-dependent text-to-SQL task (natural language interfaces f…☆10May 7, 2021Updated 4 years ago
- ☆984Feb 7, 2025Updated last year
- Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering☆30Dec 2, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".☆66Apr 18, 2023Updated 3 years ago
- [EMNLP-2024] ⚓️ Sailor: Open Language Models for South-East Asia☆138Dec 21, 2024Updated last year
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆109Dec 16, 2025Updated 4 months ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆20Mar 9, 2025Updated last year
- [ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs☆18May 21, 2025Updated 11 months ago
- "Few-shot In-context Learning for Knowledge Base Question Answering" [ACL2023]☆67Jan 27, 2025Updated last year
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Nov 2, 2021Updated 4 years ago
- An attempt to replicate the neural programmer work [Neelakantan et al 2016, 2017] using techniques for learning probability distributions…☆13Jun 7, 2017Updated 8 years ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆556Oct 28, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- https://arxiv.org/abs/2404.10917☆14Mar 18, 2025Updated last year
- 🔱 Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs☆71Mar 21, 2025Updated last year
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆126May 6, 2025Updated 11 months ago
- TAT-QA (Tabular And Textual dataset for Question Answering) contains 16,552 questions associated with 2,757 hybrid contexts from real-wor…☆130Dec 9, 2024Updated last year
- Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"☆19May 25, 2023Updated 2 years ago
- ☆31Sep 4, 2021Updated 4 years ago
- ☆13Dec 12, 2025Updated 4 months ago