Annmayn / html2excelLinks
Convert HTML tables to excel files
☆15Updated 4 years ago
Alternatives and similar repositories for html2excel
Users that are interested in html2excel are comparing it to the libraries listed below
Sorting:
- python bindings of cppjieba ,recommand jieba_fast for results consistency and speed balance☆22Updated 6 years ago
- ☆15Updated last year
- an unofficial code for augment-XY-CUT in XYLayoutLM☆30Updated 3 years ago
- A span-based joint named entity recognition (NER) and relation extraction model.☆11Updated 5 years ago
- Finetune Bloom big language model with Lora method☆32Updated 2 years ago
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa…☆94Updated last year
- An intelligent OCR to detect tables and pure text inside PDFs and obtaing a csv file and a txt from it☆15Updated 7 years ago
- Chinese Mathematical Formula Detection (MFD) Dataset 中文文档数学公式检测数据集☆34Updated 3 years ago
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom …☆45Updated 2 years ago
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 3 years ago
- Large-scale exact string matching tool☆17Updated 11 months ago
- Implementation of pQRNN in PyTorch☆46Updated 4 years ago
- super fast cpp implementation of longest common subsequence/substring☆72Updated 2 years ago
- Framework for information extraction from tables☆40Updated 6 years ago
- 🌳CED: Catalog Extraction from Documents☆16Updated 2 years ago
- Unsupervised tableQA and databaseQA on chinese finance question and tabular data☆13Updated 2 years ago
- 有一个通用实体关系事件抽取的任务,需要使用到UIE模框架,而且需要将起部署到昇腾310服务器上,因为UIE模型底层使用的是ernie3.0,但是目前paddle官方还不支持ernie3.0模型在昇腾310上部署,所以才有了以下的操作,主要过程是,先试用paddle训练处模型…☆20Updated 3 years ago
- Fine tuning of the Retrieval-Augmented Generation (RAG) with a custom knowledge source.☆13Updated 4 years ago
- 科大讯飞低资源多语种文本翻译挑战赛获奖方案☆27Updated 2 years ago
- ☆97Updated 3 years ago
- GLM (General Language Model)☆24Updated 3 years ago
- ☆70Updated 7 years ago
- This is a meta-model distilled from LLMs for information extraction. This is an intermediate checkpoint that can be well-transferred to a…☆28Updated 11 months ago
- AAAI'22-"CODE: Contrastive Pre-training with Adversarial Fine-tuning for Zero-shot Expert Linking."☆12Updated 4 years ago
- ☆69Updated 5 years ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆63Updated last year
- Grammatical Error Correction Based on Language Model(BERT, GPT-2), and Seq2Seq☆18Updated 6 years ago
- Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈 启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…☆40Updated 2 years ago
- ☆40Updated 5 years ago
- PDF table extraction☆10Updated 4 years ago