distill large scale web page text
☆12Jul 29, 2023Updated 2 years ago
Alternatives and similar repositories for LLMDataDistill
Users that are interested in LLMDataDistill are comparing it to the libraries listed below
Sorting:
- cpp inference for EmotiVoice☆16Jan 1, 2024Updated 2 years ago
- SeaLion Client can help create project quickly☆53May 7, 2025Updated 10 months ago
- This repository lists papers, codes, and datasets in Biomedical Text Summarisation based on PLM☆23Oct 4, 2022Updated 3 years ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Jan 27, 2023Updated 3 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago
- ☆11May 24, 2024Updated last year
- ☆13Jun 16, 2021Updated 4 years ago
- Generate visual studio solution from a bazel workspace.☆13Jan 19, 2022Updated 4 years ago
- ☆11May 9, 2022Updated 3 years ago
- notes and codes about NLP☆25Jan 22, 2019Updated 7 years ago
- pymur is a Python interface to The Lemur Toolkit.☆19Sep 17, 2018Updated 7 years ago
- A variant of HTM where spatial and temporal pooling are accomplished with the same mechanism☆13Apr 11, 2015Updated 10 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- Using self-play to augment multi-turn text-to-SQL datasets☆11Oct 20, 2022Updated 3 years ago
- This is a C++ implementation of cocoapi bbox evaluation code.☆11Dec 9, 2021Updated 4 years ago
- WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World Scenario (COLING 2025)☆12Jan 5, 2025Updated last year
- EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023)☆34Jul 18, 2023Updated 2 years ago
- ☆15Sep 22, 2024Updated last year
- Chinese processing☆36Jan 29, 2014Updated 12 years ago
- Repository containing the website for the EMNLP 2023 conference☆17Feb 12, 2025Updated last year
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆23Jan 6, 2026Updated 2 months ago
- Matrix exponential in cuda for pytorch and tensorflow☆17Nov 26, 2018Updated 7 years ago
- Machine learning environment over MAME-supported games☆15Feb 27, 2026Updated 3 weeks ago
- ☆14Oct 7, 2023Updated 2 years ago
- ☆12Oct 23, 2018Updated 7 years ago
- Analyzing deviation from orthogonality in RNNs☆16Oct 30, 2017Updated 8 years ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- [NeurIPS 2022]MorphTE: Injecting Morphology in Tensorized Embeddings☆17Oct 29, 2022Updated 3 years ago
- Rust wrapper for Paddle Inference.☆11May 22, 2021Updated 4 years ago
- ☆12Feb 22, 2021Updated 5 years ago
- ☆14Nov 14, 2022Updated 3 years ago
- A curated list of vision-and-language pre-training (VLP). :-)☆62Jul 6, 2022Updated 3 years ago
- async compile on neovim☆12Aug 17, 2023Updated 2 years ago
- ☆13Sep 2, 2021Updated 4 years ago
- ☆15Aug 18, 2022Updated 3 years ago
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Jul 16, 2022Updated 3 years ago
- LambdaFM: Learning Optimal Ranking with Factorization Machines Using Lambda Surrogates☆18Aug 17, 2019Updated 6 years ago
- MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…☆13Jan 16, 2024Updated 2 years ago
- C++ model train&inference framework☆223Dec 25, 2019Updated 6 years ago