☆98Mar 20, 2024Updated 2 years ago
Alternatives and similar repositories for RefGPT
Users that are interested in RefGPT are comparing it to the libraries listed below.
- ☆164Apr 17, 2023Updated 3 years ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆415Jun 25, 2025Updated 11 months ago
- ☆15Nov 24, 2020Updated 5 years ago
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models☆118Jun 12, 2025Updated 11 months ago
- A very handy toolkit that can be installed and used directly☆21Mar 18, 2022Updated 4 years ago
- The code of CIKM 2023 (Oral Presentation) : A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NE…☆14Jul 19, 2024Updated last year
- The newest version of Llama 3, with source code explained line by line in Chinese☆22Apr 19, 2024Updated 2 years ago
- URS Benchmark: Evaluating LLMs on User Reported Scenarios☆31May 30, 2025Updated last year
- ☆12Jun 5, 2024Updated 2 years ago
- Distilled Self-Critique refines the outputs of an LLM using only synthetic data☆11Apr 11, 2024Updated 2 years ago
- Codebase for DualEnc (ACL-20)☆22Oct 3, 2023Updated 2 years ago
- ☆70Apr 14, 2023Updated 3 years ago
- The Bytepiece Tokenizer Implemented in Rust.☆14Nov 28, 2023Updated 2 years ago
- MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering☆14May 3, 2024Updated 2 years ago
- Repo for the question-in-context rewriting baseline presented in Elgohary et al. "Can you unpack that? Learning to rewrite questions-in-c…☆24May 20, 2020Updated 6 years ago
- Code for "HiChunk: Evaluating and Enhancing Retrieval-Augmented Generation with Hierarchical Chunking"☆98Nov 18, 2025Updated 6 months ago
- A first cut into exploring the use of dependency links for building Text Graphs, that, among other things, with help of a centrality algo…☆32Oct 20, 2023Updated 2 years ago
- This project mainly explores what can be achieved by fine-tuning an LLM of about 6B parameters (ChatGLM-6B) in a vertical domain (Romance…☆26Apr 6, 2023Updated 3 years ago
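- YAYI information extraction large model: instruction-tuned on millions of manually constructed, high-quality information extraction examples, developed by the 中科闻歌 (Wenge) algorithm team. (Repo for YAYI Unified Information Extraction Model)☆320Aug 8, 2024Updated last year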
- ☆153Apr 16, 2024Updated 2 years ago
- Exploration of semantic chunking and chunk classification☆19Sep 16, 2024Updated last year
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆496Mar 19, 2024Updated 2 years ago
- Manages vllm-nccl dependency☆18Jun 3, 2024Updated 2 years ago
- This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.☆14May 2, 2024Updated 2 years ago
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆26Nov 6, 2024Updated last year
- The code in "SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design"☆48Oct 20, 2025Updated 7 months ago
- ☆36Sep 6, 2024Updated last year
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆163Jul 3, 2023Updated 2 years ago
- ☆42May 9, 2024Updated 2 years ago
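- This project uses a news dataset to demonstrate the full text classification pipeline: data cleaning, model training, model deployment, and front-end display. Models and tools used: PyTorch, BERT, Streamlit☆18Nov 7, 2022Updated 3 years ago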
- ☆98Dec 5, 2023Updated 2 years ago
- Extrapolating RLVR to General Domains without Verifiers☆203Aug 12, 2025Updated 9 months ago
- ☆59Aug 22, 2024Updated last year
- CCL2024 Chinese Essay Rhetoric Recognition and Understanding☆17Oct 1, 2024Updated last year
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- Code repo for EMNLP 2023 paper "Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models"☆23Nov 13, 2023Updated 2 years ago
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆13Mar 19, 2024Updated 2 years ago
- Official code for the publication "Large Language Models as Zero-shot Dialogue State Tracker through Function Calling" https//arxiv.org/a…☆69Aug 14, 2024Updated last year
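- MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, targeting the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also various niche cultures and even "Martian script" (火星文) data. The MNBVC dataset includes news, essays, novels, books, magazines…☆4,206May 23, 2026Updated 2 weeks ago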