HITsz-TMG / YiZhao
YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual financial corpus (Chinese and English).
☆20Updated 4 months ago
Alternatives and similar repositories for YiZhao:
Users that are interested in YiZhao are comparing it to the libraries listed below
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆64Updated 8 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆55Updated 4 months ago
- 通用简单工具项目☆16Updated 6 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated 10 months ago
- ☆32Updated last week
- ☆37Updated last week
- This repository provides an implementation of the paper "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Co…☆67Updated last month
- The code for LaRA Benchmark☆29Updated last month
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆47Updated 2 months ago
- ☆26Updated 6 months ago
- Knowledge-Reasoning Synergy Reinforcement Learning.☆34Updated last month
- This is a meta-model distilled from LLMs for information extraction. This is an intermediate checkpoint that can be well-transferred to a…☆27Updated 2 months ago
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"☆66Updated 8 months ago
- 大语言模型训练和服务调研☆37Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆41Updated last year
- ☆46Updated 10 months ago
- ☆41Updated 5 months ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated last year
- 中文大语言模型评测第三期☆25Updated 10 months ago
- ☆36Updated 7 months ago
- ☆94Updated 4 months ago
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆28Updated 9 months ago
- The demo, code and data of FollowRAG☆71Updated 4 months ago
- TianGong-AI-Unstructure☆63Updated last week
- ☆33Updated last week
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated last year
- The GitHub repository for the paper "Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning" accepte…☆18Updated last year
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆31Updated last month
- ☆91Updated last year