Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"
☆87Aug 27, 2024Updated last year
Alternatives and similar repositories for Yulan-GARDEN
Users that are interested in Yulan-GARDEN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- YuLan: An Open-Source Large Language Model☆638Jan 10, 2025Updated last year
- This repository has been redirected into https://kuaisar.github.io/.☆11Oct 12, 2023Updated 2 years ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆30Feb 15, 2024Updated 2 years ago
- A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.☆853Jun 16, 2025Updated 9 months ago
- ☆13Mar 1, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Some example codes for drawing figures in research paper☆35Mar 3, 2022Updated 4 years ago
- Learning word embeddings with AdaGrad and Noise Contrastive Estimation, C++ 11.☆13Sep 22, 2014Updated 11 years ago
- ☆13Aug 11, 2024Updated last year
- The OlymMATH dataset☆24Jun 1, 2025Updated 9 months ago
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Aug 28, 2024Updated last year
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆50Dec 7, 2024Updated last year
- ConvGQR: Generative Query Reformulation for Conversational Search. A codebase for ACL 2023 accepted paper.☆34Mar 5, 2024Updated 2 years ago
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models☆66Mar 8, 2025Updated last year
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆16Jan 26, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆12Mar 5, 2024Updated 2 years ago
- The source code and the data for ACL 2022 paper "Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Dat…☆14Apr 21, 2023Updated 2 years ago
- ☆15Nov 23, 2023Updated 2 years ago
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆82Dec 20, 2024Updated last year
- ☆40Nov 13, 2025Updated 4 months ago
- SIGIR 2021: Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals☆11Jul 30, 2021Updated 4 years ago
- [ACL 2024] Implementation for Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation☆15Oct 9, 2025Updated 5 months ago
- A Chrome browser extension that renders diagrams in the deepseek website inline.☆26Jan 31, 2025Updated last year
- ☆12Apr 25, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- ☆11Nov 23, 2024Updated last year
- ☆327Jul 25, 2024Updated last year
- ☆14Apr 19, 2024Updated last year
- A transformer seq2seq model to generate couplets. 一个写对联的 Transformer 序列到序列模型。☆17Feb 1, 2019Updated 7 years ago
- From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking☆14Oct 25, 2022Updated 3 years ago
- ☆396Apr 1, 2025Updated 11 months ago
- Saving Dense Retriever from Shortcut Dependency in Conversational Search (EMNLP 2022)☆18Nov 24, 2022Updated 3 years ago
- 一起来数三角形吧!☆10Jun 27, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The official GitHub page for the survey paper "A Survey of Large Language Models".☆12,134Mar 11, 2025Updated last year
- Github repo for storing LlamaDatasets☆39Dec 12, 2025Updated 3 months ago
- The homepage for ConvSearch Dataset.☆14May 31, 2022Updated 3 years ago
- We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datas…☆20May 20, 2025Updated 10 months ago
- ☆32Oct 22, 2025Updated 5 months ago
- The final project of NKU 2022 Computer Architecture. 南开大学2022体系结构大作业。☆10Sep 25, 2023Updated 2 years ago
- Collection of papers for scalable automated alignment.☆93Oct 22, 2024Updated last year