Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"
☆87Aug 27, 2024Updated last year
Alternatives and similar repositories for Yulan-GARDEN
Users that are interested in Yulan-GARDEN are comparing it to the libraries listed below.
- YuLan: An Open-Source Large Language Model☆636Jan 10, 2025Updated last year
- An all-in-one framework for Ad-hoc Information Retrieval.☆18Apr 3, 2024Updated 2 years ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models☆30Feb 15, 2024Updated 2 years ago
- A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.☆850Jun 16, 2025Updated 10 months ago
- JDsearch: A Personalized Product Search Dataset with Real Queries and Full Interactions☆43May 8, 2023Updated 2 years ago
- ☆13Mar 1, 2019Updated 7 years ago
- Learning word embeddings with AdaGrad and Noise Contrastive Estimation, C++ 11.☆13Sep 22, 2014Updated 11 years ago
- ☆13Apr 5, 2026Updated 2 weeks ago
- The OlymMATH dataset☆24Jun 1, 2025Updated 10 months ago
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Aug 28, 2024Updated last year
- Implementations of Influential Recommender System☆11Oct 29, 2024Updated last year
- LoRA☆17Apr 15, 2023Updated 3 years ago
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆51Dec 7, 2024Updated last year
- Chinese medical corpus☆14Jul 2, 2021Updated 4 years ago
- ConvGQR: Generative Query Reformulation for Conversational Search. A codebase for ACL 2023 accepted paper.☆34Mar 5, 2024Updated 2 years ago
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models☆66Mar 8, 2025Updated last year
- 🗄️ Fudan University PowerPoint presentation templates.☆34Jun 10, 2025Updated 10 months ago
- ☆12Mar 5, 2024Updated 2 years ago
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆18Apr 4, 2026Updated 2 weeks ago
- The source code and the data for ACL 2022 paper "Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Dat…☆14Apr 21, 2023Updated 2 years ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- Telegram bulk messaging (group broadcasts, TG private messages) + Telegram account registration tool / batch account creation☆23Oct 26, 2024Updated last year
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆56Aug 28, 2025Updated 7 months ago
- ☆15Nov 23, 2023Updated 2 years ago
- ☆59Feb 27, 2025Updated last year
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆83Dec 20, 2024Updated last year
- SIGIR 2021: Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals☆11Jul 30, 2021Updated 4 years ago
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated last year
- ☆12Apr 25, 2022Updated 3 years ago
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- NLPIR tutorial: pretrain for IR. pre-train on raw textual corpus, fine-tune on MS MARCO Document Ranking☆13Sep 10, 2021Updated 4 years ago
- ☆11Nov 23, 2024Updated last year
- ☆16Jul 12, 2024Updated last year
- A list of datasets in the NLP field.☆29May 27, 2021Updated 4 years ago
- ☆27Jul 7, 2015Updated 10 years ago
- ☆329Jul 25, 2024Updated last year
- ☆70Jun 7, 2023Updated 2 years ago
- ☆401Apr 1, 2025Updated last year
- From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking☆14Oct 25, 2022Updated 3 years ago