Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"
☆87Aug 27, 2024Updated last year
Alternatives and similar repositories for Yulan-GARDEN
Users that are interested in Yulan-GARDEN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- YuLan: An Open-Source Large Language Model☆636Jan 10, 2025Updated last year
- An all-in-one framework for Ad-hoc Information Retrieval.☆18Apr 3, 2024Updated 2 years ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆30Feb 15, 2024Updated 2 years ago
- A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.☆850Jun 16, 2025Updated last year
- Some example codes for drawing figures in research paper☆36Mar 3, 2022Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆13Apr 5, 2026Updated 2 months ago
- The OlymMATH dataset☆25Jun 1, 2025Updated last year
- LoRA☆18Apr 15, 2023Updated 3 years ago
- 中文医学语料库☆15Jul 2, 2021Updated 4 years ago
- ConvGQR: Generative Query Reformulation for Conversational Search. A codebase for ACL 2023 accepted paper.☆35Mar 5, 2024Updated 2 years ago
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models☆68Mar 8, 2025Updated last year
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆19Apr 4, 2026Updated 2 months ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆60Aug 28, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆15Nov 23, 2023Updated 2 years ago
- Python 2D Navier-Stokes solver☆30Aug 4, 2025Updated 10 months ago
- ☆59Feb 27, 2025Updated last year
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆84Dec 20, 2024Updated last year
- ☆39Apr 6, 2026Updated 2 months ago
- SIGIR 2021: Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals☆11Jul 30, 2021Updated 4 years ago
- [Under Review] Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation☆68Dec 28, 2025Updated 6 months ago
- ☆13Feb 1, 2024Updated 2 years ago
- ☆12Apr 25, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- NLPIR tutorial: pretrain for IR. pre-train on raw textual corpus, fine-tune on MS MARCO Document Ranking☆13Sep 10, 2021Updated 4 years ago
- ☆12Oct 17, 2024Updated last year
- ☆11Nov 23, 2024Updated last year
- List some datasets in NLP field.☆29May 27, 2021Updated 5 years ago
- This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course translated in Chinese.☆10Jan 16, 2024Updated 2 years ago
- made daily news with ai☆62Updated this week
- ☆70Jun 7, 2023Updated 3 years ago
- ☆333Jul 25, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆14Apr 19, 2024Updated 2 years ago
- 《自然语言处理:大模型理论与实践》配套数据和代码☆77Dec 24, 2025Updated 6 months ago
- From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking☆14Oct 25, 2022Updated 3 years ago
- ☆412Apr 1, 2025Updated last year
- Saving Dense Retriever from Shortcut Dependency in Conversational Search (EMNLP 2022)☆18Nov 24, 2022Updated 3 years ago
- ☆12May 13, 2023Updated 3 years ago
- Official pytorch implementation of "MUSE: A Simple Yet Effective Multimodal Search-Based Framework for Lifelong User Interest Modeling"☆52Jan 12, 2026Updated 5 months ago