Supports BM25 + vector retrieval
☆28 · May 26, 2025 · Updated 10 months ago
Alternatives and similar repositories for BY_RAG_V2
Users that are interested in BY_RAG_V2 are comparing it to the libraries listed below.
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere… ☆22 · May 9, 2025 · Updated 11 months ago
- GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators ☆53 · Dec 23, 2025 · Updated 3 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler] ☆40 · Jan 4, 2024 · Updated 2 years ago
- ☆95 · Mar 31, 2026 · Updated last week
- WebUI for using SmolDocling-256M-preview ☆13 · Mar 21, 2025 · Updated last year
- ☆80 · Jun 5, 2024 · Updated last year
- ☆49 · Sep 11, 2025 · Updated 7 months ago
- Python bindings for NVIDIA CUDA APIs. ☆13 · Mar 2, 2024 · Updated 2 years ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task… ☆33 · Feb 10, 2025 · Updated last year
- ☆22 · May 4, 2024 · Updated last year
- Bing Daily 4K Ultra HD Wallpaper ☆14 · Jan 18, 2023 · Updated 3 years ago
- Qwen GRPO Graph Extraction RL Finetune ☆63 · Apr 2, 2025 · Updated last year
- The simplest reproduction of R1 results on small models, explaining the most important essence of O1-like models and DeepSeek R1. Think is all your need. Experiments support the claim that, for strong reasoning ability, the content of the thinking process is the core of AGI/ASI. ☆45 · Feb 8, 2025 · Updated last year
- trying to reproduce suno v3 ☆34 · Jan 29, 2025 · Updated last year
- ☆126 · Feb 10, 2024 · Updated 2 years ago
- VenomPred 2.0 API ☆11 · Feb 4, 2026 · Updated 2 months ago
- A complete 7-layer intelligent memory system for AI Agents with multi-modal memory fusion; also supports context_engineering ☆138 · Jul 7, 2025 · Updated 9 months ago
- Python package for P2 (Path Planning), a masked diffusion model sampling method for sequence generation (protein, text, etc.). ☆23 · Aug 19, 2025 · Updated 7 months ago
- ALAS: Autonomous Learning Agent System ☆15 · Aug 14, 2025 · Updated 7 months ago
- BFloat16 Fused Adam Operator for PyTorch ☆19 · Nov 16, 2024 · Updated last year
- flow mirror models from JZX AI Labs ☆43 · Sep 30, 2024 · Updated last year
- Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding ☆64 · Feb 10, 2026 · Updated 2 months ago
- ☆15 · Oct 4, 2024 · Updated last year
- LongAttn: Selecting Long-context Training Data via Token-level Attention ☆15 · Jul 16, 2025 · Updated 8 months ago
- ☆30 · May 9, 2025 · Updated 11 months ago
- Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining ☆13 · Oct 22, 2021 · Updated 4 years ago
- Some microbenchmarks and design docs before commencement ☆11 · Feb 1, 2021 · Updated 5 years ago
- ☆68 · Mar 21, 2025 · Updated last year
- ☆25 · Jul 28, 2025 · Updated 8 months ago
- The All-in-one Judge Models introduced by Opencompass ☆119 · Jul 15, 2025 · Updated 8 months ago
- implementation of https://arxiv.org/pdf/2312.09299 ☆21 · Jul 3, 2024 · Updated last year
- ☆14 · Jan 8, 2025 · Updated last year
- For sharing self-made system prompts, suitable for most large models. ☆13 · Dec 28, 2024 · Updated last year
- The official repo for the DanQing dataset. ☆33 · Mar 25, 2026 · Updated 2 weeks ago
- Ling-Coder-Lite is a MoE LLM provided and open-sourced by CodeFuse and InclusionAI. ☆14 · Apr 22, 2025 · Updated 11 months ago
- ☆107 · Dec 5, 2025 · Updated 4 months ago
- ☆11 · Nov 9, 2022 · Updated 3 years ago
- ☆20 · Jan 22, 2026 · Updated 2 months ago
- Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA core for the decoding stage of LLM inference. ☆46 · Jun 11, 2025 · Updated 10 months ago