☆56 · Jun 26, 2025 · Updated 10 months ago
Alternatives and similar repositories for extending-the-context-length-of-open-source-llms
Users that are interested in extending-the-context-length-of-open-source-llms are comparing it to the libraries listed below.
- Miscellaneous Tutorials ☆26 · Sep 20, 2023 · Updated 2 years ago
- Track the progress of LLM context utilisation ☆55 · Apr 14, 2025 · Updated last year
- Large Language Model Hosting Container ☆92 · Apr 13, 2026 · Updated 3 weeks ago
- ☆12 · Mar 27, 2025 · Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models ☆25 · Aug 24, 2024 · Updated last year
- ☆14 · Oct 31, 2024 · Updated last year
- ☆11 · Jun 21, 2025 · Updated 10 months ago
- ☆24 · Nov 26, 2024 · Updated last year
- Demonstration of LLM integration into a lex bot using Lambda codehooks and a Sagemaker endpoint. ☆14 · Dec 20, 2023 · Updated 2 years ago
- ☆14 · Apr 25, 2025 · Updated last year
- 5X faster 60% less memory QLoRA finetuning ☆21 · May 28, 2024 · Updated last year
- ☆13 · May 17, 2025 · Updated 11 months ago
- Mistral on AWS examples for Bedrock & SageMaker ☆91 · Apr 30, 2026 · Updated last week
- Verifiers for LLM Reinforcement Learning ☆80 · Apr 15, 2025 · Updated last year
- ☆46 · Aug 4, 2025 · Updated 9 months ago
- ☆26 · Dec 13, 2024 · Updated last year
- A proxy for Google Bard LLM ☆10 · Nov 2, 2023 · Updated 2 years ago
- YesBut - Multimodal Satire Comprehension Dataset ☆19 · Oct 23, 2024 · Updated last year
- A simple Streamlit application to visualize document chunks and queries in embedding space 🗺️🔍 ☆13 · Apr 15, 2025 · Updated last year
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆158 · Jun 13, 2024 · Updated last year
- Unofficial project transforming AWS PartyRock apps into fullstack SvelteKit apps ☆12 · Mar 3, 2024 · Updated 2 years ago
- Simple implementation of Speculative Sampling in NumPy for GPT-2. ☆99 · Aug 20, 2023 · Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 · May 22, 2024 · Updated last year
- Deploy a multi-account cloud foundation to support highly-regulated workloads and complex compliance requirements. ☆16 · Sep 3, 2024 · Updated last year
- multilingual RAG ☆16 · Feb 6, 2024 · Updated 2 years ago
- ☆16 · Dec 11, 2023 · Updated 2 years ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis" ☆57 · Aug 25, 2024 · Updated last year
- Amazon Transcribe Live Call Analytics (LCA) Sample Solution ☆131 · Oct 2, 2025 · Updated 7 months ago
- ☆25 · Jul 20, 2025 · Updated 9 months ago
- Learn how to quickly build Agents with Amazon Bedrock ☆105 · Mar 29, 2024 · Updated 2 years ago
- ☆28 · Oct 30, 2025 · Updated 6 months ago
- Context is Key: Combining Embedding-based Retrieval with LLMs for Comprehensive Knowledge Enrichment ☆31 · Jul 14, 2023 · Updated 2 years ago
- [ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically d… ☆315 · Nov 11, 2023 · Updated 2 years ago
- FastAPI WebSocket server for the OpenVoice text-to-speech model. ☆12 · Jun 6, 2024 · Updated last year
- Get ready to embark on an exciting journey as we combine the power of Amazon Bedrock, ReactJS and the AWS JavaScript SDK to create a gene… ☆45 · Jun 27, 2024 · Updated last year
- Using modal.com to process FineWeb-edu data ☆20 · Apr 11, 2026 · Updated 3 weeks ago
- ☆14 · Feb 20, 2024 · Updated 2 years ago
- Porting of espressif/arduino-esp32 example to M5Stack CoreS3 (GC0308) ☆11 · Nov 30, 2023 · Updated 2 years ago
- Reasoning-based Evaluation and Ranking of Translations. ☆20 · Jul 18, 2025 · Updated 9 months ago