TrelisResearch / code-llama-32k
Run code-llama with 50k tokens using flash attention and better transformer
☆12 · Updated last year
Alternatives and similar repositories for code-llama-32k:
Users who are interested in code-llama-32k are comparing it to the repositories listed below.
- ☆30 · Updated last year
- Demos of some issues with LangChain. ☆32 · Updated last year
- Streamlit app for recommending eval functions using prompt diffs ☆27 · Updated last year
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search ☆40 · Updated last year
- Example of running LangChain on Cloud Run ☆61 · Updated last year
- Using modal.com to process FineWeb-edu data ☆20 · Updated 2 weeks ago
- A daemon that makes a desktop OS accessible to AI agents ☆23 · Updated last week
- ☆31 · Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆37 · Updated last year
- Github repo for storing LlamaDatasets ☆33 · Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 · Updated 4 months ago
- ☆57 · Updated last year
- AI_Powered_Dev_Search_Engine ☆12 · Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models. ☆30 · Updated 2 months ago
- Tools for formatting large language model prompts. ☆12 · Updated last year
- A personal knowledge base that I can dump information to and help me learn ☆24 · Updated 9 months ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing. ☆37 · Updated last week
- ☆12 · Updated 6 months ago
- ☆1 · Updated 8 months ago
- Self-host LLMs with vLLM and BentoML ☆94 · Updated this week
- Ongoing research training transformer models at scale ☆35 · Updated last year
- A list of AI memory projects ☆84 · Updated 2 months ago
- LLM finetuning ☆42 · Updated last year
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images ☆34 · Updated last year
- Embed anything. ☆29 · Updated 10 months ago
- Data extraction with LLM on CPU ☆68 · Updated last year
- Open Source Embeddings Optimisation and Eval Framework for RAG/LLM Applications. Documentations at https://docs.vectorboard.ai/introducti… ☆50 · Updated last year
- LLM Agents: Landing Page Generation for an E-commerce Platform Using CrewAI, Groq-LangChain and Qdrant ☆13 · Updated 9 months ago
- A text-to-SQL prototype on the northwind sqlite dataset ☆12 · Updated 6 months ago
- ☆75 · Updated last year