swiss-ai / apertus-tech-report
Tech Report of the Apertus LLM Suite
☆124Updated 2 months ago
Alternatives and similar repositories for apertus-tech-report
Users who are interested in apertus-tech-report are comparing it to the repositories listed below
- Massive Multimodal Open RAG & Extraction: A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Eve…☆165Updated this week
- Pivotal Token Search☆131Updated 4 months ago
- Testing WASM-powered AI agents☆192Updated 2 months ago
- ☆143Updated 2 months ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆217Updated last week
- ☆115Updated 10 months ago
- Train, tune, and infer Bamba model☆136Updated 5 months ago
- Code for collecting, processing, and preparing datasets for the Common Pile☆243Updated 2 months ago
- open source interpretability platform 🧠☆509Updated this week
- lossily compress representation vectors using product quantization☆59Updated last month
- DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation☆771Updated 4 months ago
- Alice in Wonderland code base for experiments and raw experiments data☆131Updated 2 months ago
- A toolkit for describing model features and intervening on those features to steer behavior.☆214Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis☆138Updated 7 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆302Updated last month
- ☆87Updated this week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆106Updated this week
- Open source interpretability artefacts for R1.☆163Updated 7 months ago
- Transformer GPU VRAM estimator☆67Updated last year
- ☆228Updated last month
- Code for the paper "Fishing for Magikarp"☆175Updated 6 months ago
- ☆83Updated 3 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses.☆121Updated last month
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆268Updated 2 weeks ago
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.☆171Updated last week
- Evaluating LLMs with fewer examples☆168Updated last year
- Training-Ready RL Environments + Evals☆182Updated this week
- Storing long contexts in tiny caches with self-study☆217Updated last month
- Contains all assets to run with Moonshot Library (Connectors, Datasets and Metrics)☆38Updated 2 months ago
- ☆266Updated 5 months ago