The AI stack
PlotonIQ uses specialised models for each task rather than one model for everything.
Language: Claude + Groq
Claude (Anthropic) handles all text tasks: quoting, price justification, business chat, briefing. Strong at structured output (JSON), numerical reasoning, and industry-specific instructions.
Groq serves as fallback. If Claude is unavailable, Groq with Llama takes over automatically. Slightly lower quality, but availability rises to 99.9%+.
Vision: DINO + Gemini + KI-Vision
For AI measurement, a multi-model cascade:
Claude Vision — object detection, bounding boxes.
Gemini Flash — text-prompt localisation ("find all glass surfaces").
Gemini 2.5 Flash — context analysis (window type, glazing).
KI-Vision — pixel-accurate polygon segmentation. Not rectangles — true contours. 5-15 seconds per photo.
Audio: Whisper
Whisper (OpenAI, hosted via Groq) — speech-to-text, 95%+ accuracy, all 17 languages. PlotonIQ extracts materials, dimensions, customer names and scheduling from the transcript automatically.
Production: Physics, not AI
Production scheduling deliberately uses deterministic algorithms. Machine throughput rates, setup times, deadline prioritisation — physics and logic, not pattern recognition. No API costs, no latency, predictable results.