Question 1

What image formats are supported?

Accepted Answer

Interlocute supports JPEG, PNG, WebP, TIFF, BMP, and GIF. Images can be submitted as raw bytes, a public URL, a blob SAS URI, or a provider file ID.
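
When submitting raw bytes, it can help to check on the client side that the payload is one of the six listed formats before sending it. The sketch below identifies the format from the file's leading signature bytes. The function name sniff_image_format is hypothetical and not part of Interlocute; the service's own validation may work differently.

```python
from typing import Optional


def sniff_image_format(data: bytes) -> Optional[str]:
    """Return the image format implied by the leading magic bytes, or None.

    Hypothetical client-side helper; it only covers the six formats
    listed in the answer above.
    """
    if data.startswith(b"\xff\xd8\xff"):
        return "JPEG"
    if data.startswith(b"\x89PNG\r\n\x1a\n"):
        return "PNG"
    # WebP is a RIFF container with "WEBP" at byte offset 8.
    if data[:4] == b"RIFF" and data[8:12] == b"WEBP":
        return "WebP"
    # TIFF has a little-endian ("II") and a big-endian ("MM") header.
    if data.startswith((b"II*\x00", b"MM\x00*")):
        return "TIFF"
    if data.startswith(b"BM"):
        return "BMP"
    if data.startswith((b"GIF87a", b"GIF89a")):
        return "GIF"
    return None
```

A None result means the bytes do not match any of the supported formats, so the request can be rejected locally before upload.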

Question 2

What are the three analysis layers?

Accepted Answer

The three layers are the structural fingerprint, semantic intelligence, and forensic verification. The structural fingerprint runs locally with zero API calls and covers structural properties, quality metrics, colour analysis, and metadata. Semantic intelligence adds understanding from a multimodal LLM: scene classification, entities, intent, emotion, and design scores. Forensic verification adds adversarial analysis: OCR evidence anchors, claim validation, manipulation detection, and risk assessment.

Question 3

Is the structural fingerprint really free?

Accepted Answer

Yes. The structural fingerprint is fully deterministic and runs locally. It makes no external API calls and incurs no provider costs. It is always executed as the first step of any analysis.

Question 4

How does confidence-driven escalation work?

Accepted Answer

When semantic intelligence produces a confidence score below the profile's escalation threshold (default 0.65), the pipeline automatically runs forensic verification. You can also force forensic verification by setting highStakes to true.
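
The escalation rule can be sketched as a single decision function. This is a minimal illustration, not Interlocute's implementation: the Profile class, its field names, and should_run_forensics are hypothetical. Only the 0.65 default and the highStakes override come from the answer above, and "below" is read here as strictly less than the threshold.

```python
from dataclasses import dataclass

# Default escalation threshold quoted in the answer above.
DEFAULT_ESCALATION_THRESHOLD = 0.65


@dataclass
class Profile:
    """Hypothetical stand-in for an analysis profile."""
    escalation_threshold: float = DEFAULT_ESCALATION_THRESHOLD
    high_stakes: bool = False  # mirrors the highStakes setting


def should_run_forensics(semantic_confidence: float, profile: Profile) -> bool:
    """Decide whether forensic verification should run after the semantic pass."""
    if profile.high_stakes:
        # Forced escalation, regardless of confidence.
        return True
    # Escalate only when confidence falls below the threshold.
    return semantic_confidence < profile.escalation_threshold
```

Under these assumptions, a confidence of 0.50 escalates on the default profile, 0.80 does not, and any score escalates when high_stakes is set.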

Question 5

What is adversarial verification?

Accepted Answer

Adversarial verification is the approach used by the forensic verification layer: a second model pass cross-examines the claims made by the semantic analysis against OCR-extracted evidence. It detects contradictions and manipulation signals (edited images, AI-generated artifacts, lighting inconsistencies), and produces a contextual risk assessment with a tamper-proof evidence hash.
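
To illustrate how an evidence hash makes later edits detectable, the sketch below hashes a canonical JSON encoding of an evidence record with SHA-256. The hash algorithm, the record layout, and the function name evidence_hash are all assumptions made for illustration; the answer above does not say how Interlocute computes its hash.

```python
import hashlib
import json


def evidence_hash(evidence: dict) -> str:
    """Return a SHA-256 hex digest over a canonical JSON form of the evidence.

    Hypothetical helper: sorting keys and fixing separators means that
    logically equal records always produce the same digest.
    """
    canonical = json.dumps(
        evidence,
        sort_keys=True,
        separators=(",", ":"),
        ensure_ascii=False,
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()
```

Anyone holding the original digest can recompute it over the stored evidence; a single changed character in any field yields a different digest.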

Question 6

Can I detect objects without running the full pipeline?

Accepted Answer

Yes. The Object Detection profile runs independently of the layered pipeline. It detects objects and people with bounding boxes, extracts content tags, and optionally removes the background — all without invoking semantic intelligence or forensic verification.

Question 7

How is image analysis billed?

Accepted Answer

The structural fingerprint is free. Semantic intelligence incurs LLM token costs plus a platform premium. Forensic verification incurs additional LLM costs for the adversarial pass and vision API costs for OCR and feature extraction. All costs are attributed per-request in your usage ledger.
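
The cost components named above can be modelled as a per-request record. This is only a sketch of how the pieces add up: the RequestCosts class and its field names are hypothetical, it contains no real prices, and it does not reflect the actual format of the usage ledger.

```python
from dataclasses import dataclass
from decimal import Decimal

ZERO = Decimal("0")


@dataclass
class RequestCosts:
    """Hypothetical per-request cost breakdown; every amount is caller-supplied."""
    semantic_llm: Decimal = ZERO       # LLM token cost of the semantic pass
    platform_premium: Decimal = ZERO   # platform premium on the semantic pass
    forensic_llm: Decimal = ZERO       # LLM cost of the adversarial pass
    vision_api: Decimal = ZERO         # OCR and feature-extraction cost

    @property
    def structural(self) -> Decimal:
        # The structural fingerprint runs locally and is free.
        return ZERO

    def total(self) -> Decimal:
        return (
            self.structural
            + self.semantic_llm
            + self.platform_premium
            + self.forensic_llm
            + self.vision_api
        )
```

A request that stops at the structural fingerprint totals zero; the forensic fields stay at zero unless the pipeline escalates.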
