Media Processor
Deploy a media-processing AI agent that handles audio, video, images, and documents out of the box — no pipeline assembly required.
Built-in media modules
Structured intelligence for video, documents, and images
Video Intelligence
Upload a video and get a structured AI index: speech transcripts, visual scene analysis, entity extraction, sentiment, and AI summaries — choose the signals you need.
- Speech profile — what was said
- Visual profile — what was seen
- Insights profile — what it means
- Composable profiles
Document Processing
Upload PDFs, forms, and scanned documents. Three composable profiles extract text, structural layout, and form fields — combine them to pay only for what you need.
- PDF Read — text and language
- PDF Layout — structure and figures
- Form Extraction — fields and barcodes
- Composable profiles
Image Intelligence
Upload an image and get layered AI analysis: a structural fingerprint with instant local metrics, semantic understanding from a multimodal LLM, and full forensic verification with manipulation detection.
- Structural fingerprint — instant local analysis
- Semantic intelligence — LLM-powered understanding
- Forensic verification — adversarial analysis
- Object detection & vision features
What you get out of the box
Audio transcription (speech-to-text) out of the box
Image upload with OCR and vision analysis
Video ingestion and frame extraction
PDF and text document parsing and summarisation
All media processing governed and usage-metered
Combine media inputs with conversational AI in one node
How setup works
Sign up and create a new node
Select the Media Processor profile
Upload audio, images, or documents via the dashboard or API
Optionally customise the constitution for your media workflow
Embed the transcription widget or call the API directly
Try these prompts
Frequently Asked Questions
Media Processor
What media formats does the Media Processor support?
How does audio transcription work?
Can I combine media processing with conversation?
Is the Media Processor suitable for production workloads?
How is media processing billed?
Ready to deploy?
Create your Media Processor node in seconds and start building.