In Person

Speak. We'll handle the rest.

In Person mode serves two practical purposes: live interpretation during face-to-face conversations, and structured post-meeting documentation generated from spoken notes. Teams capture important detail as it is said instead of relying on write-ups from memory.

Users can speak naturally, then apply Prompt Customisation to shape output into clear summaries, action lists, and follow-up records. This is especially useful when teams need consistent documentation quality for operational, legal, or regulatory contexts.

Two Ways to Use In Person Mode

Whether you're interpreting a conversation or capturing spoken notes, In Person mode adapts to your workflow.

Who Uses In Person Mode

High-trust, high-stakes environments where language and documentation matter.

šŸ„ Healthcare & Aged Care
Clinician speaking with a patient whose first language isn't English. Notes captured as a clinical handover summary.

āš–ļø Legal & Migration
Lawyer conducting an intake interview across a language barrier. Session documented for file records.

šŸ›ļø Government Engagement
Officer meeting with a constituent. Interpreted conversation with a follow-up action list emailed automatically.

🎓 Education
Teacher in a parent meeting where the family speaks a different language. Notes delivered as a structured record.

šŸŒ NGOs & Humanitarian
Field worker conducting an interpreted welfare check. Documentation ready for case management.

āœˆļø Tourism & Hospitality
Front desk staff assisting international guests. Conversation notes logged for service follow-up.

How In Person Mode Works

Watch the demo

How does prompt customisation work?

You speak your summarisation requirements in the app, and the AI converts them into a usable prompt. The default output is a structured text summary, which can be emailed automatically to the session creator.

How do summaries get delivered?

Summaries are generated as text and can be automatically sent by email to the person who created the session. JSON-style summaries are possible for advanced workflows but are not the default path.

How do I choose between In Person, Video Call, and Broadcast?

Use In Person for face-to-face interpreted conversations or spoken note capture. Use Video Call for English-only transcription or two-language interpretation. Use Broadcast when a session involves three or more active languages, delivered one-to-many with two-way participation where needed. If you are unsure, start with your most common meeting format and language mix, then add other modes as your team gains confidence.
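
For teams that want to write this guidance into an internal checklist or booking tool, the rule of thumb can be sketched in a few lines of Python. This is an illustration only, not part of the product: the names `Mode` and `suggest_mode` are hypothetical, and checking the language count before the meeting format is an assumption about how to resolve overlapping cases.

```python
from enum import Enum


class Mode(Enum):
    IN_PERSON = "In Person"
    VIDEO_CALL = "Video Call"
    BROADCAST = "Broadcast"


def suggest_mode(face_to_face: bool, active_languages: int) -> Mode:
    """Suggest a mode from the meeting format and the number of active languages.

    Hypothetical helper: the ordering of the checks is an assumption.
    """
    if active_languages < 1:
        raise ValueError("active_languages must be at least 1")
    # Three or more active languages points to Broadcast.
    if active_languages >= 3:
        return Mode.BROADCAST
    # Face-to-face conversation or spoken note capture points to In Person.
    if face_to_face:
        return Mode.IN_PERSON
    # Otherwise: remote transcription or two-language interpretation.
    return Mode.VIDEO_CALL


print(suggest_mode(face_to_face=True, active_languages=2).value)
print(suggest_mode(face_to_face=False, active_languages=4).value)
```

A real decision would also weigh factors the sketch ignores, such as whether a single-language remote session is actually in English.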

Can we trial VideoTranslatorAI before wider rollout?

Yes. Teams can start with the free trial to validate workflows, language coverage, and summary quality in real scenarios. A practical trial should include at least one In Person session, one Video Call scenario, and one workflow using Prompt Customisation, so you can confirm fit before broader adoption.