Voice Mode Tools Reference¶

Voice Mode provides several tools through the Model Context Protocol (MCP) for voice interactions.

Available Tools¶

Have a voice conversation - speak a message and optionally listen for response.

Parameters:

Parameter	Type	Default	Description
`message`	string	required	The message to speak
`wait_for_response`	boolean	`true`	Whether to listen for a response after speaking
`listen_duration`	number	`30.0`	Maximum time to listen for response in seconds
`min_listen_duration`	number	`1.0`	Minimum time to record before silence detection can stop
`transport`	string	`"auto"`	Transport method: "auto", "local", or "livekit"
`voice`	string	auto	TTS voice override (e.g., "nova", "alloy", "af_sky")
`tts_provider`	string	auto	TTS provider: "openai" or "kokoro"
`tts_model`	string	auto	TTS model (e.g., "tts-1", "tts-1-hd", "gpt-4o-mini-tts")
`tts_instructions`	string	none	Tone/style instructions for gpt-4o-mini-tts model

Example:

converse("Hello, how can I help you today?", wait_for_response=True, listen_duration=45)

Listen for speech and convert to text.

Parameters:

Parameter	Type	Default	Description
`duration`	number	`5.0`	How long to listen in seconds
`min_listen_duration`	number	`1.0`	Minimum time to record before silence detection

Check LiveKit room status and participants.

Parameters: None

List available audio input and output devices.

Parameters: None

Check the status of all voice services including TTS, STT, LiveKit, and audio devices.

Parameters: None

Start the Kokoro TTS service.

Parameters:

Parameter	Type	Default	Description
`models_dir`	string	`~/Models/kokoro`	Path to Kokoro models directory

Stop the Kokoro TTS service.

Parameters: None

Check the status of Kokoro TTS service.

Parameters: None

Display live statistics dashboard for voice conversation performance.

Parameters: None

Get a concise summary of voice conversation performance metrics.

Parameters: None

Show recent voice conversation interactions with timing details.

Parameters:

Parameter	Type	Default	Description
`limit`	integer	`10`	Maximum number of recent interactions (max: 50)

Reset all voice conversation statistics and start a new session.

Parameters: None

Export detailed voice conversation statistics as JSON data.

Parameters: None