Stay organized with collections
Save and categorize content based on your preferences.
This page contains recommendations on how to provide speech data to the
Google Assistant API. These guidelines are designed for greater efficiency
and accuracy as well as reasonable response times from the service.
Audio pre-processing
It's best to provide audio that is as clean as possible by using a good quality
and well-positioned microphone. However, applying noise-reduction signal
processing to the audio before sending it to the service typically reduces
recognition accuracy. The service is designed to handle noisy audio.
For best results:
Position the microphone as close to the user as possible, particularly when
background noise is present.
Avoid audio clipping.
Do not use automatic gain control (AGC).
All noise reduction processing should be disabled.
Ideally:
The audio level should be calibrated so that the input signal does not clip,
and peak speech audio levels reach approximately -20 to -10 dBFS.
The device should exhibit approximately "flat" amplitude versus
frequency characteristics (+- 3 dB 100 Hz to 8000 Hz).
Total harmonic distortion should be less than 1% from 100 Hz to 8000
Hz at 90 dB SPL input level.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2024-09-18 UTC."],[[["\u003cp\u003eThe Google Assistant API is designed to handle noisy audio, so noise reduction processing should be disabled prior to sending audio.\u003c/p\u003e\n"],["\u003cp\u003eFor optimal performance, position the microphone near the user, avoid audio clipping and automatic gain control, and calibrate audio levels to peak between -20 to -10 dBFS.\u003c/p\u003e\n"],["\u003cp\u003eDevices used for audio input should ideally have a flat frequency response and minimal harmonic distortion.\u003c/p\u003e\n"],["\u003cp\u003e\u003cstrong\u003eWarning:\u003c/strong\u003e The Google Assistant Library for Python is deprecated; use the Google Assistant Service instead.\u003c/p\u003e\n"]]],["The Google Assistant Library for Python is deprecated; use the Google Assistant Service instead. For optimal audio input, provide clean audio from a high-quality, well-positioned microphone. Disable noise reduction and automatic gain control. Position the microphone close to the user and avoid audio clipping. Calibrate audio levels to avoid clipping, with peak levels around -20 to -10 dBFS. Aim for a flat amplitude response (+- 3 dB from 100 Hz to 8000 Hz) and total harmonic distortion under 1%.\n"],null,["# Best Practices for Audio\n\n| **Warning:** The Google Assistant Library for Python is deprecated as of June 28th, 2019. Use the [Google Assistant Service](/assistant/sdk/guides/service/python) instead.\n\n\u003cbr /\u003e\n\nThis page contains recommendations on how to provide speech data to the\nGoogle Assistant API. These guidelines are designed for greater efficiency\nand accuracy as well as reasonable response times from the service.\n\nAudio pre-processing\n--------------------\n\nIt's best to provide audio that is as clean as possible by using a good quality\nand well-positioned microphone. However, applying noise-reduction signal\nprocessing to the audio before sending it to the service typically reduces\nrecognition accuracy. The service is designed to handle noisy audio.\n\nFor best results:\n\n- Position the microphone as close to the user as possible, particularly when background noise is present.\n- Avoid audio clipping.\n- Do not use automatic gain control (AGC).\n- All noise reduction processing should be disabled.\n\nIdeally:\n\n- The audio level should be calibrated so that the input signal does not clip, and peak speech audio levels reach approximately -20 to -10 dBFS.\n- The device should exhibit approximately \"flat\" amplitude versus frequency characteristics (+- 3 dB 100 Hz to 8000 Hz).\n- Total harmonic distortion should be less than 1% from 100 Hz to 8000 Hz at 90 dB SPL input level."]]