SpeechConfiguration
@objc
public class SpeechConfiguration : NSObject
Configuration properties for Spokestack modules.
-
A comma-separated list of wakeword keywords.
Remark
ex: “up,dog”
Warning
Cannot contain spaces.
See also
AppleWakewordRecognizer
Declaration
Swift
@objc public var wakewords: String
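The property above can be set like this; a minimal sketch that assumes `SpeechConfiguration`'s default initializer:

```swift
import Spokestack

// Sketch: configuring custom wakewords. Values are comma-separated
// and, per the warning above, must not contain spaces.
let config = SpeechConfiguration()
config.wakewords = "up,dog"
```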
-
The name of the window function to apply to each audio frame before calculating the STFT.
Remark
Currently the “hann” window is supported.
See also
TFLiteWakewordRecognizer
Declaration
Swift
public var fftWindowType: SignalProcessing.FFTWindowType
-
The desired linear Root Mean Squared (RMS) signal energy, which is used for signal normalization and should be tuned to the RMS target used during wakeword model training.
See also
TFLiteWakewordRecognizer
Declaration
Swift
@available(*, deprecated, message: "RMS normalization is no longer used during wakeword recognition.") @objc public var rmsTarget: Float
-
The Exponentially Weighted Moving Average (EWMA) update rate for the current Root Mean Squared (RMS) signal energy (0 for no RMS normalization).
See also
TFLiteWakewordRecognizer
Declaration
Swift
@available(*, deprecated, message: "RMS normalization is no longer used during wakeword recognition.") @objc public var rmsAlpha: Float
-
The size of the signal window used to calculate the STFT, in number of samples; should be a power of 2 for maximum efficiency.
See also
TFLiteWakewordRecognizer
Declaration
Swift
@objc public var fftWindowSize: Int
-
The length of time to skip each time the overlapping STFT is calculated, in milliseconds.
See also
TFLiteWakewordRecognizer
Declaration
Swift
@objc public var fftHopLength: Int
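The three STFT properties above (`fftWindowType`, `fftWindowSize`, `fftHopLength`) must agree with the values used when the filter model was trained. A sketch with illustrative values; the `.hann` case name is an assumption based on the documented “hann” support:

```swift
import Spokestack

// Sketch: STFT settings for the wakeword signal-processing stage.
// These values are illustrative and must match the filter model's training.
let config = SpeechConfiguration()
config.fftWindowType = .hann // assumed case name; "hann" is the only documented window
config.fftWindowSize = 512   // samples; a power of 2 for efficiency
config.fftHopLength = 10     // milliseconds skipped between successive STFTs
```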
-
The length of a frame in the mel spectrogram used as an input to the wakeword recognizer encoder, in milliseconds.
See also
TFLiteWakewordRecognizer
Declaration
Swift
@objc public var melFrameLength: Int
-
The number of filterbank components in each mel spectrogram frame sent to the wakeword recognizer.
See also
TFLiteWakewordRecognizer
Declaration
Swift
@objc public var melFrameWidth: Int
-
The size of the wakeword recognizer’s encoder state output.
Remarks
Defaults to matching the encodeWidth value.
Declaration
Swift
@objc public var stateWidth: Int
-
The size of the wakeword recognizer’s encoder window output.
See also
TFLiteWakewordRecognizer
Declaration
Swift
@objc public var encodeWidth: Int
-
The length of the sliding window of encoder output used as an input to the wakeword recognizer classifier, in milliseconds.
See also
TFLiteWakewordRecognizer
Declaration
Swift
@objc public var encodeLength: Int
-
The threshold of the wakeword recognizer classifier’s posterior output, above which the wakeword recognizer activates the pipeline, in the range [0, 1].
See also
TFLiteWakewordRecognizer
Declaration
Swift
@objc public var wakeThreshold: Float
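The threshold trades false activations against missed wakewords. A sketch; 0.9 is an illustrative value, not a recommended default:

```swift
import Spokestack

// Sketch: a higher posterior threshold means fewer false activations
// but more missed wakewords. 0.9 is illustrative.
let config = SpeechConfiguration()
config.wakeThreshold = 0.9
```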
-
The minimum length of an activation, in milliseconds. Used to ignore a Voice Activity Detector (VAD) deactivation after the wakeword.
See also
TFLiteWakewordRecognizer
Declaration
Swift
@objc public var wakeActiveMin: Int
-
The maximum length of an activation, in milliseconds. Used to time out the speech pipeline activation.
Remarks
Defaults to 5 seconds to improve perceived responsiveness, although most NLUs use a longer timeout (e.g. 7 s).
Declaration
Swift
@objc public var wakeActiveMax: Int
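The two activation-timing properties above can be sketched together; both are in milliseconds, and the values below are illustrative:

```swift
import Spokestack

// Sketch: activation timing, in milliseconds.
let config = SpeechConfiguration()
config.wakeActiveMin = 2000 // ignore a VAD deactivation for 2 s after the wakeword
config.wakeActiveMax = 5000 // time out the pipeline activation after 5 s
```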
-
Indicates to the VAD the level of permissiveness toward non-speech activation.
Declaration
Swift
public var vadMode: VADMode
-
Delay between a VAD deactivation and the delivery of the recognition results.
See also
AppleSpeechRecognizer
Remark
Unique to iOS.
Declaration
Swift
@objc public var vadFallDelay: Int
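The fall delay keeps trailing speech from being clipped when the VAD deactivates. A sketch (600 is illustrative; the `VADMode` cases are not enumerated here, so `vadMode` is left at its default):

```swift
import Spokestack

// Sketch: deliver recognition results 600 ms after the VAD deactivates
// so trailing speech is not clipped. 600 is illustrative.
let config = SpeechConfiguration()
config.vadFallDelay = 600
```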
-
Audio sampling rate, in Hz.
Declaration
Swift
public var sampleRate: Int
-
Audio frame width, in milliseconds.
To do
Should be renamed wakeFrameWidth.
Declaration
Swift
@objc public var frameWidth: Int
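The two core audio properties above can be set as a pair; 16 kHz and 20 ms are common values for speech models, but confirm what your models expect:

```swift
import Spokestack

// Sketch: core audio settings. Illustrative values; the models in use
// dictate the correct sample rate and frame width.
let config = SpeechConfiguration()
config.sampleRate = 16000 // Hz
config.frameWidth = 20    // milliseconds per audio frame
```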
-
Length of time to allow an Apple ASR request to run, in milliseconds.
See also
AppleWakewordRecognizer
Remark
Apple has an undocumented limit of 60000 ms per request. Unique to iOS.
Declaration
Swift
@objc public var wakewordRequestTimeout: Int
-
The pre-emphasis filter weight to apply to the normalized audio signal, in a range of [0, 1].
See also
TFLiteWakewordRecognizer
Declaration
Swift
@objc public var preEmphasis: Float
-
The filename of the machine learning model used for the filtering step.
Remarks
Both the file name and the file path are configurable to allow for flexibility in constructing the path that the recognizer will attempt to load the model from.
Declaration
Swift
@objc public var filterModelName: String
-
The filename of the machine learning model used for the encoding step.
Remarks
Both the file name and the file path are configurable to allow for flexibility in constructing the path that the recognizer will attempt to load the model from.
Declaration
Swift
@objc public var encodeModelName: String
-
The filename of the machine learning model used for the detect step.
Remarks
Both the file name and the file path are configurable to allow for flexibility in constructing the path that the recognizer will attempt to load the model from.
Declaration
Swift
@objc public var detectModelName: String
-
The filesystem path to the machine learning model for the filtering step.
Declaration
Swift
@objc public var filterModelPath: String
-
The filesystem path to the machine learning model for the encoding step.
See also
TFLiteWakewordRecognizer
Declaration
Swift
@objc public var encodeModelPath: String
-
The filesystem path to the machine learning model for the detect step.
See also
TFLiteWakewordRecognizer
Declaration
Swift
@objc public var detectModelPath: String
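The three model-path properties above can be populated from the app bundle; the resource names ("filter", "encode", "detect") are illustrative:

```swift
import Foundation
import Spokestack

// Sketch: pointing the wakeword recognizer at bundled TensorFlow Lite
// models. Resource names are illustrative.
let config = SpeechConfiguration()
if let filter = Bundle.main.path(forResource: "filter", ofType: "tflite"),
   let encode = Bundle.main.path(forResource: "encode", ofType: "tflite"),
   let detect = Bundle.main.path(forResource: "detect", ofType: "tflite") {
    config.filterModelPath = filter
    config.encodeModelPath = encode
    config.detectModelPath = detect
}
```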
-
Text To Speech API client identifier key.
See also
TextToSpeech
Declaration
Swift
@objc public var apiId: String
-
Text To Speech API client secret key.
See also
TextToSpeech
Declaration
Swift
@objc public var apiSecret: String
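The two credential properties above are set together; the literals below are placeholders, and in a shipping app secrets should be loaded from the keychain rather than hard-coded:

```swift
import Spokestack

// Sketch: TTS API credentials. Placeholders only; do not hard-code
// real secrets in a shipping app.
let config = SpeechConfiguration()
config.apiId = "your-client-id"         // placeholder
config.apiSecret = "your-client-secret" // placeholder
```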
-
The filesystem path to the vocabulary used for tokenizer encoding.
See also
Tokenizer
Declaration
Swift
@objc public var nluVocabularyPath: String
-
The index in the vocabulary of the terminator token. Determined by the NLU vocabulary.
See also
BertTokenizer
Declaration
Swift
@objc public var nluTerminatorTokenIndex: Int
-
The index in the vocabulary of the padding token. Determined by the NLU vocabulary.
See also
BertTokenizer
Declaration
Swift
@objc public var nluPaddingTokenIndex: Int
-
The filesystem path to the machine learning model for Natural Language Understanding processing.
See also
TensorflowNLU
Declaration
Swift
@objc public var nluModelPath: String
-
The filesystem path to the model metadata for Natural Language Understanding processing.
See also
TensorflowNLU
Declaration
Swift
@objc public var nluModelMetadataPath: String
-
The maximum utterance length the NLU can process. Determined by the NLU model.
See also
BertTokenizer
Declaration
Swift
@objc public var nluMaxTokenLength: Int
-
Debugging trace levels, for simple filtering.
Declaration
Swift
@objc public var tracing: Trace.Level
-
Delegate events will be sent using the specified dispatch queue.
Declaration
Swift
@objc public var delegateDispatchQueue: DispatchQueue
-
Automatically run Spokestack’s NLU classification on ASR transcripts for clients that use the Spokestack facade.
Note
Requires NLUTensorflow to be correctly configured, notably with nluModelPath, nluModelMetadataPath, and nluVocabularyPath.
Declaration
Swift
@objc public var automaticallyClassifyTranscript: Bool
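Enabling automatic classification requires the three NLU paths noted above. A sketch; the resource names are illustrative:

```swift
import Foundation
import Spokestack

// Sketch: enabling automatic NLU classification of ASR transcripts.
// The three paths must all point at real bundled files; names are illustrative.
let config = SpeechConfiguration()
if let model = Bundle.main.path(forResource: "nlu", ofType: "tflite"),
   let metadata = Bundle.main.path(forResource: "metadata", ofType: "json"),
   let vocab = Bundle.main.path(forResource: "vocab", ofType: "txt") {
    config.nluModelPath = model
    config.nluModelMetadataPath = metadata
    config.nluVocabularyPath = vocab
    config.automaticallyClassifyTranscript = true
}
```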
-
The filename of the machine learning model used for the filtering step of the keyword recognizer.
Remarks
Both the file name and the file path are configurable to allow for flexibility in constructing the path that the recognizer will attempt to load the model from.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordFilterModelName: String
-
The filename of the machine learning model used for the encoding step of the keyword recognizer.
Remarks
Both the file name and the file path are configurable to allow for flexibility in constructing the path that the recognizer will attempt to load the model from.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordEncodeModelName: String
-
The filename of the machine learning model used for the detect step of the keyword recognizer.
Remarks
Both the file name and the file path are configurable to allow for flexibility in constructing the path that the recognizer will attempt to load the model from.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordDetectModelName: String
-
The filename of the model metadata for keyword recognition.
Remarks
Both the file name and the file path are configurable to allow for flexibility in constructing the path that the recognizer will attempt to load the model from.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordMetadataName: String
-
The filesystem path to the machine learning model for the filtering step of the keyword recognizer.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordFilterModelPath: String
-
The filesystem path to the machine learning model for the encoding step of the keyword recognizer.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordEncodeModelPath: String
-
The filesystem path to the machine learning model for the detect step of the keyword recognizer.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordDetectModelPath: String
-
The threshold of the keyword recognizer’s posterior output, above which the keyword recognizer emits a recognition event for the most probable keyword.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordThreshold: Float
-
The filesystem path to the model metadata for keyword recognition.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordMetadataPath: String
-
A comma-separated list of keywords to recognize.
Remark
ex: “yes,no”
Warning
Cannot contain spaces. Will be ignored in favor of keywordMetadataPath if available.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywords: String
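The keyword recognizer's text, threshold, and model properties can be sketched together; the file names and the 0.5 threshold are illustrative, and the keyword list is ignored when `keywordMetadataPath` is set:

```swift
import Foundation
import Spokestack

// Sketch: keyword recognizer setup. Illustrative values and resource names;
// the keywords list is ignored when keywordMetadataPath is available.
let config = SpeechConfiguration()
config.keywords = "yes,no"
config.keywordThreshold = 0.5
if let filter = Bundle.main.path(forResource: "keyword_filter", ofType: "tflite"),
   let encode = Bundle.main.path(forResource: "keyword_encode", ofType: "tflite"),
   let detect = Bundle.main.path(forResource: "keyword_detect", ofType: "tflite") {
    config.keywordFilterModelPath = filter
    config.keywordEncodeModelPath = encode
    config.keywordDetectModelPath = detect
}
```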
-
The name of the window function to apply to each audio frame before calculating the STFT.
Remark
Currently the “hann” window is supported.
See also
TFLiteKeywordRecognizer
Declaration
Swift
public var keywordFFTWindowType: SignalProcessing.FFTWindowType
-
The size of the signal window used to calculate the STFT, in number of samples; should be a power of 2 for maximum efficiency.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordFFTWindowSize: Int
-
The length of time to skip each time the overlapping STFT is calculated, in milliseconds.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordFFTHopLength: Int
-
The length of a frame in the mel spectrogram used as an input to the keyword recognizer encoder, in milliseconds.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordMelFrameLength: Int
-
The number of filterbank components in each mel spectrogram frame sent to the keyword recognizer.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordMelFrameWidth: Int
-
The size of the keyword recognizer’s encoder window output.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordEncodeWidth: Int
-
The length of the sliding window of encoder output used as an input to the keyword recognizer classifier, in milliseconds.
See also
TFLiteKeywordRecognizer
Declaration
Swift
@objc public var keywordEncodeLength: Int
-
Timeout in seconds used for semaphore waits in the speech pipeline.
Warning
There is not normally a need to change this value.
Declaration
Swift
@objc public var semaphoreTimeout: Double