BaseSTTProvider

Abstract base class shared by every STT provider in CompositeVoice.

Defined in: src/providers/base/BaseSTTProvider.ts:75

Remarks

BaseSTTProvider sits between BaseProvider and the two transport-specific bases (LiveSTTProvider and RestSTTProvider). It adds the transcription callback mechanism that all STT providers use to deliver results back to the SDK core, along with guard methods that the orchestrator uses to interpret each transcription result.

The handler/guard pattern:

  1. Handler methods (processAudio) receive data and act on it, for example by forwarding audio to the recognition engine.
  2. Guard methods (isUtteranceComplete, isPreflight, isInterim, isFinal) each test one condition on a result and return a boolean.
  3. Callbacks (onTranscription / emitTranscription) deliver raw results to the orchestrator.

Guard methods have sensible defaults in this base class. Concrete providers override them when they have domain-specific logic (e.g., DeepgramSTT might override isUtteranceComplete based on endpointing config).
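
The pattern above can be sketched without the SDK. This is a minimal illustration, not the library's implementation: SketchResult, SketchSTTBase, EndpointingSTT, routeResult, the speechFinal flag, and the default guard bodies are all assumptions made for the example.

```typescript
// Stand-in result shape; the real TranscriptionResult may carry more fields.
interface SketchResult {
  text: string;
  isFinal: boolean;
  speechFinal?: boolean; // hypothetical provider-specific endpointing flag
}

// Stand-in base class with assumed default guards.
class SketchSTTBase {
  isInterim(r: SketchResult): boolean {
    return !r.isFinal;
  }
  isFinal(r: SketchResult): boolean {
    return r.isFinal;
  }
  isUtteranceComplete(r: SketchResult): boolean {
    return r.isFinal; // assumed default: every final result ends the utterance
  }
}

// A provider with its own endpointing rule overrides only the guard it needs.
class EndpointingSTT extends SketchSTTBase {
  isUtteranceComplete(r: SketchResult): boolean {
    return r.isFinal && r.speechFinal === true;
  }
}

type RouteAction = 'display' | 'buffer' | 'send-to-llm';

// Orchestrator-side routing that relies only on the guards.
function routeResult(provider: SketchSTTBase, r: SketchResult): RouteAction {
  if (provider.isUtteranceComplete(r)) return 'send-to-llm';
  if (provider.isFinal(r)) return 'buffer';
  return 'display';
}
```

Because the routing function only calls guards, swapping in a provider with different endpointing rules changes the outcome without touching the orchestrator code.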

Inheritance hierarchy:

BaseProvider
 +-- BaseSTTProvider          <-- you are here
      +-- LiveSTTProvider     (WebSocket real-time STT)
      +-- RestSTTProvider     (REST file-based STT)

You typically do not extend BaseSTTProvider directly. Instead, extend LiveSTTProvider for streaming/WebSocket providers or RestSTTProvider for batch/file-based providers.

Example

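import { BaseSTTProvider } from 'composite-voice';
import type { STTProviderConfig, TranscriptionResult } from 'composite-voice';

// Hypothetical recognition engine, declared here only so the example type-checks.
declare const myCustomEngine: { recognize(audio: Blob): Promise<string> };

class CustomSTTProvider extends BaseSTTProvider {
  constructor(config: STTProviderConfig) {
    super('rest', config); // transport type: 'rest' or 'websocket'
  }

  // Lifecycle hooks required by the base class; nothing to set up or release here.
  protected async onInitialize(): Promise<void> {}
  protected async onDispose(): Promise<void> {}

  // Required handler; a no-op because this provider transcribes whole recordings.
  processAudio(_chunk: ArrayBuffer): void {}

  // Transcribe a complete recording and deliver the text through the registered callback.
  async transcribe(audio: Blob): Promise<void> {
    const text = await myCustomEngine.recognize(audio);
    const result: TranscriptionResult = { text, isFinal: true, confidence: 1.0 };
    this.emitTranscription(result);
  }
}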

See

  • LiveSTTProvider for WebSocket-based real-time STT
  • RestSTTProvider for REST/file-based STT
  • BaseProvider for the root provider lifecycle

Extends

  • BaseProviderClass

Constructors

Constructor

new BaseSTTProvider(
   type, 
   config, 
   logger?): BaseSTTProvider;

Defined in: src/providers/base/BaseSTTProvider.ts:95

Create a new STT provider.

Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| type | "rest" \| "websocket" | Transport type ('rest' or 'websocket'). |
| config | STTProviderConfig | STT provider configuration. |
| logger? | Logger | Optional parent logger; a child will be derived. |

Returns

BaseSTTProvider

Overrides

BaseProviderClass.constructor

Properties

| Property | Modifier | Type | Default value | Description | Overrides | Inherited from | Defined in |
| --- | --- | --- | --- | --- | --- | --- | --- |
| config | public | STTProviderConfig | undefined | STT-specific provider configuration. | BaseProviderClass.config | - | src/providers/base/BaseSTTProvider.ts:80 |
| initialized | protected | boolean | false | Tracks whether initialize has completed successfully. | - | BaseProviderClass.initialized | src/providers/base/BaseProvider.ts:97 |
| logger | protected | Logger | undefined | Scoped logger instance for this provider. | - | BaseProviderClass.logger | src/providers/base/BaseProvider.ts:94 |
| roles | readonly | readonly ProviderRole[] | undefined | STT providers cover the 'stt' pipeline role by default. | BaseProviderClass.roles | - | src/providers/base/BaseSTTProvider.ts:77 |
| transcriptionCallback? | protected | (result) => void | undefined | Callback registered by the SDK or consumer to receive transcription results. Set via onTranscription. | - | - | src/providers/base/BaseSTTProvider.ts:86 |
| type | readonly | ProviderType | undefined | Communication transport this provider uses ('rest' or 'websocket'). | - | BaseProviderClass.type | src/providers/base/BaseProvider.ts:74 |

Accessors

isProxyMode

Get Signature

protected get isProxyMode(): boolean;

Defined in: src/providers/base/BaseProvider.ts:286

Whether the provider is in proxy mode.

Returns

boolean

true when proxyUrl is set.

Inherited from

BaseProviderClass.isProxyMode

Methods

assertAuth()

protected assertAuth(): void;

Defined in: src/providers/base/BaseProvider.ts:272

Validate that auth is configured (either apiKey or proxyUrl).

Returns

void

Remarks

Call this in onInitialize() for any provider that requires external authentication. Native providers (NativeSTT, NativeTTS) and in-browser providers (WebLLM) should NOT call this method.

Throws

ProviderInitializationError Thrown when neither apiKey nor proxyUrl is set.
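
A minimal sketch of the check described above, assuming the rule is simply "at least one of apiKey or proxyUrl must be set". AuthSketchError, AuthSketchConfig, and assertAuthSketch are hypothetical stand-ins; the real method throws ProviderInitializationError and reads the provider's own config.

```typescript
// Stand-in for the SDK's ProviderInitializationError; the real class may differ.
class AuthSketchError extends Error {
  constructor(message: string) {
    super(message);
    this.name = 'AuthSketchError';
  }
}

interface AuthSketchConfig {
  apiKey?: string;
  proxyUrl?: string;
}

// Throws unless some form of authentication is configured.
function assertAuthSketch(providerName: string, config: AuthSketchConfig): void {
  if (!config.apiKey && !config.proxyUrl) {
    throw new AuthSketchError(`${providerName}: set either apiKey or proxyUrl`);
  }
}
```

In a real provider the equivalent call would sit at the top of onInitialize(), so a misconfigured provider fails before any connection is attempted.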

Inherited from

BaseProviderClass.assertAuth


assertReady()

protected assertReady(): void;

Defined in: src/providers/base/BaseProvider.ts:255

Guard that throws if the provider has not been initialized.

Returns

void

Remarks

Call at the start of any method that requires the provider to be ready.

Throws

Error Thrown with a descriptive message when initialized is false.

Inherited from

BaseProviderClass.assertReady


dispose()

dispose(): Promise<void>;

Defined in: src/providers/base/BaseProvider.ts:154

Clean up resources and dispose of the provider.

Returns

Promise<void>

Remarks

Delegates to the subclass hook onDispose and resets the initialized flag. If the provider is not initialized, the call is a no-op.

Throws

Re-throws any error raised by onDispose.

Inherited from

BaseProviderClass.dispose


emitTranscription()

protected emitTranscription(result): void;

Defined in: src/providers/base/BaseSTTProvider.ts:206

Emit a transcription result to the registered callback.

Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| result | TranscriptionResult | The transcription result to emit. |

Returns

void

Remarks

Subclasses call this method whenever transcribed text is available. If no callback has been registered via onTranscription, the result is logged as a warning and dropped.
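
The register-then-emit flow, including the drop-with-warning case, might look like the sketch below. EmitSketchProvider and EmitSketchResult are stand-ins invented for this example, and warnings are collected in an array (rather than sent to a logger) so the behaviour is easy to observe.

```typescript
interface EmitSketchResult {
  text: string;
  isFinal: boolean;
}

class EmitSketchProvider {
  // Collected warning messages; the real provider would write to its logger.
  readonly warnings: string[] = [];
  private transcriptionCallback?: (result: EmitSketchResult) => void;

  onTranscription(callback: (result: EmitSketchResult) => void): void {
    this.transcriptionCallback = callback;
  }

  // Public here only so the sketch can be exercised directly;
  // the documented method is protected.
  emitTranscription(result: EmitSketchResult): void {
    if (!this.transcriptionCallback) {
      this.warnings.push(`No transcription callback registered; dropping "${result.text}"`);
      return;
    }
    this.transcriptionCallback(result);
  }
}
```

The practical consequence is ordering: a result emitted before a callback is registered is lost, so the callback should be in place before audio starts flowing.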


getConfig()

getConfig(): STTProviderConfig;

Defined in: src/providers/base/BaseSTTProvider.ts:225

Get a shallow copy of the current STT configuration.

Returns

STTProviderConfig

A new STTProviderConfig object.
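
"Shallow" matters when the configuration holds nested values. The sketch below uses object spread to make the copy, which is an assumption about how the copy is produced; NestedConfigSketch, its keywords field, and ShallowCopySketch are hypothetical.

```typescript
interface NestedConfigSketch {
  language: string;
  keywords: string[]; // hypothetical nested field
}

class ShallowCopySketch {
  constructor(private config: NestedConfigSketch) {}

  // Copies top-level keys only; nested objects and arrays are shared.
  getConfig(): NestedConfigSketch {
    return { ...this.config };
  }
}

function demoShallowCopy(): NestedConfigSketch {
  const provider = new ShallowCopySketch({ language: 'en', keywords: ['voice'] });
  const copy = provider.getConfig();
  copy.language = 'de';        // top-level write: the provider's config is unaffected
  copy.keywords.push('agent'); // nested write: the array is shared with the provider
  return provider.getConfig();
}
```

Under these assumptions, demoShallowCopy() returns a config whose language is still 'en' but whose keywords array now has two entries, so nested values from a shallow copy are best treated as read-only.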

Overrides

BaseProviderClass.getConfig


initialize()

initialize(): Promise<void>;

Defined in: src/providers/base/BaseProvider.ts:127

Initialize the provider, making it ready for use.

Returns

Promise<void>

Remarks

Calls the subclass hook onInitialize. If the provider has already been initialized the call is a no-op.

Throws

ProviderInitializationError Thrown when onInitialize rejects. The original error is wrapped with the provider class name for diagnostics.
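
The idempotent initialize/dispose behaviour described in this reference can be sketched as a small template-method class. LifecycleSketch, SketchInitError, and CountingProvider are stand-ins; in particular, resetting the flag only after onDispose succeeds is an assumption of this sketch.

```typescript
// Stand-in for ProviderInitializationError; the real class may differ.
class SketchInitError extends Error {
  constructor(providerName: string, readonly original: unknown) {
    super(`${providerName} failed to initialize: ${String(original)}`);
    this.name = 'SketchInitError';
  }
}

abstract class LifecycleSketch {
  protected initialized = false;

  protected abstract onInitialize(): Promise<void>;
  protected abstract onDispose(): Promise<void>;

  async initialize(): Promise<void> {
    if (this.initialized) return; // repeated calls are no-ops
    try {
      await this.onInitialize();
    } catch (err) {
      // Wrap the original error together with the class name for diagnostics.
      throw new SketchInitError(this.constructor.name, err);
    }
    this.initialized = true;
  }

  async dispose(): Promise<void> {
    if (!this.initialized) return; // nothing to release
    await this.onDispose();        // errors from the hook propagate to the caller
    this.initialized = false;
  }

  isReady(): boolean {
    return this.initialized;
  }
}

// Example subclass that counts how often each hook runs.
class CountingProvider extends LifecycleSketch {
  initCalls = 0;
  disposeCalls = 0;
  protected async onInitialize(): Promise<void> {
    this.initCalls += 1;
  }
  protected async onDispose(): Promise<void> {
    this.disposeCalls += 1;
  }
}
```

With this shape, callers can invoke initialize() defensively from several places without running the setup hook more than once.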

Inherited from

BaseProviderClass.initialize


isFinal()

isFinal(result): boolean;

Defined in: src/providers/base/BaseSTTProvider.ts:174

Is this a final segment (but not necessarily utterance-complete)?

Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| result | TranscriptionResult | The transcription result to check. |

Returns

boolean

true when this is a final segment.

Remarks

A final segment represents committed text, but multi-segment providers (e.g., Deepgram) may emit several final segments for a single utterance. Only the last one will have isUtteranceComplete return true.
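
One way a consumer could stitch several final segments into a single utterance is sketched below. SegmentResult and UtteranceAssembler are hypothetical; the utteranceComplete field stands in for whatever the provider's isUtteranceComplete guard would return for that result.

```typescript
interface SegmentResult {
  text: string;
  isFinal: boolean;
  utteranceComplete: boolean; // stand-in for the isUtteranceComplete guard's answer
}

class UtteranceAssembler {
  private segments: string[] = [];

  // Returns the full utterance once it is complete, otherwise undefined.
  push(r: SegmentResult): string | undefined {
    if (!r.isFinal) return undefined; // interim text is never committed
    this.segments.push(r.text.trim());
    if (!r.utteranceComplete) return undefined;
    const utterance = this.segments.join(' ');
    this.segments = []; // start fresh for the next utterance
    return utterance;
  }
}
```

Interim results are ignored here because a later result replaces them; only committed segments contribute to the text handed on for further processing.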


isInterim()

isInterim(result): boolean;

Defined in: src/providers/base/BaseSTTProvider.ts:159

Is this an interim (partial, non-final) result?

Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| result | TranscriptionResult | The transcription result to check. |

Returns

boolean

true when this is an interim result.

Remarks

Interim results update as the user speaks and are replaced by subsequent results. Useful for display but not for triggering downstream processing.


isPreflight()

isPreflight(result): boolean;

Defined in: src/providers/base/BaseSTTProvider.ts:144

Is this a preflight/eager end-of-turn signal?

Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| result | TranscriptionResult | The transcription result to check. |

Returns

boolean

true when this is a preflight signal.

Remarks

Used by the eager LLM pipeline for speculative generation. Only providers with preflight support (e.g., Deepgram Flux) need to override this.


isReady()

isReady(): boolean;

Defined in: src/providers/base/BaseProvider.ts:178

Check whether the provider has been initialized and is ready.

Returns

boolean

true when initialize has completed successfully and dispose has not yet been called.

Inherited from

BaseProviderClass.isReady


isUtteranceComplete()

isUtteranceComplete(result): boolean;

Defined in: src/providers/base/BaseSTTProvider.ts:129

Is this result a complete utterance ready for LLM processing?

Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| result | TranscriptionResult | The transcription result to check. |

Returns

boolean

true when the utterance is complete.

Remarks

The orchestrator calls this to decide when to send transcribed text to the LLM. Concrete providers override this when they have domain-specific endpointing logic (e.g., DeepgramSTT checks speech_final).


onConfigUpdate()

protected onConfigUpdate(_config): void;

Defined in: src/providers/base/BaseProvider.ts:242

Hook called after updateConfig merges new values.

Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| _config | Partial<BaseProviderConfig> | The partial configuration that was merged. |

Returns

void

Remarks

The default implementation is a no-op. Override in subclasses to react to runtime configuration changes (e.g. reconnect with a new API key).

Inherited from

BaseProviderClass.onConfigUpdate


onDispose()

protected abstract onDispose(): Promise<void>;

Defined in: src/providers/base/BaseProvider.ts:229

Provider-specific disposal logic.

Returns

Promise<void>

Remarks

Subclasses must implement this method to release any resources acquired during onInitialize (e.g. close connections, free memory).

Inherited from

BaseProviderClass.onDispose


onInitialize()

protected abstract onInitialize(): Promise<void>;

Defined in: src/providers/base/BaseProvider.ts:217

Provider-specific initialization logic.

Returns

Promise<void>

Remarks

Subclasses must implement this method to perform any setup required before the provider can be used (e.g. validate credentials, open connections, load models).

Inherited from

BaseProviderClass.onInitialize


onTranscription()

onTranscription(callback): void;

Defined in: src/providers/base/BaseSTTProvider.ts:191

Register a callback to receive transcription results.

Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| callback | (result) => void | Function invoked with each TranscriptionResult. |

Returns

void

Remarks

All STT providers — regardless of transport — deliver text through this callback. CompositeVoice registers it during pipeline setup so that transcription results flow into the conversation manager and, ultimately, the LLM provider.


processAudio()

abstract processAudio(chunk): void;

Defined in: src/providers/base/BaseSTTProvider.ts:114

Process a raw audio chunk.

Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| chunk | ArrayBuffer | Raw audio data as an ArrayBuffer. |

Returns

void

Remarks

Called by the orchestrator to send audio for processing. For live providers this delegates to WebSocket send; for REST providers this is typically a no-op (use transcribe instead).


resolveApiKey()

protected resolveApiKey(): string;

Defined in: src/providers/base/BaseProvider.ts:325

Resolve the API key for this provider.

Returns

string

The configured API key, or 'proxy' in proxy mode.

Remarks

Returns 'proxy' in proxy mode so that SDK clients (which require a non-empty API key string) can be instantiated without the real key.
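
As a rough sketch of that rule: ApiKeySketchConfig and resolveApiKeySketch are hypothetical names, and returning an empty string when nothing is configured is an assumption, since this reference does not describe that case.

```typescript
interface ApiKeySketchConfig {
  apiKey?: string;
  proxyUrl?: string;
}

function resolveApiKeySketch(config: ApiKeySketchConfig): string {
  // Proxy mode: hand SDK clients a non-empty placeholder instead of a real key.
  if (config.proxyUrl) return 'proxy';
  // Direct mode: use the configured key (assumption: '' when none is set).
  return config.apiKey ?? '';
}
```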

Inherited from

BaseProviderClass.resolveApiKey


resolveAuthHeader()

protected resolveAuthHeader(defaultAuthType?): string | undefined;

Defined in: src/providers/base/BaseProvider.ts:366

Resolve Authorization header value for the configured auth type.

Parameters

| Parameter | Type | Default value | Description |
| --- | --- | --- | --- |
| defaultAuthType | "token" \| "bearer" | 'token' | The default auth type for this provider. |

Returns

string | undefined

The Authorization header value, or undefined in proxy mode.

Remarks

Returns the header value for REST or server-side WebSocket connections:

  • 'token' -> 'Token <apiKey>'
  • 'bearer' -> 'Bearer <apiKey>'

Returns undefined in proxy mode.
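
The mapping can be sketched as a pure function. HeaderSketchConfig and resolveAuthHeaderSketch are hypothetical, as is the choice to return undefined when no API key is set.

```typescript
type HeaderAuthType = 'token' | 'bearer';

interface HeaderSketchConfig {
  apiKey?: string;
  proxyUrl?: string;
  authType?: HeaderAuthType;
}

function resolveAuthHeaderSketch(
  config: HeaderSketchConfig,
  defaultAuthType: HeaderAuthType = 'token',
): string | undefined {
  // Proxy mode (or no key at all, an assumption): no header from the client.
  if (config.proxyUrl || !config.apiKey) return undefined;
  const authType = config.authType ?? defaultAuthType;
  return authType === 'bearer' ? `Bearer ${config.apiKey}` : `Token ${config.apiKey}`;
}
```

A caller would typically spread the result into request headers only when it is defined, for example `headers: value ? { Authorization: value } : {}`.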

Inherited from

BaseProviderClass.resolveAuthHeader


resolveBaseUrl()

protected resolveBaseUrl(defaultUrl?): string | undefined;

Defined in: src/providers/base/BaseProvider.ts:307

Resolve the base URL for this provider.

Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| defaultUrl? | string | The provider’s default API URL. Pass undefined to let the underlying SDK use its own default. |

Returns

string | undefined

The resolved URL, or undefined when all sources are unset.

Remarks

Priority: proxyUrl > endpoint > defaultUrl.

For WebSocket providers (this.type === 'websocket'), the proxy URL’s http(s) scheme is automatically converted to ws(s).

When no URL is configured and defaultUrl is undefined, the return value is undefined — this lets SDK-based providers (Anthropic, OpenAI) fall back to their own built-in defaults.
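
The priority order and scheme conversion can be illustrated with a small pure function. This is a sketch under stated assumptions, not the library's code: UrlSketchConfig and resolveBaseUrlSketch are hypothetical, the scheme rewrite is done with a simple prefix replacement, and an empty proxyUrl is treated as unset.

```typescript
interface UrlSketchConfig {
  proxyUrl?: string;
  endpoint?: string;
}

function resolveBaseUrlSketch(
  transport: 'rest' | 'websocket',
  config: UrlSketchConfig,
  defaultUrl?: string,
): string | undefined {
  if (config.proxyUrl) {
    // Highest priority. For WebSocket transports, http -> ws and https -> wss.
    return transport === 'websocket'
      ? config.proxyUrl.replace(/^http/, 'ws')
      : config.proxyUrl;
  }
  // Next the explicit endpoint, then the provider default (possibly undefined).
  return config.endpoint ?? defaultUrl;
}
```

For example, a WebSocket provider configured with proxyUrl 'https://example.com/proxy' resolves to 'wss://example.com/proxy' regardless of endpoint or defaultUrl, while a provider with nothing configured and no default resolves to undefined.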

Inherited from

BaseProviderClass.resolveBaseUrl


resolveWsProtocols()

protected resolveWsProtocols(defaultAuthType?): string[] | undefined;

Defined in: src/providers/base/BaseProvider.ts:343

Resolve WebSocket subprotocol for authentication.

Parameters

| Parameter | Type | Default value | Description |
| --- | --- | --- | --- |
| defaultAuthType | "token" \| "bearer" | 'token' | The default auth type for this provider. |

Returns

string[] | undefined

Subprotocol array for new WebSocket(url, protocols), or undefined.

Remarks

Returns the subprotocol array for direct mode based on authType:

  • 'token' -> ['token', apiKey] (Deepgram default)
  • 'bearer' -> ['bearer', apiKey] (OAuth/Bearer tokens)

Returns undefined in proxy mode (no client-side auth needed).
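
A sketch of the same rule as a standalone function, assuming that a missing API key also yields undefined. WsAuthSketchConfig and resolveWsProtocolsSketch are hypothetical names.

```typescript
type WsAuthType = 'token' | 'bearer';

interface WsAuthSketchConfig {
  apiKey?: string;
  proxyUrl?: string;
  authType?: WsAuthType;
}

function resolveWsProtocolsSketch(
  config: WsAuthSketchConfig,
  defaultAuthType: WsAuthType = 'token',
): string[] | undefined {
  // Proxy mode (or no key at all, an assumption): no client-side subprotocols.
  if (config.proxyUrl || !config.apiKey) return undefined;
  return [config.authType ?? defaultAuthType, config.apiKey];
}
```

The returned array is meant to be passed as the second argument of the standard WebSocket constructor, `new WebSocket(url, protocols)`; passing undefined there opens the socket without subprotocols.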

Inherited from

BaseProviderClass.resolveWsProtocols


updateConfig()

updateConfig(config): void;

Defined in: src/providers/base/BaseProvider.ts:201

Merge partial configuration updates into the current config.

Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| config | Partial<BaseProviderConfig> | A partial configuration object whose keys will overwrite existing values. |

Returns

void

Remarks

After merging, the subclass hook onConfigUpdate is called so providers can react to changed values at runtime.
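
The merge-then-notify sequence might be sketched as follows. RuntimeConfigSketch, ConfigurableSketch, and ReconnectingSketch are hypothetical, and the merge is assumed to be a shallow object spread; the reconnect counter stands in for whatever a real provider would do when its API key changes.

```typescript
interface RuntimeConfigSketch {
  apiKey?: string;
  endpoint?: string;
}

class ConfigurableSketch {
  constructor(protected config: RuntimeConfigSketch) {}

  updateConfig(partial: Partial<RuntimeConfigSketch>): void {
    this.config = { ...this.config, ...partial }; // keys in partial overwrite existing ones
    this.onConfigUpdate(partial);                 // hook runs after the merge
  }

  getConfig(): RuntimeConfigSketch {
    return { ...this.config };
  }

  // Default hook does nothing; subclasses override it to react to changes.
  protected onConfigUpdate(_partial: Partial<RuntimeConfigSketch>): void {}
}

class ReconnectingSketch extends ConfigurableSketch {
  reconnects = 0;

  protected onConfigUpdate(partial: Partial<RuntimeConfigSketch>): void {
    // Stand-in for reopening a connection when the key changes.
    if (partial.apiKey !== undefined) this.reconnects += 1;
  }
}
```

Because the hook receives only the partial update, a subclass can react to the keys that actually changed and ignore the rest.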

Inherited from

BaseProviderClass.updateConfig

© 2026 CompositeVoice. All rights reserved.
