Pytia

Speech-to-text (ASR) integration

Pytia is a client for ASR (automatic speech recognition) services. It provides a set of connectors for specific external services: a connector sends the audio stream to its service and receives the recognized text in return.
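The connector idea can be sketched as a small registry keyed by the technology name. This is only an illustration under stated assumptions: the names AsrConnector, EchoConnector, CONNECTORS and connector_for are hypothetical and are not part of Pytia, and the stand-in connector performs no real recognition.

```python
from abc import ABC, abstractmethod
from typing import Dict, Iterable, Type


class AsrConnector(ABC):
    """Hypothetical base class: one subclass per external ASR vendor."""

    @abstractmethod
    def recognize(self, audio_chunks: Iterable[bytes]) -> str:
        """Send the audio stream to the vendor and return the recognized text."""


class EchoConnector(AsrConnector):
    """Stand-in used only for this illustration; it does no recognition."""

    def recognize(self, audio_chunks: Iterable[bytes]) -> str:
        total = sum(len(chunk) for chunk in audio_chunks)
        return f"<{total} bytes of audio received>"


# Assumed mapping from the `technology` setting to a connector class.
CONNECTORS: Dict[str, Type[AsrConnector]] = {"TECHMO": EchoConnector}


def connector_for(technology: str) -> AsrConnector:
    """Return a connector instance for the configured technology name."""
    try:
        connector_class = CONNECTORS[technology.upper()]
    except KeyError:
        raise ValueError(f"no connector registered for {technology!r}") from None
    return connector_class()
```

With such a registry, supporting another vendor would mean adding one class and one dictionary entry, while the calling code keeps using connector_for with the configured value.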

Configuration

voice.gateway {

  pytia {
    technology = TECHMO
    api {
      base-url = "0.0.0.0:8081"
      api-key = "xxx"
    }
    
    grpc-endpoint-port = 6443
    
    service-availability {
      ping-interval = 10d
      failure-limit = 2
      failure-time-window = 5m
      enabled = true
    }
  }
  techmo.engines = [
    {
      host = "asr3-istio.example.com"
      use-plain-text = false
      port = 8443
      name = "techmo-1"
    }
  ]
}

Field descriptions

  • technology - name of the ASR vendor; selects the connector used for the service
  • api - parameters of the service's REST API (base URL and API key)
  • grpc-endpoint-port - port for direct access to the speech recognition service (used in the Automate chat tester)
  • service-availability - settings for checking the availability of the ASR service
  • techmo.engines - list of ASR engines to connect to; each entry has a host, a port, a name used for identification, and a communication method (the use-plain-text option)
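The documentation does not spell out how failure-limit and failure-time-window interact. One plausible reading, shown here purely as an assumption, is that the service is treated as unavailable once failure-limit failed pings fall within the last failure-time-window. The class AvailabilityTracker and its methods are hypothetical names used only for this sketch.

```python
from collections import deque
from datetime import datetime, timedelta


class AvailabilityTracker:
    """Sliding-window failure counter (assumed semantics, not Pytia's code)."""

    def __init__(self, failure_limit: int, failure_time_window: timedelta):
        self.failure_limit = failure_limit
        self.failure_time_window = failure_time_window
        self._failures = deque()  # timestamps of failed pings, oldest first

    def record_ping(self, ok: bool, now: datetime) -> None:
        """Store the outcome of one availability ping."""
        if not ok:
            self._failures.append(now)
        self._drop_expired(now)

    def is_available(self, now: datetime) -> bool:
        """Available while fewer than failure_limit failures are in the window."""
        self._drop_expired(now)
        return len(self._failures) < self.failure_limit

    def _drop_expired(self, now: datetime) -> None:
        # Forget failures that are older than the time window.
        cutoff = now - self.failure_time_window
        while self._failures and self._failures[0] <= cutoff:
            self._failures.popleft()
```

With the values from the configuration above (failure-limit = 2, failure-time-window = 5m), two failed pings one minute apart would mark the service unavailable under this reading, and it would count as available again once the first failure is more than five minutes old.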