Skip to content
Navigation Menu

IBM Cloud

  • CatalogCatalog
  • Cost EstimatorCost Estimator
  • DocsDocs
  • Catalog
  • Cost Estimator
  • Docs

  • Navigation settings
Confirm
Do you want to log out?
CancelLog out

Error

Two-factor AuthenticationAuthentication Failed

Please answer the security question you selected for the following account:

Two-factor authentication is enabled for the following account:

Phone authentication is enabled for the following account:

  • Loading...
    Need help? Call us at 1-866-325-0045 and select option 2.

    Please wait for phone authentication...

    Invalid answer provided for security question. Please try again or cancel the action.

    Invalid code provided. Please try again or cancel the action.

    Phone authentication is timed out, Please cancel the action and try again later.

    Too many fail attempts. Please cancel the action and try again later.

    Authentication failed. Please try again or cancel the action.

    Change theme

    This feature is in early stage, some parts of the platform might not fully support different themes yet.

    • Log in
    • Sign up
    1. Catalog
    2. Services

    Speech to Text-dev

    • IBM
    • Date of last update: 12/18/2020
    • Docs
    • API docs

    Pricing plans

    PlanFeaturesPricing

    Summary

    Speech to Text-dev

      Already have an account? Log in
      Type
      • Service
      Provider
      • IBM
      Category
      • AI / Machine Learning
      Compliance
      • IAM-enabled
      • Service Endpoint Supported
      Related links
      • API docs
      • Docs
      • Terms

      Summary

      The Speech to Text service converts the human voice into the written word. It can be used anywhere there is a need to bridge the gap between the spoken word and their written form, including voice control of embedded systems, transcription of meetings and conference calls, and dictation of email and notes. This easy-to-use service uses machine intelligence to combine information about grammar and language structure with knowledge of the composition of the audio signal to generate an accurate transcription. The following languages and features are currently available:

      Features

      Available Languages

      English (US), English (UK), Japanese, Arabic (MSA, Broadband model only), Mandarin, Portuguese (Brazil), Spanish, French (Broadband model only), Korean

      Metadata

      Receive a metadata object in the JSON response that includes confidence score (per word), start/end time (per word), and alternate hypotheses / N-Best (per phrase). A new option for returning word alternatives per (sequential) time intervals is now available.

      Mobile SDKs (BETA)

      Mobile SDKs are now available to enable native interaction on iOS and Android devices.

      Keyword Spotting (BETA)

      Optional ability to search for one or more keywords in the audio stream. The returned metadata includes the beginning time, end time and confidence score for each instance of the keyword found. Keyword Spotting is currently available at no additional charge.

      SoftBank

      A localized version of this Watson service is available in Japan. Visit the following link for details: http://www.softbank.jp/biz/watson