[Day 22] Google Cloud Speech-to-Text - 2 - iT 邦幫忙::一起幫忙解決難題，拯救 IT 人的一天

第 11 屆 iThome 鐵人賽

DAY 22

Google Developers Machine Learning

Overview of Machine Learning Products系列第 22 篇

[Day 22] Google Cloud Speech-to-Text - 2

11th鐵人賽 cloud speech-to-text google development

Joseph-bug

2019-09-29 23:48:56

1877 瀏覽

分享至

這個步調而言，今天就是Cloud Speech-to-Text API串接，前情提要一樣是要先建立project、Enable API、下載credential json之類的。忘了的人記得看第三天的文章。

好，現在要先來把test data抓下來，我們可以在google的github上找到很多檔案可以測試，我這邊抓的是audio.raw，並把它放到testdata/speech_to_text資料夾下。
file structure

萬事俱備就只欠東風，我們來看看demo code吧：

func DemoCode(filename string) {
  ctx := context.Background()

  // Creates a client.
  client, err := speech.NewClient(ctx)
  if err != nil {
    log.Fatalf("Failed to create client: %v", err)
  }

  // Reads the audio file into memory.
  data, err := ioutil.ReadFile(filename)
  if err != nil {
    log.Fatalf("Failed to read file: %v", err)
  }

  // Detects speech in the audio file.
  resp, err := client.Recognize(ctx, &speechpb.RecognizeRequest{
    Config: &speechpb.RecognitionConfig{
      Encoding:        speechpb.RecognitionConfig_LINEAR16,
      SampleRateHertz: 16000,
      LanguageCode:    "en-US",
    },
    Audio: &speechpb.RecognitionAudio{
      AudioSource: &speechpb.RecognitionAudio_Content{Content: data},
    },
  })
  if err != nil {
    log.Fatalf("failed to recognize: %v", err)
  }

  // Prints the results.
  for _, result := range resp.Results {
    for _, alt := range result.Alternatives {
      fmt.Printf("\"%v\" (confidence=%3f)\n", alt.Transcript, alt.Confidence)
    }
  }
}