深度学习之语音识别

参考

  1. nobody132/masr: 中文语音识别; Mandarin Automatic … - GitHubgithub.com › nobody132 › masr
  2. nl8590687/ASRT_SpeechRecognition: A Deep … - GitHubgithub.com › ASRT_SpeechRecognition
  3. audier/DeepSpeechRecognition: A Chinese Deep … - GitHubgithub.com › audier › DeepSpeechRecognition
  4. xxbb1234021/speech_recognition: 中文语音识别 - GitHubgithub.com › xxbb1234021 › speech_recognition
  5. 从视频中提取音频——Python三行程序,的,python,搞定www.pythonf.cn › read
  6. 人人都能看懂的LSTM - 知乎zhuanlan.zhihu.com › …
  7. deepspeech项目star:16.9k
  8. 使用google接口进行语音识别:speech_recognition
  9. 相关paper: GitHub
  10. github speech recognition topics排行榜
  11. 滴滴语音识别开源
  12. 2021最新语音识别开源WENET

不OK的项目

  1. speech_recognition

基础

视频提取音频

# -*- coding:utf-8 _*-
"""
Author: 
Email: 
Date: 2021/03/23
File: audio_test.py
Software: PyCharm
Description: 音频识别
"""


# load modules
import os
import sys
from moviepy.editor import AudioFileClip

current_path = sys.path[0]
video_path = '%s/video/' % current_path  # 视频源文件目录
video_name = os.listdir(video_path)[0]
audio_path = '%s/audio_extracts/' % current_path  # 音频提取文件目录
if not os.path.exists(audio_path):
    os.mkdir(audio_path)
audio_name = video_name.split('.')[0] + '.wav'

# 音频提取
my_audio_clip = AudioFileClip(video_path+video_name)
my_audio_clip.write_audiofile(audio_path+audio_name)

# 音频分析
import numpy as np
import librosa
audio, freq = librosa.load(audio_path+audio_name)
time = np.arange(0, len(audio)) / freq
print(len(audio), type(audio), freq, sep="\t")

# 画信号增强图
import librosa.display
import matplotlib.pyplot as plt
audio, _ = librosa.effects.trim(audio)  # Trim leading and trailing #silence from an audio signal.
librosa.display.waveplot(audio, sr=freq)
plt.show()

音频识别

使用接口

speech-recognition:音频识别相关topic


文章作者: Myhaa
版权声明: 本博客所有文章除特別声明外,均采用 CC BY 4.0 许可协议。转载请注明来源 Myhaa !
评论
  目录