参考
- nobody132/masr: 中文语音识别; Mandarin Automatic … - GitHubgithub.com › nobody132 › masr
- nl8590687/ASRT_SpeechRecognition: A Deep … - GitHubgithub.com › ASRT_SpeechRecognition
- audier/DeepSpeechRecognition: A Chinese Deep … - GitHubgithub.com › audier › DeepSpeechRecognition
- xxbb1234021/speech_recognition: 中文语音识别 - GitHubgithub.com › xxbb1234021 › speech_recognition
- 从视频中提取音频——Python三行程序,的,python,搞定www.pythonf.cn › read
- 人人都能看懂的LSTM - 知乎zhuanlan.zhihu.com › …
- deepspeech项目:
star:16.9k
- 使用
google
接口进行语音识别:speech_recognition - 相关paper: GitHub
- github speech recognition topics排行榜
- 滴滴语音识别开源
- 2021最新语音识别开源WENET
不OK的项目
基础
视频提取音频
# -*- coding:utf-8 _*-
"""
Author:
Email:
Date: 2021/03/23
File: audio_test.py
Software: PyCharm
Description: 音频识别
"""
# load modules
import os
import sys
from moviepy.editor import AudioFileClip
current_path = sys.path[0]
video_path = '%s/video/' % current_path # 视频源文件目录
video_name = os.listdir(video_path)[0]
audio_path = '%s/audio_extracts/' % current_path # 音频提取文件目录
if not os.path.exists(audio_path):
os.mkdir(audio_path)
audio_name = video_name.split('.')[0] + '.wav'
# 音频提取
my_audio_clip = AudioFileClip(video_path+video_name)
my_audio_clip.write_audiofile(audio_path+audio_name)
# 音频分析
import numpy as np
import librosa
audio, freq = librosa.load(audio_path+audio_name)
time = np.arange(0, len(audio)) / freq
print(len(audio), type(audio), freq, sep="\t")
# 画信号增强图
import librosa.display
import matplotlib.pyplot as plt
audio, _ = librosa.effects.trim(audio) # Trim leading and trailing #silence from an audio signal.
librosa.display.waveplot(audio, sr=freq)
plt.show()
音频识别
使用接口
speech-recognition:音频识别相关topic