update readme
This commit is contained in:
parent
4e4aad07ce
commit
20f9a9c0fb
18
README.md
18
README.md
|
@ -1,7 +1,10 @@
|
||||||
# A Deep-Learning-Based Chinese Speech Recognition System
|
# A Deep-Learning-Based Chinese Speech Recognition System
|
||||||
基于深度学习的中文语音识别系统,如果您觉得喜欢,请点一个 **"Star"** 吧~
|
基于深度学习的中文语音识别系统,如果您觉得喜欢,请点一个 **"Star"** 吧~
|
||||||
|
|
||||||
[![GPL-3.0 Licensed](https://img.shields.io/badge/License-GPL3.0-blue.svg?style=flat)](https://opensource.org/licenses/GPL-3.0) [![TensorFlow Version](https://img.shields.io/badge/Tensorflow-1.4+-blue.svg)](https://www.tensorflow.org/) [![Keras Version](https://img.shields.io/badge/Keras-2.0+-blue.svg)](https://keras.io/) [![Python Version](https://img.shields.io/badge/Python-3.x-blue.svg)](https://www.python.org/)
|
[![GPL-3.0 Licensed](https://img.shields.io/badge/License-GPL3.0-blue.svg?style=flat)](https://opensource.org/licenses/GPL-3.0)
|
||||||
|
[![TensorFlow Version](https://img.shields.io/badge/Tensorflow-1.13+-blue.svg)](https://www.tensorflow.org/)
|
||||||
|
[![Keras Version](https://img.shields.io/badge/Keras-2.2+-blue.svg)](https://keras.io/)
|
||||||
|
[![Python Version](https://img.shields.io/badge/Python-3.x-blue.svg)](https://www.python.org/)
|
||||||
|
|
||||||
**ReadMe Language** | 中文版 | [English](https://github.com/nl8590687/ASRT_SpeechRecognition/blob/master/README_EN.md) |
|
**ReadMe Language** | 中文版 | [English](https://github.com/nl8590687/ASRT_SpeechRecognition/blob/master/README_EN.md) |
|
||||||
|
|
||||||
|
@ -19,6 +22,12 @@ ASRT的原理请查看本文:
|
||||||
* [统计语言模型:从中文拼音到文本](https://blog.ailemon.me/2017/04/27/statistical-language-model-chinese-pinyin-to-words/)
|
* [统计语言模型:从中文拼音到文本](https://blog.ailemon.me/2017/04/27/statistical-language-model-chinese-pinyin-to-words/)
|
||||||
* [无需中文分词算法的简单词频统计](https://blog.ailemon.me/2017/02/20/simple-words-frequency-statistic-without-segmentation-algorithm/)
|
* [无需中文分词算法的简单词频统计](https://blog.ailemon.me/2017/02/20/simple-words-frequency-statistic-without-segmentation-algorithm/)
|
||||||
|
|
||||||
|
关于CTC的问题请看:
|
||||||
|
|
||||||
|
* [[翻译]使用CTC进行序列建模](<https://blog.ailemon.me/2019/07/18/sequence-modeling-with-ctc/>)
|
||||||
|
|
||||||
|
更多内容请访问作者的博客:[AI柠檬博客](https://blog.ailemon.me/)
|
||||||
|
|
||||||
## Introduction 简介
|
## Introduction 简介
|
||||||
|
|
||||||
本项目使用Keras、TensorFlow基于深度卷积神经网络和长短时记忆神经网络、注意力机制以及CTC实现。
|
本项目使用Keras、TensorFlow基于深度卷积神经网络和长短时记忆神经网络、注意力机制以及CTC实现。
|
||||||
|
@ -102,8 +111,8 @@ CNN + LSTM/GRU + CTC
|
||||||
Python的依赖库
|
Python的依赖库
|
||||||
|
|
||||||
* python_speech_features
|
* python_speech_features
|
||||||
* TensorFlow
|
* TensorFlow (1.13+)
|
||||||
* Keras
|
* Keras (2.2+)
|
||||||
* Numpy
|
* Numpy
|
||||||
* wave
|
* wave
|
||||||
* matplotlib
|
* matplotlib
|
||||||
|
@ -116,6 +125,9 @@ Python的依赖库
|
||||||
[程序运行依赖环境详细说明](https://github.com/nl8590687/ASRT_SpeechRecognition/wiki/Dependent-Environment)
|
[程序运行依赖环境详细说明](https://github.com/nl8590687/ASRT_SpeechRecognition/wiki/Dependent-Environment)
|
||||||
|
|
||||||
## Data Sets 数据集
|
## Data Sets 数据集
|
||||||
|
|
||||||
|
[几个最新免费开源的中文语音数据集](https://blog.ailemon.me/2018/11/21/free-open-source-chinese-speech-datasets/)
|
||||||
|
|
||||||
* **清华大学THCHS30中文语音数据集**
|
* **清华大学THCHS30中文语音数据集**
|
||||||
|
|
||||||
data_thchs30.tgz
|
data_thchs30.tgz
|
||||||
|
|
18
README_EN.md
18
README_EN.md
|
@ -1,6 +1,9 @@
|
||||||
# A Deep-Learning-Based Chinese Speech Recognition System
|
# A Deep-Learning-Based Chinese Speech Recognition System
|
||||||
|
|
||||||
[![GPL-3.0 Licensed](https://img.shields.io/badge/License-GPL3.0-blue.svg?style=flat)](https://opensource.org/licenses/GPL-3.0) [![TensorFlow Version](https://img.shields.io/badge/Tensorflow-1.4+-blue.svg)](https://www.tensorflow.org/) [![Keras Version](https://img.shields.io/badge/Keras-2.0+-blue.svg)](https://keras.io/) [![Python Version](https://img.shields.io/badge/Python-3.x-blue.svg)](https://www.python.org/)
|
[![GPL-3.0 Licensed](https://img.shields.io/badge/License-GPL3.0-blue.svg?style=flat)](https://opensource.org/licenses/GPL-3.0)
|
||||||
|
[![TensorFlow Version](https://img.shields.io/badge/Tensorflow-1.13+-blue.svg)](https://www.tensorflow.org/)
|
||||||
|
[![Keras Version](https://img.shields.io/badge/Keras-2.2+-blue.svg)](https://keras.io/)
|
||||||
|
[![Python Version](https://img.shields.io/badge/Python-3.x-blue.svg)](https://www.python.org/)
|
||||||
|
|
||||||
**ReadMe Language** | [中文版](https://github.com/nl8590687/ASRT_SpeechRecognition/blob/master/README.md) | English |
|
**ReadMe Language** | [中文版](https://github.com/nl8590687/ASRT_SpeechRecognition/blob/master/README.md) | English |
|
||||||
|
|
||||||
|
@ -17,6 +20,12 @@ For questions about the principles of the statistical language model that are of
|
||||||
* [Simple word frequency statistics without Chinese word segmentation algorithm (Chinese)](https://blog.ailemon.me/2017/02/20/simple-words-frequency-statistic-without-segmentation-algorithm/)
|
* [Simple word frequency statistics without Chinese word segmentation algorithm (Chinese)](https://blog.ailemon.me/2017/02/20/simple-words-frequency-statistic-without-segmentation-algorithm/)
|
||||||
* [Statistical Language Model: Chinese Pinyin to Words (Chinese)](https://blog.ailemon.me/2017/04/27/statistical-language-model-chinese-pinyin-to-words/)
|
* [Statistical Language Model: Chinese Pinyin to Words (Chinese)](https://blog.ailemon.me/2017/04/27/statistical-language-model-chinese-pinyin-to-words/)
|
||||||
|
|
||||||
|
For questions about CTC, see:
|
||||||
|
|
||||||
|
* [[Translation] Sequence Modeling with CTC (Chinese)](<https://blog.ailemon.me/2019/07/18/sequence-modeling-with-ctc/>)
|
||||||
|
|
||||||
|
For more infomation please refer to author's blog website: [AILemon Blog](https://blog.ailemon.me/) (Chinese)
|
||||||
|
|
||||||
## Introduction
|
## Introduction
|
||||||
|
|
||||||
This project uses Keras, TensorFlow based on deep convolutional neural network and long-short memory neural network, attention mechanism and CTC to implement.
|
This project uses Keras, TensorFlow based on deep convolutional neural network and long-short memory neural network, attention mechanism and CTC to implement.
|
||||||
|
@ -102,8 +111,8 @@ However, as the current international and domestic teams can achieve 98%, the ac
|
||||||
## Python libraries that need importing
|
## Python libraries that need importing
|
||||||
|
|
||||||
* python_speech_features
|
* python_speech_features
|
||||||
* TensorFlow
|
* TensorFlow (1.13+)
|
||||||
* Keras
|
* Keras (2.2+)
|
||||||
* Numpy
|
* Numpy
|
||||||
* wave
|
* wave
|
||||||
* matplotlib
|
* matplotlib
|
||||||
|
@ -116,6 +125,9 @@ However, as the current international and domestic teams can achieve 98%, the ac
|
||||||
[Dependent Environment Details](https://github.com/nl8590687/ASRT_SpeechRecognition/wiki/Dependent-Environment)
|
[Dependent Environment Details](https://github.com/nl8590687/ASRT_SpeechRecognition/wiki/Dependent-Environment)
|
||||||
|
|
||||||
## Data Sets
|
## Data Sets
|
||||||
|
|
||||||
|
[Some free Chinese speech datasets (Chinese)](https://blog.ailemon.me/2018/11/21/free-open-source-chinese-speech-datasets/)
|
||||||
|
|
||||||
* **Tsinghua University THCHS30 Chinese voice data set**
|
* **Tsinghua University THCHS30 Chinese voice data set**
|
||||||
|
|
||||||
data_thchs30.tgz
|
data_thchs30.tgz
|
||||||
|
|
Loading…
Reference in New Issue