Tesseract OCR is a library and engine for optical character recognition. Version 4.0 adds a neural network recognizer, implemented as an LSTM engine, along with better support for training it. The Tesseract Wiki is a good place to start.
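As a rough illustration of selecting the LSTM engine, here is a minimal sketch that builds a command line for the `tesseract` CLI with the `--oem 1` (LSTM only) engine mode. The helper name `build_tesseract_command` and the file names are hypothetical, chosen for this example; it assumes Tesseract 4.x is installed and on the PATH if you actually run the command.

```python
from typing import List

# Engine modes accepted by Tesseract 4's --oem flag.
OEM_LEGACY = 0           # legacy engine only
OEM_LSTM = 1             # neural network (LSTM) engine only
OEM_LEGACY_AND_LSTM = 2  # both engines combined
OEM_DEFAULT = 3          # whatever is available


def build_tesseract_command(
    image_path: str,
    output_base: str,
    lang: str = "eng",
    oem: int = OEM_LSTM,
    psm: int = 3,
) -> List[str]:
    """Return an argument list for the tesseract CLI (hypothetical helper).

    output_base is the output file name without extension; tesseract
    appends ".txt" itself.
    """
    if oem not in (OEM_LEGACY, OEM_LSTM, OEM_LEGACY_AND_LSTM, OEM_DEFAULT):
        raise ValueError(f"unsupported engine mode: {oem}")
    return [
        "tesseract", image_path, output_base,
        "-l", lang,
        "--oem", str(oem),
        "--psm", str(psm),
    ]


if __name__ == "__main__":
    # Print the command; pass the list to subprocess.run(cmd, check=True)
    # to actually run OCR on a real image.
    print(" ".join(build_tesseract_command("scan.png", "scan")))
```

Building the argument list separately from running it keeps the sketch testable on a machine without Tesseract installed.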
DeepSpeech Speech Recognition. Machine Learning. These are notes on the project, which seems to me worth pursuing. Having recently watched a number of AWS re:Invent videos on Amazon's vision and language machine learning tools, I have ML-envy. Time to start a project, but while I wait for the Amazon Transcribe and Amazon Translate to […]
See also: Deep Speech, Tesseract. Recent Items on ML: IBM's Ginni Rometty gave a compelling keynote at CES on AI and answered a great set of questions on Bloomberg Technology. Harari's book 21 Lessons for the 21st Century has some interesting discussion of AI. One thing is that he has a tendency to […]