Math OCR

July, 2015

In 2015, CNN-based OCR was in its infancy, and traditional tools for OCR relied on there being a single baseline for the target text. I developed some strategies for performing OCR on mathematical expressions using a combination of machine learning for character recognition with more traditional approaches to put those characters together into a larger expression.

My work was eventually turned into US Patent 20170091597A1.