Skip to content

Latest commit

 

History

History
24 lines (11 loc) · 2.4 KB

README.md

File metadata and controls

24 lines (11 loc) · 2.4 KB

Street Number Recognition

Nowadays object recognition on images is a very challenging area. While some problems like single character recognition are easy to solve, the others like recognizing particular person in a crowd or recognizing a sequence if characters still might be difficult. Last years huge amount of open data was published which pushed progress in this area forward.

This project focused on a number recognition problem, which is basically a sequence ofsymbols. In particular, we are going to implement algorithm which will recognize house number from images. Application will take as input a bunch of images and provide a house number which is presented on each image. One of the difficulties here might be that sometimes house numbers are written in some specific order. It might be written vertical or diagonal or something else. However, at the end solid application which recognizes numbers from images with high accuracy will be presented.

In this project we will use Street View House Numbers(SVHN)[1] dataset to train our application recognize numbers on images. SVHN is a real-world image dataset for developing machine learning and object recognition algorithms with minimal requirement on data preprocessing and formatting. The dataset is obtained from house numbers in Google Street View images. Image below represents several samples from this dataset.

This project is implemented in multiple steps -

Step 1: Design and Test a Convolutional Neural Network (CNN) Model Architecture

In this step, a deep learning model is designed and implemented to recognize sequences of digits. In order to keep input data simple, the model is trained using synthetic data generated by concatenating character images from notMNIST or MNIST.

Step 2: Train a CNN Model on a realistic dataset

Once Convolutional Neural Network (CNN) architecture is finalized in step 1, model is trained on real data. In particular, the Street View House Numbers (SVHN) dataset is a good large-scale dataset collected from house numbers in Google Street View. Training on this more challenging dataset, where the digits are not neatly lined-up and have various skews, fonts and colors, require tweaking of hyperparameters for model to perform well.

Step 3: Test the model on newly-captured Images

Several pictures of numbers were taken from real houses. Those images were cropped and run through the classifier to produce example results.