18 Application Example: Photo OCR

Problem Description and Pipeline

  1. Text detection
  2. Character segmentation
  3. Character classification

Sliding Windows

Go out and collect large training sets of positive and negative examples.

Take that green rectangle and we slide it over a bit and then run that new image patch through our classifier to decide if there’s a pedestrian there.

Getting Lots of Data and Artificial Data

  1. artificial data synthesis
  2. just collect the data and you label it yourself
  3. crowd sourcing

Ceiling Analysis What Part of the Pipeline to Work on Next

Where should you allocate resources?
Which of these boxes is most worth your efforts, trying to improve the performance of.

  1. choose one module
  2. provide it the correct text detection outputs
  3. And then, use the same evaluation metric as before, to measure what is the overall accuracy of the entire system
  4. go to step 1, but choose another one

Leave a Reply

Your email address will not be published. Required fields are marked *