Computer VisionBrowser-basedTesseract.jsOCRWord and line boxesStudent lab

OCR Studio

Extract text from uploaded images, scanned notes, screenshots, or posters using Tesseract.js, then inspect the recognized text and optional word or line boxes.

This page teaches how document analysis combines image preprocessing with text recognition. Students can adjust grayscale, thresholding, contrast, and scale-up settings, then compare the extracted text with the visual text regions found by the OCR engine.

How OCR fits computer vision

OCR combines image preprocessing with text recognition. Before the recognizer can read letters, the image often needs contrast adjustment, grayscale conversion, thresholding, or scaling.

What this page shows

Students can upload notes, screenshots, posters, or scanned pages, then compare the extracted text with optional word or line boxes drawn over the original image.

Why bounding boxes matter

Document analysis is not only about the final text string. Word and line boxes reveal where the recognizer found text regions and make OCR errors much easier to diagnose.

Upload image

Status: Upload an image to start OCR.
Image: No image selected yet

Upload an image to inspect OCR output and detected text regions.

Visual Overlay

Word boxes are useful when students want to inspect local OCR mistakes. Line boxes are useful when students want to study document layout and reading order.

Preprocessing Controls

GrayscaleThresholdThreshold value140Contrast20Scale up 2x before OCR

OCR Results

Extracted text will appear here after OCR runs.

Stats

Words

Lines

Characters

Avg confidence

—

Teaching Notes

Grayscale and contrast help separate text from the background before recognition begins.
Thresholding can help clean high-contrast receipts or scans, but it can also destroy faint handwriting if used too aggressively.
Scaling up small images often improves OCR because letter shapes become easier for the recognizer to distinguish.
Bounding boxes help students compare recognition output with the actual text layout on the page.