OCR is picky about file format

Currently the UI says:

> File names must match the form testing_ocr-{sequence number} where sequence number can be any sequence of digits followed by letters (a-z,A-Z) or _.
> 
> Accepted images types: tiff and jpg.
> 
> Files larger than 10MB are not able to be processed with OCR

I've had `.jpg` files fail pretty consistently, there may be specific restrictions on encoding, and I've had a 21MB `.tiff` file work perfectly; its possible that Textract has changed since this was written.

Also uploading a PNG is allowed, the only thing that happens is the OCR usually fails (I think I remember it working with a PNG a couple of times; but I would need to double check). If PNG is really not supported we should stop people from picking them in the file picker and throw up an error for drag-n-drop.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

OCR is picky about file format #247

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

OCR is picky about file format #247

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions