Add a task for automatic text recognition #455

PonteIneptique · 2024-01-25T10:44:04Z

Hi :)
We are in the process of working a pipeline to help people publish their data to huggingface in the context of HTR/OCR groundtruth and HTR-United, and have ourselves a fair amount of data.
I wonder if it could be possible to have a ATR (Automatic Text Recognition) or OCR/HTR (Optical Character Recognition / Handwritten Text Recognition) task to register our datasets under, instead of the quite broader Vision to Text, which seems more focused on image-description datasets ?
Thanks !

coyotte508 · 2024-01-25T10:45:58Z

cc @merveenoyan @osanseviero

osanseviero · 2024-01-25T17:17:14Z

cc @sanchit-gandhi and @Vaibhavs10 for our audio experts :)

Vaibhavs10 · 2024-01-25T17:30:27Z

This is more vision no?

PonteIneptique · 2024-01-25T17:44:36Z

This is more Vision than this is Text (although, depending and who you ask...) but I don't think that Multimodal > Vision-to-text is a good match for HTR/OCR/ATR

osanseviero · 2024-01-25T20:57:21Z

Sorry for my confusion, I read too quickly and did string matching with ASR 🥲

Yes, this is indeed vision, In the past, OCR models have been tagged as image-to-text such as in https://huggingface.co/microsoft/trocr-base-handwritten . I think potentially we could keep image-to-text + add a secondary subtype for this use case (either ocr or atr as suggested). WDYT @merveenoyan @NielsRogge @lhoestq ?

lhoestq · 2024-01-26T13:59:10Z

I'm ok to add a new task_id "ocr" or "optical-character-recognition" under "image-to-text"

merveenoyan · 2024-01-26T16:27:02Z

I agree with @lhoestq.

coyotte508 added the tasks @huggingface/tasks related label Jan 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a task for automatic text recognition #455

Add a task for automatic text recognition #455

PonteIneptique commented Jan 25, 2024

coyotte508 commented Jan 25, 2024

osanseviero commented Jan 25, 2024

Vaibhavs10 commented Jan 25, 2024

PonteIneptique commented Jan 25, 2024

osanseviero commented Jan 25, 2024

lhoestq commented Jan 26, 2024

merveenoyan commented Jan 26, 2024

Add a task for automatic text recognition #455

Add a task for automatic text recognition #455

Comments

PonteIneptique commented Jan 25, 2024

coyotte508 commented Jan 25, 2024

osanseviero commented Jan 25, 2024

Vaibhavs10 commented Jan 25, 2024

PonteIneptique commented Jan 25, 2024

osanseviero commented Jan 25, 2024

lhoestq commented Jan 26, 2024

merveenoyan commented Jan 26, 2024