Automic Workload Automation

 View Only
  • 1.  OCR is possibile in Automic

    Posted Nov 19, 2019 05:07 AM
    Hi All,
    I need to read a pdf file that contains a document scan.
    In Automic is it possible to integrate the OCR functionality?
    Is there a plugin that can perform this scan?

    Thanks a lot!

    Claudia


  • 2.  RE: OCR is possibile in Automic
    Best Answer

    Posted Nov 19, 2019 07:06 AM
    ​Hi.

    There is no built-in OCR in Automic and I don't know of any plugins by Broadcom.

    You can launch any OCR software that has a CLI interface from Automic. A quick Google search seems to indicate that there are several packages, commercial or free, that have CLI interfaces.

    Br,


  • 3.  RE: OCR is possibile in Automic

    Posted Nov 19, 2019 07:20 AM
    Thank you Carsten!

    Claudia


  • 4.  RE: OCR is possibile in Automic

    Posted Nov 19, 2019 07:12 AM
    ​By the way, why PDF?

    Documents to be OCR-treated usually come in the form of image formats, such as JPG or TIFF.

    While PDF can also contain "untreated" text as an image, if your document is a PDF, does it possibly contain electronically legible text already? Can you select (as in copy/paste) the text in it in a PDF viewer such as Acrobat Reader? In that case, you'd not need OCR software but a PDF converter, which can output the desired format. Those also exist for the command line.

    Br,
    Carsten


  • 5.  RE: OCR is possibile in Automic

    Posted Nov 19, 2019 08:45 AM
    Hi Carsten,
    this PDF file does not contain text, but the scanning of an image captured by a web form..
    Thanks for your advice!

    Claudia



  • 6.  RE: OCR is possibile in Automic

    Posted Nov 20, 2019 06:43 AM
    Sorry for the possibly silly question, but why or for what do you need an OCR Software for an image in a PDF file?

    Wouldn't it be much easier storing a JPG or any other picture file instead of converting to PDF and converting again?

    cheers, Wolfgang

    ------------------------------
    Support Info:
    if you are using one of the latest version of UC4 / AWA / One Automation please get in contact with Support to open a ticket.
    Otherwise update/upgrade your system and check if the problem still exists.
    ------------------------------



  • 7.  RE: OCR is possibile in Automic

    Posted Nov 20, 2019 07:12 AM
    Edited by Carsten Schmitz Nov 20, 2019 07:12 AM
    Well, PDF is also an image container.​ Could for example be a file from a multi function copier, those often times send a pdf with graphics in it when you use it to scan an image (or text) and email it somewhere.

    Best,
    Carsten

    ------------------------------
    I will not respond to PM asking for help unless there's an actual reason to keep the discussion off of the public forums.
    ------------------------------