Support > Forums > OpenBots Studio > Extract Invoice

Extract Invoice

Harshita Aralikatte Ramesh

  • Reporter
  • Calender Icon December 13,2022 at 5:07 PM

We are trying to extract information from a pdf which was saved in a folder and save the information in an excel file. What are the methods to do this ?

This forum has 314 topics, 681 replies, and was last updated 23 days ago by Support Agent

Harshita Aralik...

  • Participant
  • Calender Icon December 14,2022 at 3:50 PM

@Rushabh Arbat

Thanks for your reply I will try this

Rushabh Arbat

  • Participant
  • Calender Icon December 14,2022 at 3:48 PM

Hi Harshita Aralikatte Ramesh,

To extract information or text from a pdf we have a following commands which is located under Commands -> Data Manipulation Command -> PDF

Get PDF Text Command - this command extracts all the text from PDF file. Get PDF Form Field Value - this command gets the value of specified form PDF field. Get PDF Images - this command reads all the images from PDF file.

After text is extracted, you can use string manipulation commands or C# code snippets (like Regex or substring) to get the targeted text and use it as input for the Excel commands.

Also, you can find the PDF_TextExtraction Automation sample if you navigate to Library -> Samples - > PDF_TextExtraction Automation To run the sample script, make sure to update the vInputsPath variable with your corresponding Samples input path:

Thanks

You are not authorized to reply, please click here to login and add your reply to this topic.