• Converting PDF formatted documents into Excel spreadsheets


    #2750238

    I am hoping to find outstanding PDF software that we can use to convert PDF formatted documents into Excel spreadsheets.

    We have been using Adobe Acrobat Standard 2020 for years, but its export to Excel was poor, if useful at all.

    Kofax Power PDF Standard (now Tungsten Power PDF) came bundled with our new Ricoh ScanSnap iX1600 scanner, and we gave it a try. It performed better than Adobe Acrobat but still left much to be desired.

    We have also tried to import data from PDFs using Excel's import data feature (Data, Get Data, From File, From PDF), but the process is at best awkward.

    We do not want to use one of the "online" conversion tools due to the sensitivity of the data we are working with.

    In addition, our preference is to find software that does not require a monthly or yearly subscription.

    So, can anyone with hands-on experience recommend a program that will do an outstanding job of converting .pdf files to Excel .xlsx files?

    • #2750244

      If the original document was created by Adobe (or another PDF engine) and not scanned by a scanner, the current subscription version of Adobe does a very good job of exporting.

      That is from hands-on, personal experience; however, Adobe is expensive.

      The key is how clean the base PDF is.

      Susan Bradley Patch Lady/Prudent patcher

      1 user thanked author for this post.
      • #2750245

        “In addition, our preference is to find software that does not require a monthly or yearly subscription.”

        Not possible (or near impossible) in this current business environment.

        Susan Bradley Patch Lady/Prudent patcher

        1 user thanked author for this post.
    • #2750249

      Kathy, I don’t have any PDF spreadsheets to try converting to Excel, but if you have one that you can share, I can try converting it to Excel (Office 2019, in case it matters).

      Not sure if converting a spreadsheet to PDF and then back to Excel would be the equivalent of what you seek to do. If it is, let me know and I can try that.

       

      1 user thanked author for this post.
      • #2750252

        Cybertooth

        I do not have any "PDF spreadsheets" that I can share with you.

        However, if you have some time, you may be able to make one using Microsoft Word.

        Type some text into a blank document, then insert a table containing random numbers, then type some more text and save as a PDF.

        Then try to convert the PDF document to Excel.

    • #2750250

      Susan

      We have been working with PDF credit card statement files.

      Some statements were downloaded from the credit card company's website; others are paper statements received in the mail and scanned with Adobe Acrobat Standard 2020.

      In both cases, pages that contain primarily spreadsheet-like layouts convert relatively well.

      It is the mixed-content pages that are giving us a problem.

      One option that I will try moving forward is to crop the mixed-content pages so that only the spreadsheet-like portion is available for export.
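
The same idea can be applied after text extraction instead of by cropping: keep only the lines that look like table rows and drop the surrounding prose. The following is a minimal sketch, not a tested workflow. It assumes the page text has already been extracted by some other tool, and that each transaction line starts with an MM/DD date and ends with an amount; the pattern, the function name `extract_transaction_rows`, and the sample text are all hypothetical and would need adjusting to a real statement.

```python
import re

# Assumed row shape (hypothetical): MM/DD date, description, amount at end of line.
ROW_PATTERN = re.compile(r"^(\d{2}/\d{2})\s+(.+?)\s+(-?\$?[\d,]+\.\d{2})$")


def extract_transaction_rows(page_text):
    """Return (date, description, amount) tuples for table-like lines only."""
    rows = []
    for line in page_text.splitlines():
        match = ROW_PATTERN.match(line.strip())
        if match:
            date, description, amount = match.groups()
            # Strip the currency symbol and thousands separators before converting.
            value = float(amount.replace("$", "").replace(",", ""))
            rows.append((date, description, value))
    return rows


# Made-up sample of a mixed-content page: prose lines plus three transaction rows.
sample_page = """\
Account summary for the period
Thank you for being a member.
03/02  GROCERY STORE #12   54.17
03/05  PAYMENT RECEIVED    -200.00
Questions? Call the number on your card.
03/09  HARDWARE SUPPLY     1,249.99
"""

rows = extract_transaction_rows(sample_page)
for row in rows:
    print(row)
```

Lines that do not match the assumed pattern are silently skipped, so the row count should be checked against the statement before relying on the result.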

      • #2750251

        It's a little bit old now and a bit handraulic, so it might not suit your workflow, but I often use Tabula (https://tabula.technology/). If you do try it, it would be interesting to hear what you think about it.

        1 user thanked author for this post.
    • #2750257

      Have you used Tabula to convert PDF documents to Excel?

      Yes, often. Unfortunately though, that's where the "handraulic" comment comes into play. Tabula runs locally via a browser. It can try to autodetect the tables, and you can also manually select the tables you want. It has a couple of different parsing techniques you can play with; if one doesn't work, the other may. It will show its results on the web page in a table that you can easily copy/paste into Excel.
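
If pasting directly into Excel is not convenient, the copied table can also be saved as a CSV file that Excel opens. The sketch below is only an illustration under assumptions: it assumes the copied text is tab-separated (as tables copied from a web page usually are), and the function name `tsv_to_csv` and the sample data are made up for the example.

```python
import csv
import io


def tsv_to_csv(tsv_text):
    """Convert tab-separated text (e.g. a table copied from a web page) to CSV text."""
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t")
    output = io.StringIO()
    writer = csv.writer(output, lineterminator="\n")
    for row in reader:
        # Skip blank lines that sometimes appear between copied rows.
        if any(cell.strip() for cell in row):
            writer.writerow([cell.strip() for cell in row])
    return output.getvalue()


# Made-up sample of copied text; note the comma inside one description.
copied = (
    "Date\tDescription\tAmount\n"
    "03/02\tGROCERY STORE, #12\t54.17\n"
    "\n"
    "03/05\tPAYMENT RECEIVED\t-200.00\n"
)

csv_text = tsv_to_csv(copied)
print(csv_text)
```

The `csv` module quotes any field that contains a comma, so descriptions with commas stay in one column. Writing `csv_text` to a file with a `.csv` extension gives something Excel can open locally, with no online service involved.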

      1 user thanked author for this post.
    • #2750299

      Just found an alternative to converting PDF credit card statements to Excel.

      I spoke with the tech staff at the credit card company, and after some hunting around their website we found that they post transactions going back 90 days online in Excel format.

      After further discussion, the tech said he would bring up the difficulty we were having with their statements and see if they will post the Excel formatted transactions going back more than 90 days.

      It is nice to bank with a local credit union.

    • #2750302

      I suggest trying FineReader for free at "Experience FineReader PDF products for free."

      It can recognize text, images and tables. You can edit what it produces, specifically select or convert areas to be treated as tables then re-recognize them and split or join rows and columns before conversion to a spreadsheet file.

      See also Robust features for your digital workplace, in particular:
      Create and convert PDFs:
      Take a digital-first approach by standardizing documentation in the PDF format and capitalizing on its advantages. Convert paper documents or files in multiple formats into searchable PDFs (compliant with ISO specifications) or convert PDFs into Microsoft Word, Excel, and more than 15 other formats to obtain full flexibility when editing and reusing them.

      HP Compaq 6000 Pro SFF PC / Windows 10 Pro / 22H2
      Intel® Core™2 "Wolfdale" E8400 3.0 GHz / 8.00 GB

      HP ProDesk 400 G5 SFF PC / Windows 11 Pro / 23H2
      Intel® Core™ "Coffee Lake" i3-8100 3.6 GHz / 16.00 GB
      1 user thanked author for this post.
    • #2750409

      PDFGear is free to download and use; download it at https://www.pdfgear.com/

      Install it, open your .pdf file, then go to Tools, Convert, PDF to Excel. It works.

      1 user thanked author for this post.
    • #2750489

      In this topic I was hoping to find some outstanding PDF software that we can use to transfer PDF tabular data onto Excel spreadsheets. In undertaking the task, our primary consideration was creating accurate and properly formatted Excel spreadsheets from PDF documents.

      While I was waiting on responses, I tried an alternative approach using our old standby, Adobe Acrobat Standard 2020. And it worked! Perfect transfers of PDF tables to Microsoft Excel sheets.

      The approach used involved:

      • Opening the document in Adobe Acrobat Standard 2020,
      • Clicking the Scan & OCR option,
      • Selecting Recognize Text and choosing In This File,
      • Clicking on Recognize Text (in the task bar) and Adobe scanned and converted the document,
      • Then clicking on Export PDF,
      • Choosing Export your PDF to Microsoft Word,
      • Saving and opening the resulting Word document,
      • Blocking and copying the tabular data I wanted to move into Excel,
      • Opening a blank Excel sheet,
      • Clicking into cell A1,
      • Clicking the right mouse button and selecting Keep Source Formatting, and
      • Presto chango: the data that was blocked and copied from the PDF document appeared correctly and properly formatted on the Excel sheet.

      My search for a way to transfer tabular data from a PDF document to an Excel sheet is over.

      1 user thanked author for this post.
      • #2750496

        Have you tried just opening the PDF with Word directly? I can download a PDF bank statement, open it in Word, select the actual bank statement ledger, and paste it into Excel. It displays perfectly, and all I have to do is widen some of the columns to display all the data. Even the withdrawal amounts show up in red as negative numbers and the deposits as black positive numbers.

        Fits easily in my Excel accounting workbook.

        HTH, Dana:))

        1 user thanked author for this post.
        • #2750575

          Discard:))

          I have tried opening PDF format credit card statements with Word 2021.

          After it runs for a few minutes, I get the message, "The Kofax Convert Assistant has successfully converted your PDF/XPS file into Microsoft Word format."

          But the results shown on the screen are useless.

          • #2750665

            This is not a viable solution for your situation, but there is a workaround when this occurs. Often the extra coding in some PDF files prevents Word's PDF converter from rendering them correctly. This extra coding usually has nothing to do with the data displayed in the PDF.

            Open the PDF in your default PDF viewer (I use Chrome). Print the PDF using the Microsoft Print to PDF function. This saves a new PDF without the extra underlying code. When you open this "printed" PDF file with Word, most of the time it displays properly, and any part can be copied and entered into Excel or edited.

            "Discard:))" Does that refer to me or my comments?

            HTH, Dana:))

            2 users thanked author for this post.
            • #2750703

              Discard:))

              I used the approach you outlined above and it worked.

              Now I have an alternative to Adobe Acrobat for copying and pasting tables in PDF documents to Excel.

              Thank you
