Vinay Sulanki

Vinay Sulanki

  • NA
  • 7
  • 1.6k

Creating xls from PDF with tables using csharp

Oct 12 2011 1:47 AM
Hi All,

The scenario is I want to convert a PDF file to an xls file. The PDF contains texts and table structures. 

It is known that the pdf doesn't have the concept of tables or do have any tags relating to those which may help to figure out that the certain piece of text is in table.

Using ITextSharp we can fetch out individual text and its position information but how to get it with the table structure as the final outcome desired is an excel file. Even if I retrieved text with position information, again placing it in a datatable or to an intermediate html file doesn't seems easy.

Please help me find a suitable solution such as I can create a similar structure that of a PDF file to an xls file. The PDF in this case doesn't have any images, it is only texts and in some cases tables and text in it.

Any help would be really helping.

Answers (6)