Extracting table data from PDF

by Wolfgang Schemmel (‎Perleone‎) (Hannover.pm, Erlangen.pm)

Extracting table data from PDF aimed at Beginner and is held in English. This talk starts on 2016-03-11 at 11:20 for 20 minutes. It takes place at the DATEV.

Transforming a table (for example a spreadsheet) into PDF is trivial, usually you just have to print it as PDF. Getting the same data out of the PDF in a structured, reusable way is quite another story. Using a real-life example (cafeteria menu) I will try to not end up with a jumbled mess of a word salad.

Slides: http://perleone.github.io/talks/extracting.table.data.from.pdf.gpw2016.pdf


Tags: Tags: extraction pdf table

Interest in attending: